Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Last update: Jan 02, 2023

Overview

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Exploring Cross-Image Pixel Contrast for Semantic Segmentation,
Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu and Luc Van Gool
arXiv technical report (arXiv 2101.11939)

Abstract

Current semantic segmentation methods focus only on mining “local” context, i.e., dependencies between pixels within individual images, by context-aggregation modules (e.g., dilated convolution, neural attention) or structureaware optimization criteria (e.g., IoU-like loss). However, they ignore “global” context of the training data, i.e., rich semantic relations between pixels across different images. Inspired by the recent advance in unsupervised contrastive representation learning, we propose a pixel-wise contrastive framework for semantic segmentation in the fully supervised setting. The core idea is to enforce pixel embeddings belonging to a same semantic class to be more similar than embeddings from different classes. It raises a pixel-wise metric learning paradigm for semantic segmentation, by explicitly exploring the structures of labeled pixels, which are long ignored in the field. Our method can be effortlessly incorporated into existing segmentation frameworks without extra overhead during testing.

We experimentally show that, with famous segmentation models (i.e., DeepLabV3, HRNet, OCR) and backbones (i.e., ResNet, HRNet), our method brings consistent performance improvements across diverse datasets (i.e., Cityscapes, PASCALContext, COCO-Stuff).

Installation

This implementation is built on openseg.pytorch. Many thanks to the authors for the efforts.

Please follow the Getting Started for installation and dataset preparation.

Running

Cityscapes

Train DeepLabV3

bash scripts/cityscapes/deeplab/run_r_101_d_8_deeplabv3_train_contrast.sh train 'resnet101-deeplabv3-contrast'

Features (in progress)

t-SNE Visualization

Pixel-wise Cross-Entropy Loss

Pixel-wise Contrastive Learning Objective

Citation

@article{wang2021exploring,
  title   = {Exploring Cross-Image Pixel Contrast for Semantic Segmentation},
  author  = {Wang, Wenguan and Zhou, Tianfei and Yu, Fisher and Dai, Jifeng and Konukoglu, Ender and Van Gool, Luc},
  journal = {arXiv preprint arXiv:2101.11939},
  year    = {2021}
}

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Related tags

Overview

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Abstract

Installation

Running

Cityscapes

Features (in progress)

t-SNE Visualization

Citation

Owner

Tianfei Zhou

Pytorch-3dunet - 3D U-Net model for volumetric semantic segmentation written in pytorch

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

A new version of the CIDACS-RL linkage tool suitable to a cluster computing environment.

Instance-conditional Knowledge Distillation for Object Detection

AFLFast (extends AFL with Power Schedules)

Jigsaw Rate Severity of Toxic Comments

Rest API Written In Python To Classify NSFW Images.

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Discovering Interpretable GAN Controls [NeurIPS 2020]

Python library for science observations from the James Webb Space Telescope

WarpRNNT loss ported in Numba CPU/CUDA for Pytorch

A modular domain adaptation library written in PyTorch.

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

Focal and Global Knowledge Distillation for Detectors

Instance-wise Occlusion and Depth Orders in Natural Scenes (CVPR 2022)

Deep Learning Interviews book: Hundreds of fully solved job interview questions from a wide range of key topics in AI.

This repository contains the implementation of the paper: "Towards Frequency-Based Explanation for Robust CNN"

A Keras implementation of CapsNet in the paper: Sara Sabour, Nicholas Frosst, Geoffrey E Hinton. Dynamic Routing Between Capsules

[NeurIPS'21] Projected GANs Converge Faster