An official implementation of "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation" (ICCV 2021) in PyTorch.

Last update: Oct 26, 2022

Related tags

Overview

Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation

This is an official implementation of the paper "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation", accepted to ICCV2021.

For more information, please checkout the project site [website] and the paper [arXiv].

Pre-requisites

This repository uses the following libraries:

Python (3.6)
Pytorch (1.8.1)

Getting Started

Datasets

VOC

The structure of data path should be organized as follows:

/dataset/PASCALVOC/VOCdevkit/VOC2012/                         % Pascal VOC datasets root
/dataset/PASCALVOC/VOCdevkit/VOC2012/JPEGImages/              % Pascal VOC images
/dataset/PASCALVOC/VOCdevkit/VOC2012/SegmentationClass/       % Pascal VOC segmentation maps
/dataset/PASCALVOC/VOCdevkit/VOC2012/ImageSets/Segmentation/  % Pascal VOC splits

CONTEXT

The structure of data path should be organized as follows:

/dataset/context/                                 % Pascal CONTEXT dataset root
/dataset/context/59_labels.pth                    % Pascal CONTEXT segmentation maps
/dataset/context/pascal_context_train.txt         % Pascal CONTEXT splits
/dataset/context/pascal_context_val.txt           % Pascal CONTEXT splits
/dataset/PASCALVOC/VOCdevkit/VOC2012/JPEGImages/  % Pascal VOC images

Training

We use DeepLabV3+ with ResNet-101 as our visual encoder. Following ZS3Net, ResNet-101 is initialized with the pre-trained weights for ImageNet classification, where training samples of seen classes are used only. (weights here)

VOC

python train_pascal_zs3setting.py -c configs/config_pascal_zs3setting.json -d 0,1,2,3

Trained visual and semantic encoder weights

CONTEXT

python train_context_zs3setting.py -c configs/config_context_zs3setting.json -d 0,1,2,3

Trained visual and semantic encoder weights

Testing

VOC

python train_pascal_zs3setting.py -c configs/config_pascal_zs3setting.json -d 0,1,2,3 -r <visual encoder>.pth --test

CONTEXT

python train_pascal_zs3setting.py -c configs/config_pascal_zs3setting.json -d 0,1,2,3 -r <visual encoder>.pth --test

Acknowledgements

This template is borrowed from pytorch-template.

You might also like...

Official implementation of NPMs: Neural Parametric Models for 3D Deformable Shapes - ICCV 2021

NPMs: Neural Parametric Models Project Page | Paper | ArXiv | Video NPMs: Neural Parametric Models for 3D Deformable Shapes Pablo Palafox, Aljaz Bozic

109 Nov 22, 2022

Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.

Vision Transformer with Progressive Sampling This is the official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.

123 Jan 1, 2023

Official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence".

The DETR approach applies the transformer encoder and decoder architecture to object detection and achieves promising performance. In this paper, we handle the critical issue, slow training convergence, and present a conditional cross-attention mechanism for fast DETR training. Our approach is motivated by that the cross-attention in DETR relies highly on the content embeddings and that the spatial embeddings make minor contributions, increasing the need for high-quality content embeddings and thus increasing the training difficulty.

281 Dec 30, 2022

The Official Implementation of the ICCV-2021 Paper: Semantically Coherent Out-of-Distribution Detection.

SCOOD-UDG (ICCV 2021) This repository is the official implementation of the paper: Semantically Coherent Out-of-Distribution Detection Jingkang Yang,

62 Nov 21, 2022

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

The Power of Points for Modeling Humans in Clothing (ICCV 2021) This repository contains the official PyTorch implementation of the ICCV 2021 paper: T

158 Nov 24, 2022

Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"

JOINT This is the official implementation of Joint Inductive and Transductive learning for Video Object Segmentation, to appear in ICCV 2021. @inproce

35 Oct 16, 2022

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

KPAC: Kernel-Sharing Parallel Atrous Convolutional block This repository contains the official Tensorflow implementation of the following paper: Singl

50 Dec 29, 2022

Official implementation of Protected Attribute Suppression System, ICCV 2021

6 Jan 1, 2023

Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization' (ICCV-21 Oral)

Learning-Action-Completeness-from-Points Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal A

67 Jan 3, 2023

Comments

datasets

Thank you for your work～

self._cat_dir = self._base_dir / ("%d_labels.pth" % (self.n_categories))

Could you tell me how to generate the "59_labels.pth" file of the context dataset?

opened by Wangyiqi 1
train_aug.txt

Dear Authors,

When I run your code, there is an error:

FileNotFoundError: [Errno 2] No such file or directory: 'dataset/PASCALVOC/VOCdevkit/VOC2012/ImageSets/Segmentation/train_aug.txt'

Could you tell me how to get train_aug.txt?

opened by AmingWu 1
dataset split

After introducing the SBD (Semantic Boundary Dataset), what kind of split (train_split and test_split include how many images ) is adopted by this paper?

opened by zaiquanyang 0

Releases(v1.0)

v1.0(Aug 11, 2021)

Source code(tar.gz)
Source code(zip)
59_labels.pth(1725.75 MB)
context_zs3_unseen_02.zip(211.84 MB)
context_zs3_unseen_04.zip(211.74 MB)
context_zs3_unseen_06.zip(211.76 MB)
context_zs3_unseen_08.zip(211.76 MB)
context_zs3_unseen_10.zip(211.76 MB)
pascal_context_train.txt(331.89 KB)
pascal_context_val.txt(339.00 KB)
resnet_backbone_pretrained_imagenet_wo_pascalvoc.pth(170.44 MB)
train_aug.txt(682.04 KB)
voc_zs3_unseen_02.zip(211.59 MB)
voc_zs3_unseen_04.zip(211.54 MB)
voc_zs3_unseen_06.zip(211.54 MB)
voc_zs3_unseen_08.zip(211.51 MB)
voc_zs3_unseen_10.zip(211.51 MB)

Owner

CV Lab @ Yonsei University

GitHub Repository

some academic posters as references. May we have in-person poster session soon!

472 Jan 06, 2023

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

WIMP - What If Motion Predictor Reference PyTorch Implementation for What If Motion Prediction [PDF] [Dynamic Visualizations] Setup Requirements The W

96 Dec 29, 2022

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Model This repository is the official PyTorch implementation of GraphRNN, a graph gene

568 Dec 29, 2022

Vision Transformer for 3D medical image registration (Pytorch).

ViT-V-Net: Vision Transformer for Volumetric Medical Image Registration keywords: vision transformer, convolutional neural networks, image registratio

192 Dec 20, 2022

[CVPR 2022] Official code for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration"

MDCA Calibration This is the official PyTorch implementation for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved

21 Dec 22, 2022

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space by Quande Liu, Cheng Chen, Ji

178 Jan 06, 2023

An official implementation of "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation" (ICCV 2021) in PyTorch.

Related tags

Overview

Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation

Pre-requisites

Getting Started

Datasets

VOC

CONTEXT

Training

VOC

CONTEXT

Testing

VOC

CONTEXT

Acknowledgements

You might also like...

Official implementation of NPMs: Neural Parametric Models for 3D Deformable Shapes - ICCV 2021

Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.

Official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence".

The Official Implementation of the ICCV-2021 Paper: Semantically Coherent Out-of-Distribution Detection.

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

Official implementation of Protected Attribute Suppression System, ICCV 2021

Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization' (ICCV-21 Oral)

Comments

datasets

train_aug.txt

dataset split

Releases(v1.0)

v1.0(Aug 11, 2021)

Owner

CV Lab @ Yonsei University

some academic posters as references. May we have in-person poster session soon!

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

Vision Transformer for 3D medical image registration (Pytorch).

[CVPR 2022] Official code for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration"

Some useful blender add-ons for SMPL skeleton's poses and global translation.

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

style mixing for animation face

Semantic Image Synthesis with SPADE

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

[ICCV'21] NEAT: Neural Attention Fields for End-to-End Autonomous Driving

Explainable Medical ImageSegmentation via GenerativeAdversarial Networks andLayer-wise Relevance Propagation

Deep Learning applied to Integral data analysis

DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion

Meshed-Memory Transformer for Image Captioning. CVPR 2020

以孤立语假设和宽度优先搜索为基础，构建了一种多通道堆叠注意力Transformer结构的斗地主ai

Real-Time Seizure Detection using EEG: A Comprehensive Comparison of Recent Approaches under a Realistic Setting

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

Transfer SemanticKITTI labeles into other dataset/sensor formats.

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space