[PAMI 2020] Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Last update: Nov 25, 2022

Related tags

Overview

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

This repository contains the source code for the paper Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation.

Abstract

We present an approach for jointly matching and segmenting object instances of the same category within a collection of images. In contrast to existing algorithms that tackle the tasks of semantic matching and object co-segmentation in isolation, our method exploits the complementary nature of the two tasks. The key insights of our method are two-fold. First, the estimated dense correspondence fields from semantic matching provide supervision for object co-segmentation by enforcing consistency between the predicted masks from a pair of images. Second, the predicted object masks from object co-segmentation in turn allow us to reduce the adverse effects due to background clutters for improving semantic matching. Our model is end-to-end trainable and does not require supervision from manually annotated correspondences and object masks. We validate the efficacy of our approach on five benchmark datasets: TSS, Internet, PF-PASCAL, PF-WILLOW, and SPair-71k, and show that our algorithm performs favorably against the state-of-the-art methods on both semantic matching and object co-segmentation tasks.

Citation

If you find our code useful, please consider citing our work using the following bibtex:

@article{MaCoSNet,
    title={Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation},
    author={Chen, Yun-Chun and Lin, Yen-Yu and Yang, Ming-Hsuan and Huang, Jia-Bin},
    journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
    year={2020}
}

@inproceedings{WeakMatchNet,
  title={Deep Semantic Matching with Foreground Detection and Cycle-Consistency},
  author={Chen, Yun-Chun and Huang, Po-Hsiang and Yu, Li-Yu and Huang, Jia-Bin and Yang, Ming-Hsuan and Lin, Yen-Yu},
  booktitle={Asian Conference on Computer Vision (ACCV)},
  year={2018}
}

Environment

Install Anaconda Python3.7
This code is tested on NVIDIA V100 GPU with 16GB memory

pip install -r requirements.txt

Dataset

Please download the PF-PASCAL, PF-WILLOW, SPair-71k, TSS, and Internet datasets
Please modify the variable DATASET_DIR in config.py
Please modify the variable CSV_DIR in config.py

Training

You may determine which dataset to be the training set by changing the $DATASET variable in train.sh
You may change the $BATCH_SIZE variable in train.sh to a suitable value based on the GPU memory
The trained model will be saved under the trained_models folder

sh train.sh

Evaluation

You may determine which dataset to be evaluated by changing the $DATASET variable in eval.sh
You may change the $BATCH_SIZE variable in eval.sh to a suitable value based on the GPU memory

sh eval.sh

Acknowledgement

This code is heavily borrowed from Rocco et al.

[PAMI 2020] Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Related tags

Overview

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Abstract

Citation

Environment

Dataset

Training

Evaluation

Acknowledgement

Owner

Yun-Chun Chen

Implementation of "A MLP-like Architecture for Dense Prediction"

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

ICLR 2021, Fair Mixup: Fairness via Interpolation

Methods to get the probability of a changepoint in a time series.

Image inpainting using Gaussian Mixture Models

OpenDILab Multi-Agent Environment

Chess reinforcement learning by AlphaGo Zero methods.

Optimizing Value-at-Risk and Conditional Value-at-Risk of Black Box Functions with Lacing Values (LV)

Reinforcement learning models in ViZDoom environment

Non-Imaging Transient Reconstruction And TEmporal Search (NITRATES)

Implementation of SegNet: A Deep Convolutional Encoder-Decoder Architecture for Semantic Pixel-Wise Labelling

Code for our paper "MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction" published at ICCV 2021.

MoveNet Single Pose on OpenVINO

Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge.

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

2021-MICCAI-Progressively Normalized Self-Attention Network for Video Polyp Segmentation

Pytorch implementation of the paper Time-series Generative Adversarial Networks

A certifiable defense against adversarial examples by training neural networks to be provably robust

Official DGL implementation of "Rethinking High-order Graph Convolutional Networks"