DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Last update: Dec 22, 2022

Overview

[CVPR19] DeepCO³: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Authors: Kuang-Jui Hsu, Yen-Yu Lin, Yung-Yu Chuang

PDF: High-Resolution, Low-Resolution
Supplementary material: High-Resolution, Low-Resolution

Abstract

In this paper, we address a new task called instance cosegmentation. Given a set of images jointly covering object instances of a specific category, instance co-segmentation aims to identify all of these instances and segment each of them, i.e. generating one mask for each instance. This task is important since instance-level segmentation is preferable for humans and many vision applications. It is also challenging because no pixel-wise annotated training data are available and the number of instances in each image is unknown. We solve this task by dividing it into two sub-tasks, co-peak search and instance mask segmentation. In the former sub-task, we develop a CNN-based network to detect the co-peaks as well as co-saliency maps for a pair of images. A co-peak has two endpoints, one in each image, that are local maxima in the response maps and similar to each other. Thereby, the two endpoints are potentially covered by a pair of instances of the same category. In the latter subtask, we design a ranking function that takes the detected co-peaks and co-saliency maps as inputs and can select the object proposals to produce the final results. Our method for instance co-segmentation and its variant for object colocalization are evaluated on four datasets, and achieve favorable performance against the state-of-the-art methods.

Examples

Two examples of instance co-segmentation on categories bird and sheep, respectively. An instance here refers to an object appearing in an image. In each example, the top row gives the input images while the bottom row shows the instances segmented by our method. The instance-specific coloring indicates that our method produces a segmentation mask for each instance.

Overview of our method

The proposed method contains two stages, co-peak search within the blue-shaded background and instance mask segmentation within the red-shaded background. For searching co-peaks in a pair of images, our model extracts image features, estimates their co-saliency maps, and performs feature correlation for co-peak localization. The model is optimized by three losses, including the co-peak loss, the affinity loss, and the saliency loss. For instance mask segmentation, we design a ranking function taking the detected co-peaks, the co-saliency maps, and the object proposals as inputs, and select the top-ranked proposal for each detected instance.

Results

Instance co-segmentation

The performance of instance co-segmentation on the four collected datasets is shown. The numbers in red and green show the best and the second best results, respectively. The column “trained” indicates whether additional training data are used.

Object co-localization

The performance of object co-localization on the four datasets is shown. The numbers in red and green indicate the best and the second best results, respectively. The column “trained” indicates whether additional training data are used.

Please cite our paper if this code is useful for your research.


@inproceedings{HsuCVPR19,
  author = {Kuang-Jui Hsu and Yen-Yu Lin and Yung-Yu Chuang},
  booktitle = {IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)},
  title = {DeepCO$^3$: Deep Instance Co-segmentation by Co-peak Search and Co-saliency Detection},
  year = {2019}
}

Codes for DeepCO³

Contact: Kuang-Jui Hsu
Last update: 2019/04/09
Platform: Ubuntu 14.04, MatConvnet 1.0-beta24 (Don't support any installation problem of MatConvnet.)

Demo for all stages: "RunDeepInstCoseg.m"

Including all files in "Lib" (Downloading MatConvnet is not necessary)
May be slightly different from the ones in paper because of the randdom seeds

Datasets (about 34 GB):

Including four collected datasets
Containing the images, ground-truth masks, salinecy maps and object proposals
GoogleDrive

Results reported in the papers (about 4 GB):

Only including the results of DeepCO³
GoogleDrive

Download Codes from GoogleDrive :

GoogleDrive

Errata:

Thank Howard Yu-Chun Lo for pointing the typo in Eq. (4). The corrected one is listed in the following:

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Related tags

Overview

[CVPR19] DeepCO³: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Authors: Kuang-Jui Hsu, Yen-Yu Lin, Yung-Yu Chuang

Abstract

Examples

Overview of our method

Results

Codes for DeepCO³

Demo for all stages: "RunDeepInstCoseg.m"

Datasets (about 34 GB):

Results reported in the papers (about 4 GB):

Download Codes from GoogleDrive :

Errata:

Owner

Kuang-Jui Hsu

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

Official Implementation of "Designing an Encoder for StyleGAN Image Manipulation"

SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

Unofficial implementation of Point-Unet: A Context-Aware Point-Based Neural Network for Volumetric Segmentation

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

TensorFlow ROCm port

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Large dataset storage format for Pytorch

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

🏖 Keras Implementation of Painting outside the box

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

Synthesize photos from PhotoDNA using machine learning 🌱

Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing

This is just a funny project that we want to see AutoEncoder (AE) can actually work to enhance the features we want

Finetune alexnet with tensorflow - Code for finetuning AlexNet in TensorFlow >= 1.2rc0

A robust pointcloud registration pipeline based on correlation.

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Related tags

Overview

[CVPR19] DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Authors: Kuang-Jui Hsu, Yen-Yu Lin, Yung-Yu Chuang

Abstract

Examples

Overview of our method

Results

Codes for DeepCO3

Demo for all stages: "RunDeepInstCoseg.m"

Datasets (about 34 GB):

Results reported in the papers (about 4 GB):

Download Codes from GoogleDrive :

Errata:

Owner

Kuang-Jui Hsu

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

Official Implementation of "Designing an Encoder for StyleGAN Image Manipulation"

SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

Unofficial implementation of Point-Unet: A Context-Aware Point-Based Neural Network for Volumetric Segmentation

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

TensorFlow ROCm port

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Large dataset storage format for Pytorch

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

🏖 Keras Implementation of Painting outside the box

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

Synthesize photos from PhotoDNA using machine learning 🌱

Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing

This is just a funny project that we want to see AutoEncoder (AE) can actually work to enhance the features we want

Finetune alexnet with tensorflow - Code for finetuning AlexNet in TensorFlow >= 1.2rc0

A robust pointcloud registration pipeline based on correlation.

[CVPR19] DeepCO³: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral paper)

Codes for DeepCO³

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.