Official implementation of CATs: Cost Aggregation Transformers for Visual Correspondence NeurIPS'21

Last update: Jan 04, 2023

Overview

CATs: Cost Aggregation Transformers for Visual Correspondence NeurIPS'21

For more information, check out the paper on [arXiv].

Training with different backbones and evaluations of them are to be updated soon..

Check out our new paper! [arXiv]

Network

Our model CATs is illustrated below:

Environment Settings

git clone https://github.com/SunghwanHong/CATs
cd CATs

conda create -n CATs python=3.6
conda activate CATs

pip install torch==1.8.0+cu111 torchvision==0.9.0+cu111 torchaudio==0.8.0 -f https://download.pytorch.org/whl/torch_stable.html
pip install -U scikit-image
pip install git+https://github.com/albumentations-team/albumentations
pip install tensorboardX termcolor timm tqdm requests pandas

Evaluation

Download pre-trained weights on Link
All datasets are automatically downloaded into directory specified by argument datapath

Result on SPair-71k: (PCK 49.9%)

  python test.py --pretrained "/path_to_pretrained_model/spair" --benchmark spair

Result on SPair-71k, feature backbone frozen: (PCK 42.4%)

  python test.py --pretrained "/path_to_pretrained_model/spair_frozen" --benchmark spair

Results on PF-PASCAL: (PCK 75.4%, 92.6%, 96.4%)

  python test.py --pretrained "/path_to_pretrained_model/pfpascal" --benchmark pfpascal

Results on PF-PACAL, feature backbone frozen: (PCK 67.5%, 89.1%, 94.9%)

  python test.py --pretrained "/path_to_pretrained_model/pfpascal_frozen" --benchmark pfpascal

Acknowledgement

We borrow code from public projects (huge thanks to all the projects). We mainly borrow code from DHPF and GLU-Net.

BibTeX

If you find this research useful, please consider citing:

@inproceedings{cho2021cats,
  title={CATs: Cost Aggregation Transformers for Visual Correspondence},
  author={Cho, Seokju and Hong, Sunghwan and Jeon, Sangryul and Lee, Yunsung and Sohn, Kwanghoon and Kim, Seungryong},
  booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
  year={2021}
}

Official implementation of CATs: Cost Aggregation Transformers for Visual Correspondence NeurIPS'21

Related tags

Overview

CATs: Cost Aggregation Transformers for Visual Correspondence NeurIPS'21

Network

Environment Settings

Evaluation

Acknowledgement

BibTeX

Owner

Sunghwan Hong

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

TC-GNN with Pytorch integration

SemiNAS: Semi-Supervised Neural Architecture Search

[IJCAI'21] Deep Automatic Natural Image Matting

Referring Video Object Segmentation

Time-series-deep-learning - Developing Deep learning LSTM, BiLSTM models, and NeuralProphet for multi-step time-series forecasting of stock price.

GE2340 project source code without credentials.

How to Become More Salient? Surfacing Representation Biases of the Saliency Prediction Model

An Implementation of Transformer in Transformer in TensorFlow for image classification, attention inside local patches

Pytorch codes for "Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation"

Code for the CVPR 2021 paper "Triple-cooperative Video Shadow Detection"

A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swar.

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing

FairMOT for Multi-Class MOT using YOLOX as Detector

Neural network for digit classification powered by cuda

Model serving at scale

A full-fledged version of Pix2Seq

The original weights of some Caffe models, ported to PyTorch.

pyspark🍒🥭 is delicious，just eat it!😋😋