This repository contains code for the paper "Disentangling Label Distribution for Long-tailed Visual Recognition", published at CVPR 2021.

Last update: Oct 18, 2022

Overview

Disentangling Label Distribution for Long-tailed Visual Recognition (CVPR 2021)

Arxiv link
Blog post
This codebase is built on Causal Norm.

Install

conda create -n longtail pip python=3.7 -y
source activate longtail
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch
pip install pyyaml tqdm matplotlib scikit-learn h5py tensorboard
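
To confirm that the environment is complete before training, a small check like the one below can help. It is only a sketch: the helper name `missing_packages` is hypothetical and not part of this repository, and the import names listed (for example `yaml` for pyyaml and `sklearn` for scikit-learn) are the usual ones for the packages installed above.

```python
import importlib.util


def missing_packages(import_names):
    """Return the import names that cannot be found in the current environment."""
    return [name for name in import_names
            if importlib.util.find_spec(name) is None]


# Import names assumed for the packages installed above.
required = ["torch", "torchvision", "yaml", "tqdm",
            "matplotlib", "sklearn", "h5py", "tensorboard"]

missing = missing_packages(required)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages were found.")
```

Run it inside the activated `longtail` environment; an empty "missing" list suggests the install step succeeded.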

Training

Preliminaries

Download the pretrained Caffe ResNet-152 model for Places-LT: please refer to link.
Prepare the datasets: CIFAR-100, Places-LT, ImageNet-LT, and iNaturalist 2018.
- Please download these datasets following Decoupling.

CIFAR-100 training

For CIFAR-100 with imbalance ratio 0.01, using LADE:

python main.py --seed 1 --cfg config/CIFAR100_LT/lade.yaml --exp_name lade2021/cifar100_imb0.01_lade --cifar_imb_ratio 0.01 --remine_lambda 0.01 --alpha 0.1 --gpu 0
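
For intuition about what an imbalance ratio of 0.01 means, the sketch below computes per-class training counts under the exponential decay profile commonly used to build long-tailed CIFAR. This is an assumption for illustration: the function name and parameters are hypothetical, and the data loader in this repository may construct the split differently.

```python
def long_tailed_class_counts(max_per_class, num_classes, imb_ratio):
    """Per-class sample counts that decay exponentially from head to tail."""
    counts = []
    for class_idx in range(num_classes):
        # The exponent runs from 0 (most frequent class) to 1 (rarest class).
        exponent = class_idx / (num_classes - 1.0)
        counts.append(int(max_per_class * (imb_ratio ** exponent)))
    return counts


# CIFAR-100 has 500 training images per class before subsampling.
counts = long_tailed_class_counts(max_per_class=500, num_classes=100, imb_ratio=0.01)
print("head class:", counts[0])
print("tail class:", counts[-1])
print("total images:", sum(counts))
```

Under this profile the rarest class keeps about 1% as many images as the most frequent class.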

Places-LT training

For PC Softmax:

python main.py --seed 1 --cfg config/Places_LT/ce.yaml --exp_name lade2021/places_pc_softmax --lr 0.05 --gpu 0,1,2,3
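
The command above trains with the plain cross-entropy config. A minimal sketch of the post-compensation idea follows, assuming that PC Softmax adjusts the logits at inference by subtracting the log of the training (source) label prior and adding the log of the target label prior. The function name and the example numbers are hypothetical and not taken from this repository.

```python
import math


def pc_softmax(logits, source_prior, target_prior):
    """Softmax over logits shifted from the source label prior to the target prior."""
    adjusted = [z - math.log(ps) + math.log(pt)
                for z, ps, pt in zip(logits, source_prior, target_prior)]
    # Subtract the maximum for numerical stability before exponentiating.
    peak = max(adjusted)
    exps = [math.exp(a - peak) for a in adjusted]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]            # made-up model outputs for 3 classes
source_prior = [0.7, 0.2, 0.1]      # long-tailed training label frequencies
target_prior = [1 / 3, 1 / 3, 1 / 3]  # uniform test label distribution

probs = pc_softmax(logits, source_prior, target_prior)
print([round(p, 3) for p in probs])
```

When the source and target priors are equal, the adjustment cancels and the result is the ordinary softmax. With a long-tailed source prior and a uniform target, probability mass moves toward the tail classes.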

For LADE:

python main.py --seed 1 --cfg config/Places_LT/lade.yaml --exp_name lade2021/places_lade --lr 0.05 --remine_lambda 0.1 --alpha 0.005 --gpu 0,1,2,3

ImageNet-LT training

For LADE:

python main.py --seed 1 --cfg config/ImageNet_LT/lade.yaml --exp_name lade2021/imagenet_lade --lr 0.05 --remine_lambda 0.5 --alpha 0.05 --gpu 0,1,2,3

iNaturalist18 training

For LADE:

python main.py --seed 1 --cfg ./config/iNaturalist18/lade.yaml --exp_name lade2021/inat_lade --lr 0.1 --alpha 0.05 --gpu 0,1,2,3

Evaluate on shifted test set & Confidence calibration

For ImageNet (Sections 4.3 and 4.4):

./notebooks/imagenet-shift-calib.ipynb

For CIFAR-100 (Supplementary material):

./notebooks/cifar100-shift-calib.ipynb
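
As background for the calibration part, the sketch below computes the expected calibration error (ECE), a widely used calibration metric. It is an illustration under assumptions: the notebooks may use a different metric, binning scheme, or implementation, and the function name and inputs here are hypothetical.

```python
def expected_calibration_error(confidences, correct, num_bins=10):
    """Weighted average gap between accuracy and mean confidence per bin."""
    n = len(confidences)
    if n == 0:
        return 0.0
    conf_sum = [0.0] * num_bins
    hit_sum = [0.0] * num_bins
    count = [0] * num_bins
    for conf, hit in zip(confidences, correct):
        # Equal-width bins over [0, 1]; a confidence of 1.0 goes in the last bin.
        idx = min(int(conf * num_bins), num_bins - 1)
        conf_sum[idx] += conf
        hit_sum[idx] += 1.0 if hit else 0.0
        count[idx] += 1
    ece = 0.0
    for b in range(num_bins):
        if count[b] == 0:
            continue
        accuracy = hit_sum[b] / count[b]
        mean_conf = conf_sum[b] / count[b]
        ece += (count[b] / n) * abs(accuracy - mean_conf)
    return ece


# Made-up predictions: top-class confidence and whether the prediction was right.
confidences = [0.9, 0.9, 0.6, 0.6]
correct = [True, False, True, False]
print(expected_calibration_error(confidences, correct))
```

A lower value means that the predicted confidences track the observed accuracy more closely.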

License

This software is released under the BSD-3 license.

Citation

If you find our paper or this project helpful for your research, please consider citing our paper in your publications.

@article{hong2020disentangling,
  title={Disentangling Label Distribution for Long-tailed Visual Recognition},
  author={Hong, Youngkyu and Han, Seungju and Choi, Kwanghee and Seo, Seokjun and Kim, Beomsu and Chang, Buru},
  journal={arXiv preprint arXiv:2012.00321},
  year={2020}
}


Owner

Hyperconnect
