Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

Last update: Jan 08, 2023

Overview

Class-Balanced Loss Based on Effective Number of Samples

Tensorflow code for the paper:

Class-Balanced Loss Based on Effective Number of Samples
Yin Cui, Menglin Jia, Tsung-Yi Lin, Yang Song, Serge Belongie

Dependencies:

Python (3.6)
Tensorflow (1.14)

Datasets:

Long-Tailed CIFAR. We provide a download link that includes all the data used in our paper in .tfrecords format. The data was converted and generated by src/generate_cifar_tfrecords.py (original CIFAR) and src/generate_cifar_tfrecords_im.py (long-tailed CIFAR).

Effective Number of Samples:

For a visualization of the data and effective number of samples, please take a look at data.ipynb.

Key Implementation Details:

Training and Evaluation:

We provide 3 .sh scripts for training and evaluation.

On original CIFAR dataset:

./cifar_trainval.sh

On long-tailed CIFAR dataset (the hyperparameter IM_FACTOR is the inverse of "Imbalance Factor" in the paper):

./cifar_im_trainval.sh

On long-tailed CIFAR dataset using the proposed class-balanced loss (set non-zero BETA):

./cifar_im_trainval_cb.sh

Run Tensorboard for visualization:

tensorboard --logdir=./results --port=6006

The figure below are the results of running ./cifar_im_trainval.sh and ./cifar_im_trainval_cb.sh:

Training with TPU:

We train networks on iNaturalist and ImageNet datasets using Google's Cloud TPU. The code for this section is in tpu/. Our code is based on the official implementation of Training ResNet on Cloud TPU and forked from https://github.com/tensorflow/tpu.

Data Preparation:

Download datasets (except images) from this link and unzip it under tpu/. The unzipped directory tpu/raw_data/ contains the training and validation splits. For raw images, please download from the following links and put them into the corresponding folders in tpu/raw_data/:
Convert datasets into .tfrecords format and upload to Google Cloud Storage (gcs) using tpu/tools/datasets/dataset_to_gcs.py:

python dataset_to_gcs.py \
  --project=$PROJECT \
  --gcs_output_path=$GCS_DATA_DIR \
  --local_scratch_dir=$LOCAL_TFRECORD_DIR \
  --raw_data_dir=$LOCAL_RAWDATA_DIR

The following 3 .sh scripts in tpu/ can be used to train and evaluate models on iNaturalist and ImageNet using Cloud TPU. For more details on how to use Cloud TPU, please refer to Training ResNet on Cloud TPU.

Note that the image mean and standard deviation and input size need to be updated accordingly.

On ImageNet (ILSVRC 2012):

./run_ILSVRC2012.sh

On iNaturalist 2017:

./run_inat2017.sh

On iNaturalist 2018:

./run_inat2018.sh

The pre-trained models, including all logs viewable on tensorboard, can be downloaded from the following links:

Dataset	Network	Loss	Input Size	Download Link
ILSVRC 2012	ResNet-50	Class-Balanced Focal Loss	224	link
iNaturalist 2018	ResNet-50	Class-Balanced Focal Loss	224	link

Citation

If you find our work helpful in your research, please cite it as:

@inproceedings{cui2019classbalancedloss,
  title={Class-Balanced Loss Based on Effective Number of Samples},
  author={Cui, Yin and Jia, Menglin and Lin, Tsung-Yi and Song, Yang and Belongie, Serge},
  booktitle={CVPR},
  year={2019}
}

Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

Related tags

Overview

Class-Balanced Loss Based on Effective Number of Samples

Dependencies:

Datasets:

Effective Number of Samples:

Key Implementation Details:

Training and Evaluation:

Training with TPU:

Citation

Owner

Yin Cui

Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

Object Detection Projekt in GKI WS2021/22

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

Useful materials and tutorials for 110-1 NTU DBME5028 (Application of Deep Learning in Medical Imaging)

Official implementation of the NRNS paper: No RL, No Simulation: Learning to Navigate without Navigating

Attention for PyTorch with Linear Memory Footprint

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

Experiments on continual learning from a stream of pretrained models.

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

HODEmu, is both an executable and a python library that is based on Ragagnin 2021 in prep.

ExCon: Explanation-driven Supervised Contrastive Learning

3D-Transformer: Molecular Representation with Transformer in 3D Space

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

2D&3D human pose estimation

Pomodoro timer that acknowledges the inexorable, infinite passage of time

Gradient-free global optimization algorithm for multidimensional functions based on the low rank tensor train format

Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting (ICCV, 2021)

(Personalized) Page-Rank computation using PyTorch

(CVPR 2021) PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds