Simple data balancing baselines for worst-group-accuracy benchmarks.

Last update: Dec 02, 2022

Related tags

Overview

BalancingGroups

Code to replicate the experimental results from Simple data balancing baselines achieve competitive worst-group-accuracy.

Replicating the main results

Installing dependencies

Easiest way to have a working environment for this repo is to create a conda environement with the following commands

conda env create -f environment.yaml
conda activate balancinggroups

If conda is not available, please install the dependencies listed in the requirements.txt file.

Download, extract and Generate metadata for datasets

This script downloads, extracts and formats the datasets metadata so that it works with the rest of the code out of the box.

python setup_datasets.py --download --data_path data

Launch jobs

To reproduce the experiments in the paper on a SLURM cluster :

# Launching 1400 combo seeds = 50 hparams for 4 datasets for 7 algorithms
# Each combo seed is ran 5 times to compute error bars, totalling 7000 jobs
python train.py --data_path data --output_dir main_sweep --num_hparams_seeds 1400 --num_init_seeds 5 --partition <slurm_partition>

If you want to run the jobs localy, omit the --partition argument.

Parse results

The parse.py script can generate all of the plots and tables from the paper. By default, it generates the best test worst-group-accuracy table for each dataset/method. This script can be called while the experiments are still running.

python parse.py main_sweep

License

This source code is released under the CC-BY-NC license, included here.

Simple data balancing baselines for worst-group-accuracy benchmarks.

Related tags

Overview

BalancingGroups

Replicating the main results

Installing dependencies

Download, extract and Generate metadata for datasets

Launch jobs

Parse results

License

Owner

Meta Research

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Challenge on Spectral Reconstruction from RGB)

Manage the availability of workspaces within Frappe/ ERPNext (sidebar) based on user-roles

Implementation of SSMF: Shifting Seasonal Matrix Factorization

Recommendationsystem - Movie-recommendation - matrixfactorization colloborative filtering recommendation system user

A Joint Video and Image Encoder for End-to-End Retrieval

This code uses generative adversarial networks to generate diverse task allocation plans for Multi-agent teams.

Single-Shot Motion Completion with Transformer

Must-read Papers on Physics-Informed Neural Networks.

(AAAI2020)Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

This is a Deep Leaning API for classifying emotions from human face and human audios.

This repository is for DSA and CP scripts for reference.

A denoising diffusion probabilistic model synthesises galaxies that are qualitatively and physically indistinguishable from the real thing.

Repo for CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

The code for our CVPR paper PISE: Person Image Synthesis and Editing with Decoupled GAN, Project Page, supp.

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Notebooks em Python para Métodos Eletromagnéticos

Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

CS50x-AI - Artificial Intelligence with Python from Harvard University

PyTorch implementations of Generative Adversarial Networks.

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English