A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch

Last update: Dec 17, 2022

Related tags

Overview

Mixup: Beyond Empirical Risk Minimization in PyTorch

This is an unofficial PyTorch implementation of mixup: Beyond Empirical Risk Minimization. The code is adapted from PyTorch CIFAR.

The results:

I only tested using CIFAR 10 and CIFAR 100. The network we used is PreAct ResNet-18. For mixup, we set alpha to be default value 1, meaning we sample the weight uniformly between zero and one. I trained 200 epochs for each setting. The learning rate is 0.1 (iter 1-100), 0.01 (iter 101-150) and 0.001 (iter 151-200). The batch size is 128.

Dataset and Model	Acc.
CIFAR 10 no mixup	94.97%
CIFAR 10 mixup	95.53%
CIFAR 100 no mixup	76.53%
CIFAR 100 mixup	77.83%

CIFAR 10 test accuracy evolution

CIFAR 100 test accuracy evolution

Usage

# Train and test CIFAR 10 with mixup.
python main_cifar10.py --mixup --exp='cifar10_nomixup'
# Train and test CIFAR 10 without mixup.
python main_cifar10.py --exp='cifar10_nomixup'
# Train and test CIFAR 100 with mixup.
python main_cifar100.py --mixup --exp='cifar100_mixup'
# Train and test CIFAR 100 without mixup.
python main_cifar100.py --exp='cifar100_nomixup'

A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch

Related tags

Overview

Mixup: Beyond Empirical Risk Minimization in PyTorch

The results:

CIFAR 10 test accuracy evolution

CIFAR 100 test accuracy evolution

Usage

Owner

Harry Yang

Exploring the link between uncertainty estimates obtained via "exact" Bayesian inference and out-of-distribution (OOD) detection.

PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models

Pytorch Implementation for CVPR2018 Paper: Learning to Compare: Relation Network for Few-Shot Learning

Leveraging Two Types of Global Graph for Sequential Fashion Recommendation, ICMR 2021

Graph neural network message passing reframed as a Transformer with local attention

Attempt at implementation of a simple GAN using Keras

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Learning Super-Features for Image Retrieval

FIRA: Fine-Grained Graph-Based Code Change Representation for Automated Commit Message Generation

As a part of the HAKE project, includes the reproduced SOTA models and the corresponding HAKE-enhanced versions (CVPR2020).

Official implementation of VQ-Diffusion

Personal project about genus-0 meshes, spherical harmonics and a cow

Deep metric learning methods implemented in Chainer

PaRT: Parallel Learning for Robust and Transparent AI

A python module for configuration of block devices

Tutorial repo for an end-to-end Data Science project

Charsiu: A transformer-based phonetic aligner

A cross-lingual COVID-19 fake news dataset

Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination

YOLOv7 - Framework Beyond Detection