Anderson Accelerated Deep Learning (AADL)

AADL is a Python package that implements Anderson acceleration (AA) to speed up the training of deep learning (DL) models using the PyTorch library.
AA is an extrapolation technique that can accelerate fixed-point iterations such as those arising from the iterative training of DL models. However, large volumes of data are typically processed in sequential random batches, which introduces stochastic oscillations in the fixed-point iteration that hinder AA. AADL implements a moving average that damps these oscillations and yields a smoother sequence of gradient descent updates, which makes AA usable. AADL automatically decides whether the moving average is needed by checking whether the relative standard deviation between consecutive stochastic gradient updates exceeds a user-defined tolerance.
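
To make the extrapolation idea concrete, here is a minimal NumPy sketch of Anderson acceleration on a generic fixed-point problem x = g(x). It is an illustration of the textbook technique, not AADL's implementation: the function name `anderson_fixed_point`, its arguments, and the small linear test problem are all hypothetical and chosen only for this example.

```python
import numpy as np

def anderson_fixed_point(g, x0, history_depth=5, relaxation=1.0,
                         tol=1e-10, max_iter=2000):
    """Illustrative Anderson-accelerated iteration for x = g(x).

    Returns (x, n_iter). With history_depth=0 this reduces to the
    plain (relaxed) fixed-point iteration.
    """
    x = np.asarray(x0, dtype=float)
    x_hist, r_hist = [], []  # recent iterates and their residuals
    for k in range(max_iter):
        r = g(x) - x  # fixed-point residual
        if np.linalg.norm(r) < tol:
            return x, k
        x_hist.append(x)
        r_hist.append(r)
        # Keep only the most recent history_depth + 1 entries.
        x_hist = x_hist[-(history_depth + 1):]
        r_hist = r_hist[-(history_depth + 1):]
        if len(r_hist) == 1:
            # Not enough history yet: plain relaxed step.
            x = x + relaxation * r
        else:
            # Differences between consecutive iterates and residuals.
            dX = np.column_stack([x_hist[i + 1] - x_hist[i]
                                  for i in range(len(x_hist) - 1)])
            dR = np.column_stack([r_hist[i + 1] - r_hist[i]
                                  for i in range(len(r_hist) - 1)])
            # Least-squares coefficients minimizing ||r - dR @ gamma||.
            gamma, *_ = np.linalg.lstsq(dR, r, rcond=None)
            x = x + relaxation * r - (dX + relaxation * dR) @ gamma
    return x, max_iter

# Hypothetical test problem: a slowly contracting linear map.
A = np.array([[0.9, 0.05], [0.05, 0.9]])
b = np.array([1.0, 2.0])
g = lambda x: A @ x + b

x_aa, n_aa = anderson_fixed_point(g, np.zeros(2), history_depth=5)
x_plain, n_plain = anderson_fixed_point(g, np.zeros(2), history_depth=0)
```

On this deterministic problem the accelerated run needs far fewer iterations than the plain one. In stochastic training the residuals are noisy, which is the situation the moving average described above is meant to address.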

Requirements

Python 3.5 or greater
PyTorch (any version works)

Installation

AADL comes with a setuptools install script:

python3 setup.py install

Usage

import torch
import torch.nn
import torch.optim
import AADL

# Creation of the DL model (neural network)
class Model(torch.nn.Module):
	...

# The optimizer needs the parameters of a model instance, not of the class
model = Model()

# Definition of the stochastic optimizer used to train the model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9, nesterov=True)

# Parameters for Anderson acceleration
relaxation = 0.5
wait_iterations = 0
history_depth = 10
store_each_nth = 10
frequency = store_each_nth
reg_acc = 0.0
safeguard = True
average = True

# Overwrite the optimizer's step() method with the Anderson-accelerated version
AADL.accelerate(optimizer, "anderson", relaxation, wait_iterations, history_depth, store_each_nth, frequency, reg_acc, average)
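
The `average` flag above presumably relates to the moving average described in the overview. The sketch below illustrates one way such a check could look: it measures the relative standard deviation over a short window of recent updates and averages them only when it exceeds a tolerance. The function names, the window-based formula, and the tolerance value are assumptions made for illustration; AADL's actual criterion may differ.

```python
import numpy as np

def relative_std(updates):
    """Norm of the element-wise std of the updates over the norm of their mean."""
    stacked = np.stack(updates)
    mean = stacked.mean(axis=0)
    std = stacked.std(axis=0)
    # Small constant avoids division by zero for a vanishing mean.
    return np.linalg.norm(std) / (np.linalg.norm(mean) + 1e-12)

def smooth_if_noisy(updates, tolerance):
    """Return (update, averaged).

    If the window is noisier than the tolerance, return its average;
    otherwise return the most recent update unchanged.
    """
    if relative_std(updates) > tolerance:
        return np.mean(np.stack(updates), axis=0), True
    return updates[-1], False

# Hypothetical windows of updates: one noisy, one nearly constant.
rng = np.random.default_rng(0)
noisy = [np.ones(4) + rng.normal(scale=1.0, size=4) for _ in range(10)]
steady = [np.ones(4) * (1.0 + 1e-3 * i) for i in range(10)]

noisy_update, noisy_averaged = smooth_if_noisy(noisy, tolerance=0.1)
steady_update, steady_averaged = smooth_if_noisy(steady, tolerance=0.1)
```

With these inputs the noisy window is averaged and the steady window is passed through unchanged, so the smoothing is applied only when the oscillations are large relative to the size of the update.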

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

BSD-3-Clause

Citations

"AADL: Anderson Accelerated Deep Learning", Copyright ID#: 81927550 https://doi.org/10.11578/dc.20210723.1

Owner

Oak Ridge National Laboratory
