This repository provides an efficient PyTorch-based library for training deep models.

Last update: Jan 05, 2023

Related tags

Deep Learning Hammer

Overview

An Efficient Library for Training Deep Models

This repository provides an efficient PyTorch-based library for training deep models.

Installation

Make sure your Python >= 3.7, CUDA version >= 11.1, and CUDNN version >= 7.6.5.

Install package requirements via conda:

conda create -n <ENV_NAME> python=3.7  # create virtual environment with Python 3.7
conda activate <ENV_NAME>
pip install -r requirements/minimal.txt -f https://download.pytorch.org/whl/cu111/torch_stable.html

To use video visualizer (optional), please also install ffmpeg.
- Ubuntu: sudo apt-get install ffmpeg.
- MacOS: brew install ffmpeg.
To reduce memory footprint (optional), you can switch to either jemalloc (recommended) or tcmalloc rather than your default memory allocator.
- jemalloc (recommended):
  - Ubuntu: sudo apt-get install libjemalloc
- tcmalloc:
  - Ubuntu: sudo apt-get install google-perftools

(optional) To speed up data loading on NVIDIA GPUs, you can install DALI, together with dill to pickle python objects. It is optional to also install CuPy for some customized operations if needed:

pip install --extra-index-url https://developer.download.nvidia.com/compute/redist --upgrade nvidia-dali-<CUDA_VERSION>
pip install dill
pip install cupy  # optional, installation can be slow

For example, on CUDA 11.1, DALI can be installed via:

pip install --extra-index-url https://developer.download.nvidia.com/compute/redist --upgrade nvidia-dali-cuda110  # CUDA 11.1 compatible
pip install dill
pip install cupy  # optional, installation can be slow

Quick Demo

Train StyleGAN2 on FFHQ in Resolution of 256x256

In your Terminal, run:

./scripts/training_demos/stylegan2_ffhq256.sh <NUM_GPUS> <PATH_TO_DATA> [OPTIONS]

where

refers to the number of GPUs. Setting as 1 helps launch a training job on single-GPU platforms.
refers to the path of FFHQ dataset (in resolution of 256x256) with zip format. If running on local machines, a soft link of the data will be created under the data folder of the working directory to save disk space.
[OPTIONS] refers to any additional option to pass. Detailed instructions on available options can be shown via ./scripts/training_demos/stylegan2_ffhq256.sh --help.

This demo script uses stylegan2_ffhq256 as the default value of job_name, which is particularly used to identify experiments. Concretely, a directory with name job_name will be created under the root working directory (with is set as work_dirs/ by default). To prevent overwriting previous experiments, an exception will be raised to interrupt the training if the job_name directory has already existed. To change the job name, please use --job_name= option.

More Demos

Please find more training demos under ./scripts/training_demos/.

Inspect Training Results

Besides using TensorBoard to track the training process, the raw results (e.g., training losses and running time) are saved in JSON format. They can be easily inspected with the following script

import json

file_name = '
   
    /log.json'
   

data_entries = []
with open(file_name, 'r') as f:
    for line in f:
        data_entry = json.loads(line)
        data_entries.append(data_entry)

# An example of data entry
# {"Loss/D Fake": 0.4833524551040682, "Loss/D Real": 0.4966000154727226, "Loss/G": 1.1439273656869773, "Learning Rate/Discriminator": 0.002352941082790494, "Learning Rate/Generator": 0.0020000000949949026, "data time": 0.0036810599267482758, "iter time": 0.24490128830075264, "run time": 66108.140625}

Convert Pre-trained Models

See Model Conversion for details.

Prepare Datasets

See Dataset Preparation for details.

Develop

See Contributing Guide for details.

License

The project is under MIT License.

Acknowledgement

This repository originates from GenForce, with all modules carefully optimized to make it more flexible and robust for distributed training. On top of GenForce where only StyleGAN training is provided, this repository also supports training StyleGAN2 and StyleGAN3, both of which are fully reproduced. Any new method is welcome to merge into this repository! Please refer to the Develop section.

Contributors

The main contributors are listed as follows.

Member	Contribution
Yujun Shen	Refactor and optimize the entire codebase and reproduce start-of-the-art approaches.
Zhiyi Zhang	Contribute to a number of sub-modules and functions, especially dataset related.
Dingdong Yang	Contribute to DALI data loading acceleration.
Yinghao Xu	Originally contribute to runner and loss functions in GenForce.
Ceyuan Yang	Originally contribute to data loader in GenForce.
Jiapeng Zhu	Originally contribute to evaluation metrics in GenForce.

BibTex

We open source this library to the community to facilitate the research. If you do like our work and use the codebase for your projects, please cite our work as follows.

@misc{hammer2022,
  title =        {Hammer: An Efficient Toolkit for Training Deep Models.},
  author =       {Shen, Yujun and Zhang, Zhiyi and Yang, Dingdong and Xu, Yinghao and Yang, Ceyuan and Zhu, Jiapeng},
  howpublished = {\url{https://github.com/bytedance/Hammer}},
  year =         {2022}
}

This repository provides an efficient PyTorch-based library for training deep models.

Related tags

Overview

An Efficient Library for Training Deep Models

Installation

Quick Demo

Train StyleGAN2 on FFHQ in Resolution of 256x256

More Demos

Inspect Training Results

Convert Pre-trained Models

Prepare Datasets

Develop

License

Acknowledgement

Contributors

BibTex

Owner

Bytedance Inc.

python debugger and anti-vm that checks if you're in a virtual machine or if someones trying to debug your file

A Python 3 package for state-of-the-art statistical dimension reduction methods

Listing arxiv - Personalized list of today's articles from ArXiv

Projecting interval uncertainty through the discrete Fourier transform

Exploring Machine Learning Models for detecting anomalous behavior in credit-card transactions. It's crucial that credit-card companies are able to recognize fraudulent activity so that customers are not charged for items they didn't purchase.

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Code for STFT Transformer used in BirdCLEF 2021 competition.

FedGS: A Federated Group Synchronization Framework Implemented by LEAF-MX.

I3-master-layout - Simple master and stack layout script

Tightness-aware Evaluation Protocol for Scene Text Detection

Machine Learning toolbox for Humans

On-device speech-to-intent engine powered by deep learning

Deep Dual Consecutive Network for Human Pose Estimation (CVPR2021)

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

The source code of "SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation", accepted to WACV 2022.

ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

Clockwork Convnets for Video Semantic Segmentation

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction