Application of the L2HMC algorithm to simulations in lattice QCD.

Last update: Dec 14, 2022

Overview

l2hmc-qcd

📊 Slides

Recent talk on Training Topological Samplers for Lattice Gauge Theory from the Machine Learning for High Energy Physics, on and off the Lattice @ ect* Trento (09/30/2021)

📒 Example Notebook

Accepted to the Deep Learning for Simulation (SimDL) Workshop at ICLR 2021
- 📚 : arXiv:2105.03418
- 📊 : poster

Overview

The L2HMC algorithm aims to improve upon HMC by optimizing a carefully chosen loss function which is designed to minimize autocorrelations within the Markov Chain, thereby improving the efficiency of the sampler.

This work is based on the original implementation: brain-research/l2hmc/.

A detailed description of the L2HMC algorithm can be found in the paper:

Generalizing Hamiltonian Monte Carlo with Neural Network

by Daniel Levy, Matt D. Hoffman and Jascha Sohl-Dickstein.

Broadly, given an analytically described target distribution, π(x), L2HMC provides a statistically exact sampler that:

Quickly converges to the target distribution (fast burn-in).
Quickly produces uncorrelated samples (fast mixing).
Is able to efficiently mix between energy levels.
Is capable of traversing low-density zones to mix between modes (often difficult for generic HMC).

L2HMC for LatticeQCD

Goal: Use L2HMC to efficiently generate gauge configurations for calculating observables in lattice QCD.

A detailed description of the (ongoing) work to apply this algorithm to simulations in lattice QCD (specifically, a 2D U(1) lattice gauge theory model) can be found in doc/main.pdf.

Organization

Dynamics / Network

The base class for the augmented L2HMC leapfrog integrator is implemented in the BaseDynamics (a tf.keras.Model object).

The GaugeDynamics is a subclass of BaseDynamics containing modifications for the 2D U(1) pure gauge theory.

The network is defined in l2hmc-qcd/network/functional_net.py.

Network Architecture

An illustration of the leapfrog layer updating (x, v) --> (x', v') can be seen below.

Lattice

Lattice code can be found in lattice.py, specifically the GaugeLattice object that provides the base structure on which our target distribution exists.

Additionally, the GaugeLattice object implements a variety of methods for calculating physical observables such as the average plaquette, ɸₚ, and the topological charge Q,

Training

The training loop is implemented in l2hmc-qcd/utils/training_utils.py .

To train the sampler on a 2D U(1) gauge model using the parameters specified in bin/train_configs.json:

$ python3 /path/to/l2hmc-qcd/l2hmc-qcd/train.py --json_file=/path/to/l2hmc-qcd/bin/train_configs.json

Or via the bin/train.sh script provided in bin/.

Features

Distributed training (via horovod): If horovod is installed, the model can be trained across multiple GPUs (or CPUs) by:

#!/bin/bash

TRAINER=/path/to/l2hmc-qcd/l2hmc-qcd/train.py
JSON_FILE=/path/to/l2hmc-qcd/bin/train_configs.json

horovodrun -np ${PROCS} python3 ${TRAINER} --json_file=${JSON_FILE}

Contact

Code author: Sam Foreman

Pull requests and issues should be directed to: saforem2

Citation

If you use this code or found this work interesting, please cite our work along with the original paper:

@misc{foreman2021deep,
      title={Deep Learning Hamiltonian Monte Carlo}, 
      author={Sam Foreman and Xiao-Yong Jin and James C. Osborn},
      year={2021},
      eprint={2105.03418},
      archivePrefix={arXiv},
      primaryClass={hep-lat}
}

@article{levy2017generalizing,
  title={Generalizing Hamiltonian Monte Carlo with Neural Networks},
  author={Levy, Daniel and Hoffman, Matthew D. and Sohl-Dickstein, Jascha},
  journal={arXiv preprint arXiv:1711.09268},
  year={2017}
}

Acknowledgement

This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under contract DE_AC02-06CH11357. This work describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the work do not necessarily represent the views of the U.S. DOE or the United States Government. Declaration of Interests - None.

Comments

Remove upper bound on python_requires

(I'm moving between meetings so can iterate on this more later, so excuse the very brief Issue for now).

At the moment the project has an upper bound on python_requires

https://github.com/saforem2/l2hmc-qcd/blob/2eb6ee63cc0c53b187e6d716f4c12f418c8b8515/setup.py#L165

Assuming that you're intending l2hmc to be a library and not an application, then I would highly recommend removing this for the reasons summarized in Henry's detailed blog post on the subject.

Congrats on getting l2hmc up on PyPI though! :snake: :rocket:

opened by matthewfeickert 2
Alpha
Pull upstream alpha branch into main

Major changes

new src/ hierarchical module organization

Contains skeleton implementation of 4D SU(3) lattice gauge model

src/l2hmc/lattice/gauge/lattice.py

Framework independent configuration

Unified configuration system simplifies logic, same configs used for both tensorflow and pytorch experiments

Plan to be able to specify which backend to use through config option

Unified (and framework independent) configurations between tensorflow and pytorch implementations

Definitions can be found in l2hmc-qcd/src/l2hmc/configs.py

Note: This is still very much a WIP. Many existing features still need to be re-implemented / updated into new code in src/.

Todo

[ ] Write unit tests

[ ] Use simple configs for end-to-end workflow test + integrate into CI

[ ] dynamic learning rate scheduling

[ ] Test 4D SU(3) numpy code

[ ] Write tensorflow and pytorch implementations of LatticeSU3 objects

[ ] Improved / simplified ( / trainable?) annealing schedule

[ ] Distributed training support

[ ] horovod

[ ] DDP for pytorch implementation

[ ] DeepSpeed from Microsoft??

[ ] Testing / inference logic

[ ] Automatic checkpointing

[ ] Metric logging

[ ] Tensorboard?

[ ] Sacred?

[ ] build custom dashboard? plot.ly?

[ ] Setup packaging / distribution through pip

[ ] Resolve issue
opened by saforem2 1
Alpha
Major upgrades to how training is initialized in l2hmc-qcd/utils/training_utils.py, particularly when trying to restore a model from an existing checkpoint.

Significant upgrades to logging mechanics in l2hmc-qcd/utils/logger.py and l2hmc-qcd/utils/logger_config.py which now use a RichHandler to nicely format log messages characterized by severity, including automatic file rotation, etc.

Improvements to test suite in l2hmc-qcd/tests/test_training.py, more robust tests on larger set of possible cases

TODO: Automate using github actions for CI

Improvements to l2hmc-qcd/dynamics/gauge_dynamics.py but still a WIP
opened by saforem2 1
Rich
General improvements, rewrote logging methods to use Rich for better formatting.

Adds dynamic (trainable) step size eps for each separate x and v updates, seems to generally increase the total energy towards the middle of the trajectory but it remains unclear if this corresponds to an improvement in the tunneling rate

Adds methods for calculating autocorrelations of the topological charge, as well as notebooks for generating the plots

Updates to the writeup in doc/main.pdf

Will likely be last changes to writeup before public release of official draft
opened by saforem2 1
Dev
Updates to README

Ability to load network with new training instance

Updates to doc/, removes old sections related to debugging the bias in the plaquette
opened by saforem2 1
Saveable model
Complete rewrite of dynamics.xnet and dynamics.vnet models to use tf.keras.functional Models.

Additional changes include:

Non-Compact Projection update for gauge fields

Ability to specify convolution structure to be prepended at beginning of gauge network
opened by saforem2 1
Dev

Removes models/gauge_model.py entirely.

Instead, a base dynamics class is implemented in dynamics/dynamics.py, and an example subclass is provided in dynamics/gauge_dynamics.py.

opened by saforem2 1
Split networks

Major rewrite of existing codebase.

This pull request updates everything to be compatible with tensorflow >= 2.2 and removes a bunch of redundant legacy code.

opened by saforem2 1
Dev
Dynamics object is now compatible with tf >= 2.0

Running inference on trained model with tensorflow now creates identical graphs and summary files to numpy inference code

Inference with numpy now uses object oriented structure

Adds LaTeX + PDF documentation in doc/
opened by saforem2 1
Cooley dev

Adds new GaugeNetwork architecture as the default for training GaugeModel

Additionally, replaces pickle with joblib for saving data as .z compressed files (as opposed to .pkl files).

opened by saforem2 1
Testing

Implemented nnehmc_loss calculation for an alternative loss function using the approach suggested in https://infoscience.epfl.ch/record/264887/files/robust_parameter_estimation.pdf.

This modified loss function can be chosen (instead of the standard loss described in the original paper) by passing --use_nnehmc_loss as a command line argument.

opened by saforem2 1

Packaging and PyPI distribution?

As you've made a library and are using it as such:

# snippet from toy_distributions.ipynb

# append parent directory to `sys.path`
# to load from modules in `../l2hmc-qcd/`
module_path = os.path.join('..')
if module_path not in sys.path:
    sys.path.append(module_path)

# Local imports
from utils.attr_dict import AttrDict
from utils.training_utils import train_dynamics
from dynamics.config import DynamicsConfig
from dynamics.base_dynamics import BaseDynamics
from dynamics.generic_dynamics import GenericDynamics
from network.config import LearningRateConfig
from config import (State, NetWeights, MonteCarloStates,
                    BASE_DIR, BIN_DIR, TF_FLOAT)

from utils.distributions import (plot_samples2D, contour_potential,
                                 two_moons_potential, sin_potential,
                                 sin_potential1, sin_potential2)

do you have any plans and/or interest in packaging it as a Python library so it can either be pip installed from GitHub or be distributed on PyPI?

opened by matthewfeickert 5

Releases(0.12.0)

0.12.0(Aug 9, 2022)

Source code(tar.gz)
Source code(zip)
0.8.0(Apr 14, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.7.0...0.8.0
Source code(tar.gz)
Source code(zip)
0.7.0(Apr 14, 2022)

pypi release: v0.7.0

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.4.0...0.7.0
Source code(tar.gz)
Source code(zip)
0.4.0(Apr 8, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.3.0...0.4.0
Source code(tar.gz)
Source code(zip)

Owner

Sam Foreman

Computational science Postdoc at Argonne National Laboratory working on applying machine learning to simulations in lattice QCD.

GitHub Repository https://samforeman.me/l2hmc-qcd

All the code and files related to the MI-Lab of UE19CS305 course in sem 5

Machine-Intelligence-Lab-CS305 The compilation of all the code an drelated files from MI-Lab UE19CS305 (of batch 2019-2023) offered by PES University

3 Nov 10, 2022

LinkNet - This repository contains our Torch7 implementation of the network developed by us at e-Lab.

LinkNet This repository contains our Torch7 implementation of the network developed by us at e-Lab. You can go to our blogpost or read the article Lin

158 Nov 11, 2022

Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling

RHGN Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling Dependencies torch==1.6.0 torchvision==0.7.0 dgl==0.7.1

6 Nov 29, 2022

Learning Continuous Signed Distance Functions for Shape Representation

DeepSDF This is an implementation of the CVPR '19 paper "DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation" by Park et a

1.1k Jan 01, 2023

Pytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646

[TCSVT] Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization LPN [Paper] NEWs Prerequisites Python 3.6 GPU Memory = 8G Numpy 1.

46 Dec 14, 2022

A scanpy extension to analyse single-cell TCR and BCR data.

Scirpy: A Scanpy extension for analyzing single-cell immune-cell receptor sequencing data Scirpy is a scalable python-toolkit to analyse T cell recept

145 Jan 03, 2023

Codebase to experiment with a hybrid Transformer that combines conditional sequence generation with regression

Regression Transformer Codebase to experiment with a hybrid Transformer that combines conditional sequence generation with regression . Development se

27 Jan 05, 2023

Koopman operator identification library in Python

pykoop pykoop is a Koopman operator identification library written in Python. It allows the user to specify Koopman lifting functions and regressors i

34 Jan 04, 2023

Meta Language-Specific Layers in Multilingual Language Models

Meta Language-Specific Layers in Multilingual Language Models This repo contains the source codes for our paper On Negative Interference in Multilingu

20 Feb 13, 2022

Contrastive Learning Inverts the Data Generating Process

Official code to reproduce the results and data presented in the paper Contrastive Learning Inverts the Data Generating Process.

71 Nov 25, 2022

Generative Art Using Neural Visual Grammars and Dual Encoders

Generative Art Using Neural Visual Grammars and Dual Encoders Arnheim 1 The original algorithm from the paper Generative Art Using Neural Visual Gramm

231 Jan 05, 2023

An OpenAI-Gym Package for Training and Testing Reinforcement Learning algorithms with OpenSim Models

Authors: Utkarsh A. Mishra and Dr. Dimitar Stanev Advisors: Dr. Dimitar Stanev and Prof. Auke Ijspeert, Biorobotics Laboratory (BioRob), EPFL Video Pl

16 Dec 13, 2022

Learning Representational Invariances for Data-Efficient Action Recognition

Learning Representational Invariances for Data-Efficient Action Recognition Official PyTorch implementation for Learning Representational Invariances

27 Nov 22, 2022

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

ACTION-Net Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21). Getting Started EgoGesture data folder struct

171 Dec 26, 2022

This is my research project for the Irving Center for Cancer Dynamics/Azizi Lab, Columbia University.

bayesian_uncertainty This is my research project for the Irving Center for Cancer Dynamics/Azizi Lab, Columbia University. In this project I build a s

1 Feb 13, 2022

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

VIN: Value Iteration Networks This is an implementation of Value Iteration Networks (VIN) in PyTorch to reproduce the results.(TensorFlow version) Key

215 Dec 07, 2022

Official implementation of the NeurIPS 2021 paper Online Learning Of Neural Computations From Sparse Temporal Feedback

Online Learning Of Neural Computations From Sparse Temporal Feedback This repository is the official implementation of the NeurIPS 2021 paper Online L

3 Dec 15, 2021

D-NeRF: Neural Radiance Fields for Dynamic Scenes

D-NeRF: Neural Radiance Fields for Dynamic Scenes [Project] [Paper] D-NeRF is a method for synthesizing novel views, at an arbitrary point in time, of

291 Jan 02, 2023

Official Pytorch implementation of "Learning Debiased Representation via Disentangled Feature Augmentation (Neurips 2021, Oral)"

Learning Debiased Representation via Disentangled Feature Augmentation (Neurips 2021, Oral): Official Project Webpage This repository provides the off

68 Dec 17, 2022

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

ActNN : Activation Compressed Training This is the official project repository for ActNN: Reducing Training Memory Footprint via 2-Bit Activation Comp

178 Jan 05, 2023

Application of the L2HMC algorithm to simulations in lattice QCD.

Related tags

Overview

l2hmc-qcd

📊 Slides

📒 Example Notebook

Overview

L2HMC for LatticeQCD

Organization

Dynamics / Network

Network Architecture

Lattice

Training

Features

Contact

Citation

Acknowledgement

Comments

Major changes

Todo

Releases(0.12.0)

0.12.0(Aug 9, 2022)

0.8.0(Apr 14, 2022)

0.7.0(Apr 14, 2022)

0.4.0(Apr 8, 2022)

Owner

Sam Foreman

All the code and files related to the MI-Lab of UE19CS305 course in sem 5

LinkNet - This repository contains our Torch7 implementation of the network developed by us at e-Lab.

Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling

Learning Continuous Signed Distance Functions for Shape Representation

Pytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646

A scanpy extension to analyse single-cell TCR and BCR data.

Codebase to experiment with a hybrid Transformer that combines conditional sequence generation with regression

Koopman operator identification library in Python

Meta Language-Specific Layers in Multilingual Language Models

Contrastive Learning Inverts the Data Generating Process

Generative Art Using Neural Visual Grammars and Dual Encoders

An OpenAI-Gym Package for Training and Testing Reinforcement Learning algorithms with OpenSim Models

Learning Representational Invariances for Data-Efficient Action Recognition

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

This is my research project for the Irving Center for Cancer Dynamics/Azizi Lab, Columbia University.

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

Official implementation of the NeurIPS 2021 paper Online Learning Of Neural Computations From Sparse Temporal Feedback

D-NeRF: Neural Radiance Fields for Dynamic Scenes

Official Pytorch implementation of "Learning Debiased Representation via Disentangled Feature Augmentation (Neurips 2021, Oral)"

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training