The full training script for Enformer (TensorFlow Sonnet) on TPU clusters

Last update: Oct 19, 2022

Overview

Enformer TPU training script (wip)

This repository holds the complete TPU training script for Enformer (TensorFlow Sonnet). It was written as part of an effort to migrate the model to PyTorch.

The script was pieced together from the DeepMind Enformer repository, the Colab training notebook, and the Basenji sequence augmentation code.

It accounts for:

distributed TPU training
distributed datasets
distributed validation
gradient clipping
cross-replica batch normalization
dataset augmentation
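
As an illustration of the dataset augmentation item above, here is a minimal sketch of the two augmentations commonly used for genomic sequence models: a small random shift and a random reverse complement. It is not the repository's code and not the Basenji implementation. The function names (`reverse_complement`, `shift_sequence`, `augment`), the `max_shift` default, and the use of plain strings instead of one-hot tensors are all assumptions made for readability.

```python
import random

# Complement map for DNA bases; N (unknown base) maps to itself.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def shift_sequence(seq, shift, pad="N"):
    """Shift a sequence by `shift` positions, padding with `pad` to keep its length.

    A positive shift moves bases to the right (padding on the left);
    a negative shift moves them to the left (padding on the right).
    """
    if shift == 0:
        return seq
    if abs(shift) >= len(seq):
        return pad * len(seq)
    if shift > 0:
        return pad * shift + seq[:-shift]
    return seq[-shift:] + pad * (-shift)


def augment(seq, targets, max_shift=3, rng=random):
    """Apply a random shift and, with probability 0.5, a reverse complement.

    `targets` is a list of per-bin values along the sequence. When the
    sequence is reverse-complemented, the targets are reversed so that
    they stay aligned with it. The shift is assumed to be small relative
    to the bin width, so the targets are not shifted.
    """
    shift = rng.randint(-max_shift, max_shift)
    seq = shift_sequence(seq, shift)
    if rng.random() < 0.5:
        seq = reverse_complement(seq)
        targets = targets[::-1]
    return seq, targets
```

In a real input pipeline the same logic would be applied to one-hot tensors inside the dataset map function, and only to the training split.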

Training takes about 3 days on a TPU v3-64.

Todo

Fix the script to account for the difference in sequence length: the Basenji training data uses sequences of ~130k bp, whereas the paper uses ~190k bp.
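
One possible way to close this gap, sketched below, is to widen each training interval symmetrically around its center and re-extract the longer sequence from the reference genome. This is only an assumption about how the fix could look, not what the script does. The exact lengths used here (131,072 and 196,608 bp) are assumed values consistent with the approximate figures above, and the helpers `extend_interval` and `clamp_with_padding` are hypothetical.

```python
# Assumed exact lengths; the text above only gives ~130k and ~190k bp.
BASENJI_SEQ_LEN = 131_072
ENFORMER_SEQ_LEN = 196_608


def extend_interval(start, end, target_length):
    """Symmetrically widen the half-open interval [start, end) to target_length."""
    current = end - start
    if target_length < current:
        raise ValueError("target_length must be at least the current length")
    extra = target_length - current
    left = extra // 2
    right = extra - left
    return start - left, end + right


def clamp_with_padding(start, end, chrom_length):
    """Clamp an interval to [0, chrom_length) and report the padding needed.

    Returns (start, end, pad_left, pad_right), where the pads are the number
    of positions that fell outside the chromosome and must be filled in
    (for example with N bases) to keep the sequence length fixed.
    """
    pad_left = max(0, -start)
    pad_right = max(0, end - chrom_length)
    return max(0, start), min(end, chrom_length), pad_left, pad_right


# Each side gains (196,608 - 131,072) / 2 = 32,768 bp of flanking sequence.
new_start, new_end = extend_interval(
    100_000, 100_000 + BASENJI_SEQ_LEN, ENFORMER_SEQ_LEN
)
```

Because the extension is symmetric, the center of each interval is unchanged, so the existing targets remain aligned with the middle of the longer input.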

Citations

@article {Avsec2021.04.07.438649,
    author  = {Avsec, {\v Z}iga and Agarwal, Vikram and Visentin, Daniel and Ledsam, Joseph R. and Grabska-Barwinska, Agnieszka and Taylor, Kyle R. and Assael, Yannis and Jumper, John and Kohli, Pushmeet and Kelley, David R.},
    title   = {Effective gene expression prediction from sequence by integrating long-range interactions},
    elocation-id = {2021.04.07.438649},
    year    = {2021},
    doi     = {10.1101/2021.04.07.438649},
    publisher = {Cold Spring Harbor Laboratory},
    URL     = {https://www.biorxiv.org/content/early/2021/04/08/2021.04.07.438649},
    eprint  = {https://www.biorxiv.org/content/early/2021/04/08/2021.04.07.438649.full.pdf},
    journal = {bioRxiv}
}

Owner

Phil Wang