Evolving Neural Networks in JAX

This repository holds code demonstrating techniques for applying evolutionary training strategies to neural networks in JAX. Each script trains a network to solve the same problem: given a sequence of regularly spaced values on a sine wave, predict the next value. The problem is trivial; the interesting part is how it is solved, by updating network parameters directly, without gradient calculations, and in parallel across devices. A lengthy tutorial is included that explains the ideas and rationale. Much of the code is duplicated between scripts so that readers can run each one individually and, if they like, diff the files to see what changes in each section.
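To make the task concrete, here is a minimal sketch of how such a dataset could be built. The function name `make_sine_dataset` and the values of `num_points`, `window`, and `step` are assumptions chosen for illustration; the scripts in this repository may construct their data differently.

```python
import jax.numpy as jnp

def make_sine_dataset(num_points=200, window=8, step=0.1):
    """Slice one sine wave into (window of values, next value) pairs."""
    wave = jnp.sin(jnp.arange(num_points) * step)
    # Row i of `idx` selects wave[i], ..., wave[i + window - 1].
    idx = jnp.arange(num_points - window)[:, None] + jnp.arange(window)[None, :]
    inputs = wave[idx]        # shape (num_points - window, window)
    targets = wave[window:]   # the value that follows each window
    return inputs, targets

inputs, targets = make_sine_dataset()
```

Each row of `inputs` is one training sequence, and the matching entry of `targets` is the value the network should predict.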

The evolutionary ideas here are taken mainly from OpenAI's blog post describing their efforts at scaling evolution strategies (and from the associated code).

tutorial.md

A long-form tutorial that explains why I think evolutionary optimization strategies are interesting and covers some of the JAX techniques that I use to implement them. Individual pieces of the code in each script file are discussed here.

simple.py

This file implements a very basic evolutionary strategy, without many optimizations. It shows how fundamental JAX functions like scan and vmap are used to execute the training routine.
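As a rough illustration of how vmap and scan can fit together in a gradient-free training loop, consider the sketch below. It is not the code in simple.py: the linear model, the names `fitness`, `es_step`, and `train`, and the hyperparameters `pop_size`, `sigma`, and `lr` are all assumptions made to keep the example short.

```python
import jax
import jax.numpy as jnp

# Toy data (assumed): windows of 8 sine values and the value that follows each.
wave = jnp.sin(jnp.arange(200) * 0.1)
idx = jnp.arange(192)[:, None] + jnp.arange(8)[None, :]
inputs, targets = wave[idx], wave[8:]

def fitness(theta, inputs, targets):
    # A linear model stands in for the network: 8 weights plus 1 bias.
    preds = inputs @ theta[:-1] + theta[-1]
    # Higher is better, so return the negative mean squared error.
    return -jnp.mean((preds - targets) ** 2)

def es_step(theta, key, pop_size=64, sigma=0.1, lr=0.005):
    noise = jax.random.normal(key, (pop_size, theta.shape[0]))
    # vmap scores every perturbed copy of theta in one batched call.
    scores = jax.vmap(lambda eps: fitness(theta + sigma * eps, inputs, targets))(noise)
    # Normalize the scores, then move theta toward perturbations that did well.
    advantages = (scores - scores.mean()) / (scores.std() + 1e-8)
    update = noise.T @ advantages / (pop_size * sigma)
    return theta + lr * update, scores.mean()

def train(theta, key, num_steps=200):
    # scan runs the whole loop inside one compiled computation,
    # threading theta through as the carry and consuming one key per step.
    keys = jax.random.split(key, num_steps)
    return jax.lax.scan(es_step, theta, keys)

theta0 = jnp.zeros(9)
theta, mean_scores = train(theta0, jax.random.PRNGKey(0))
```

Note that no call to `jax.grad` appears anywhere: the update is assembled purely from the scores of randomly perturbed parameters.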

advanced.py

Here, some of the optimizations that OpenAI made in their code are added to the training routine. Each optimization is discussed in depth in tutorial.md.
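This README does not list which optimizations advanced.py adopts. Two that appear in OpenAI's evolution strategies work are mirrored (antithetic) sampling and centered-rank fitness shaping, so the sketch below assumes those two purely for illustration. The function names and the `half_pop` and `sigma` parameters are hypothetical.

```python
import jax
import jax.numpy as jnp

def centered_ranks(scores):
    """Replace raw scores with evenly spaced values in [-0.5, 0.5] by rank."""
    ranks = jnp.argsort(jnp.argsort(scores))  # 0 for the worst score, n - 1 for the best
    return ranks / (scores.shape[0] - 1) - 0.5

def antithetic_update(theta, key, fitness_fn, half_pop=32, sigma=0.1):
    eps = jax.random.normal(key, (half_pop, theta.shape[0]))
    # Evaluate each perturbation together with its mirror image.
    plus = jax.vmap(lambda e: fitness_fn(theta + sigma * e))(eps)
    minus = jax.vmap(lambda e: fitness_fn(theta - sigma * e))(eps)
    # Rank all 2 * half_pop scores jointly so outliers cannot dominate the update.
    shaped = centered_ranks(jnp.concatenate([plus, minus]))
    weights = shaped[:half_pop] - shaped[half_pop:]
    return eps.T @ weights / (2 * half_pop * sigma)

# Toy objective (assumed): fitness peaks when every parameter equals 1.
toy_fitness = lambda t: -jnp.sum((t - 1.0) ** 2)
update = antithetic_update(jnp.zeros(4), jax.random.PRNGKey(0), toy_fitness)
```

Because each noise vector is used twice with opposite signs, only half as many random samples are needed, and rank shaping makes the update insensitive to the scale of the fitness values.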

parallel.py

In this file, we prepare to scale the network to larger sizes and to more than one device. Vectorization becomes parallelization, and the code is split up so that network updates can be calculated on a single device.
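One way the step from vectorization to parallelization can look in JAX is to keep vmap within each device and wrap the per-device work in pmap. The sketch below is an assumption about the general shape of such code, not a copy of parallel.py: it averages per-device estimates with `jax.lax.pmean`, which may differ from how the script combines results, and the names `device_update` and `per_device` are hypothetical.

```python
import jax
import jax.numpy as jnp

n_dev = jax.local_device_count()

def fitness(theta):
    # Toy objective (assumed): best when every parameter equals 1.
    return -jnp.sum((theta - 1.0) ** 2)

def device_update(theta, key, sigma=0.1, per_device=16):
    # Each device draws and scores its own slice of the population with vmap.
    eps = jax.random.normal(key, (per_device, theta.shape[0]))
    scores = jax.vmap(lambda e: fitness(theta + sigma * e))(eps)
    local = eps.T @ (scores - scores.mean()) / (per_device * sigma)
    # Average the per-device estimates so every device holds the same update.
    return jax.lax.pmean(local, axis_name="devices")

# theta is broadcast to every device (in_axes=None); each device gets its own key.
parallel_update = jax.pmap(device_update, axis_name="devices", in_axes=(None, 0))

theta = jnp.zeros(4)
keys = jax.random.split(jax.random.PRNGKey(0), n_dev)
updates = parallel_update(theta, keys)  # shape (n_dev, 4), identical rows
```

This runs on a single CPU as well, in which case `n_dev` is 1 and the pmap axis has length one.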

Owner

Trevor Thackston
