PyTorch implementation for the visual prior component (i.e. perception module) of the Visually Grounded Physics Learner [Li et al., 2020].

Last update: Dec 29, 2022

Overview

VGPL-Visual-Prior

PyTorch implementation for the visual prior component (i.e. perception module) of the Visually Grounded Physics Learner (VGPL). Given visual observations, the visual prior proposes their corresponding particle representations, in the form of particle positions and groupings. Please see the following paper for more details.

Visual Grounding of Learned Physical Models

Yunzhu Li, Toru Lin*, Kexin Yi*, Daniel M. Bear, Daniel L. K. Yamins, Jiajun Wu, Joshua B. Tenenbaum, and Antonio Torralba

ICML 2020 [website] [paper] [video]

Demo

Input RGB videos and predictions from our learned model

Prerequisites

Python 3
PyTorch 1.0 or higher, with NVIDIA CUDA support
Other required packages in requirements.txt

Code overview

Helper files

config.py contains all configurations used for model training, model evaluation and output generation.

dataset.py contains helper functions for loading and standardizing data and related variables. Note that paths to data directories are specified in the _DATA_DIR variable in this file, not in config.py.

loss.py contains helper functions for calculating Chamfer loss in different settings (e.g. in a single frame, across a time sequence, etc.).

model.py implements the neural network model used for prediction.
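
As a rough illustration of the Chamfer loss mentioned for loss.py, the sketch below computes a symmetric Chamfer distance between predicted and ground-truth particle sets, first for a single frame and then across a time sequence. The function names, tensor layouts, and the use of unsquared Euclidean distance are assumptions made for this example; the repository's loss.py may differ in all of them.

```python
import torch


def chamfer_distance(pred: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    """Symmetric Chamfer distance between two batched point sets.

    pred:   (B, N, D) predicted particle positions (assumed layout)
    target: (B, M, D) reference particle positions (assumed layout)
    """
    # Pairwise Euclidean distances between every predicted and target point: (B, N, M).
    dist = torch.cdist(pred, target, p=2)
    # For each predicted point, the distance to its nearest target point.
    pred_to_target = dist.min(dim=2).values.mean(dim=1)
    # For each target point, the distance to its nearest predicted point.
    target_to_pred = dist.min(dim=1).values.mean(dim=1)
    # Sum both directions, then average over the batch.
    return (pred_to_target + target_to_pred).mean()


def chamfer_over_time(pred_seq: torch.Tensor, target_seq: torch.Tensor) -> torch.Tensor:
    """Chamfer distance averaged over all frames of a sequence.

    pred_seq:   (B, T, N, D)
    target_seq: (B, T, M, D)
    """
    # Fold the time axis into the batch axis so each frame is compared independently.
    return chamfer_distance(pred_seq.flatten(0, 1), target_seq.flatten(0, 1))
```

Because the distance is built from nearest-neighbour matches in both directions, it does not require the predicted and reference sets to have the same number of particles or the same ordering.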

Main files

The following files can be run directly; see the "Training and evaluation" section for more details.

train.py trains a model that converts input observations into their particle representations.

eval.py evaluates a trained model by visualizing its predictions and/or storing the output predictions in .h5 format.
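
Since eval.py can store predictions in .h5 format, one way to inspect such a file is to list the datasets it contains. The sketch below uses h5py, which is an assumption (it may or may not be the library listed in requirements.txt), and the helper name summarize_h5 is hypothetical. It does not assume any particular dataset names, because the layout of the output files is not documented here.

```python
import h5py


def summarize_h5(path):
    """Return {dataset_name: (shape, dtype)} for every dataset in an HDF5 file."""
    summary = {}

    def visit(name, obj):
        # visititems calls this for every group and dataset; keep datasets only.
        if isinstance(obj, h5py.Dataset):
            summary[name] = (obj.shape, str(obj.dtype))

    with h5py.File(path, "r") as f:
        f.visititems(visit)
    return summary
```

Calling summarize_h5 on an output file and printing the result shows which arrays were written and their shapes, which is usually enough to decide how to load them.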

Training and evaluation

Download the training and evaluation data from the following links, and put them in the data folder. Optionally, download our trained model checkpoints and put them in the dump folder.

MassRope [data(4.89GB)] [model]
RigidFall [data(4.87GB)] [model]

To train a model:

python train.py --set loss_type l2 dataset RigidFall

To debug (by overfitting the model on a small batch of data):

python train.py --set loss_type l2 dataset RigidFall debug True

To evaluate a trained model and generate outputs using our provided checkpoints:

python eval.py --set loss_type l2 dataset RigidFall n_frames 4 n_frames_eval 30 load_path dump/rigid_fall_4frame_l2.pth
python eval.py --set loss_type l2 dataset MassRope n_frames 4 n_frames_eval 30 load_path dump/mass_rope_4frame_l2.pth

See config.py for more details on customizable configurations.
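
The commands above pass configuration as flat key/value pairs after --set. The sketch below shows one way such overrides could be applied on top of default values. It is not taken from config.py: the parse_overrides helper and the default values are hypothetical, and only the key names are borrowed from the commands in this section.

```python
import argparse
import ast


def parse_overrides(pairs, defaults):
    """Apply a flat [key, value, key, value, ...] list on top of `defaults`."""
    if len(pairs) % 2 != 0:
        raise ValueError("--set expects an even number of arguments (key value ...)")
    config = dict(defaults)
    for key, raw in zip(pairs[0::2], pairs[1::2]):
        if key not in config:
            raise KeyError(f"unknown config key: {key}")
        try:
            # Interpret Python literals such as 4, 0.5, or True.
            value = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            # Anything else (e.g. RigidFall, a file path) stays a plain string.
            value = raw
        config[key] = value
    return config


# Hypothetical defaults; the real ones live in config.py.
DEFAULTS = {
    "loss_type": "l2",
    "dataset": "RigidFall",
    "n_frames": 4,
    "n_frames_eval": 30,
    "debug": False,
    "load_path": "",
}

parser = argparse.ArgumentParser()
parser.add_argument("--set", dest="overrides", nargs="*", default=[])

# Parse an explicit argument list here so the example runs without a shell.
args = parser.parse_args(["--set", "dataset", "MassRope", "n_frames", "4", "debug", "True"])
config = parse_overrides(args.overrides, DEFAULTS)
```

Rejecting unknown keys is a design choice in this sketch: it turns a mistyped option name into an immediate error instead of a silently ignored setting.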

Citing VGPL

If you find this codebase useful in your research, please consider citing:

@inproceedings{li2020visual,
    Title={Visual Grounding of Learned Physical Models},
    Author={Li, Yunzhu and Lin, Toru and Yi, Kexin and Bear, Daniel and Yamins, Daniel L.K. and Wu, Jiajun and Tenenbaum, Joshua B. and Torralba, Antonio},
    Booktitle={ICML},
    Year={2020}
}

@inproceedings{li2019learning,
    Title={Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids},
    Author={Li, Yunzhu and Wu, Jiajun and Tedrake, Russ and Tenenbaum, Joshua B and Torralba, Antonio},
    Booktitle={ICLR},
    Year={2019}
}

Owner

Toru
