Implementation of FitVid video prediction model in JAX/Flax.

Last update: Nov 25, 2022

Related tags

Overview

FitVid Video Prediction Model

Implementation of FitVid video prediction model in JAX/Flax.

If you find this code useful, please cite it in your paper:

@article{babaeizadeh2021fitvid,
  title={FitVid: Overfitting in Pixel-Level Video Prediction},
  author= {Babaeizadeh, Mohammad and Saffar, Mohammad Taghi and Nair, Suraj 
  and Levine, Sergey and Finn, Chelsea and Erhan, Dumitru},
  journal={arXiv preprint arXiv:2106.13195},
  year={2020}
}

Method

FitVid is a new architecture for conditional variational video prediction. It has ~300 million parameters and can be trained with minimal training tricks.

Sample Videos

Human3.6M	RoboNet

For more samples please visit FitVid. website: https://sites.google.com/view/fitvidpaper

Instructions

Get dependencies:

pip3 install --user tensorflow
pip3 install --user tensorflow_addons
pip3 install --user flax
pip3 install --user ffmpeg

Train on RoboNet:

python -m fitvid.train  --output_dir /tmp/output

Disclaimer: Not an official Google product.

Owner

Google Research

GitHub Repository

Unofficial Implementation of Oboe (SIGCOMM'18').

Oboe-Reproduce This is the unofficial implementation of the paper "Oboe: Auto-tuning video ABR algorithms to network conditions, Zahaib Akhtar, Yun Se

13 Nov 04, 2022

Diffusion Normalizing Flow (DiffFlow) Neurips2021

Diffusion Normalizing Flow (DiffFlow) Reproduce setup environment The repo heavily depends on jam, a personal toolbox developed by Qsh.zh. The API may

76 Jan 01, 2023

Second-Order Neural ODE Optimizer, NeurIPS 2021 spotlight

Second-order Neural ODE Optimizer (NeurIPS 2021 Spotlight) [arXiv] ✔️ faster convergence in wall-clock time | ✔️ O(1) memory cost | ✔️ better test-tim

39 Oct 22, 2022

Code for our method RePRI for Few-Shot Segmentation. Paper at http://arxiv.org/abs/2012.06166

Region Proportion Regularized Inference (RePRI) for Few-Shot Segmentation In this repo, we provide the code for our paper : "Few-Shot Segmentation Wit

138 Dec 12, 2022

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

simplified_mediapipe_face_landmarks Extracts essential Mediapipe face landmarks and arranges them in a sequenced order. The default 478 Mediapipe face

13 Oct 04, 2022

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

Awesome-Visual-Captioning Table of Contents ACL-2021 CVPR-2021 AAAI-2021 ACMMM-2020 NeurIPS-2020 ECCV-2020 CVPR-2020 ACL-2020 AAAI-2020 ACL-2019 NeurI

362 Jan 03, 2023

Official implementation of Protected Attribute Suppression System, ICCV 2021

6 Jan 01, 2023

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

UNION Automatic Evaluation Metric described in the paper UNION: An UNreferenced MetrIc for Evaluating Open-eNded Story Generation (EMNLP 2020). Please

50 Dec 30, 2022

Official Implementation of SWAGAN: A Style-based Wavelet-driven Generative Model

Official Implementation of SWAGAN: A Style-based Wavelet-driven Generative Model SWAGAN: A Style-based Wavelet-driven Generative Model Rinon Gal, Dana

55 Dec 06, 2022

A Broader Picture of Random-walk Based Graph Embedding

Random-walk Embedding Framework This repository is a reference implementation of the random-walk embedding framework as described in the paper: A Broa

23 Dec 13, 2022

OMAMO: orthology-based model organism selection

OMAMO: orthology-based model organism selection OMAMO is a tool that suggests the best model organism to study a biological process based on orthologo

5 Apr 22, 2022

Soft actor-critic is a deep reinforcement learning framework for training maximum entropy policies in continuous domains.

This repository is no longer maintained. Please use our new Softlearning package instead. Soft Actor-Critic Soft actor-critic is a deep reinforcement

752 Jan 07, 2023

Implementation of FitVid video prediction model in JAX/Flax.

Related tags

Overview

FitVid Video Prediction Model

Method

Sample Videos

Instructions

Owner

Google Research

Unofficial Implementation of Oboe (SIGCOMM'18').

Diffusion Normalizing Flow (DiffFlow) Neurips2021

Second-Order Neural ODE Optimizer, NeurIPS 2021 spotlight

Code for our method RePRI for Few-Shot Segmentation. Paper at http://arxiv.org/abs/2012.06166

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

Official implementation of Protected Attribute Suppression System, ICCV 2021

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Official Implementation of SWAGAN: A Style-based Wavelet-driven Generative Model

A Broader Picture of Random-walk Based Graph Embedding

OMAMO: orthology-based model organism selection

Soft actor-critic is a deep reinforcement learning framework for training maximum entropy policies in continuous domains.

Code for PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

EfficientNetV2-with-TPU - Cifar-10 case study

An Open Source Machine Learning Framework for Everyone

Code for our ALiBi method for transformer language models.

RobustVideoMatting and background composing in one model by using onnxruntime.

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled