[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Last update: Dec 30, 2022

Overview

[CVPR2022] Thin-Plate Spline Motion Model for Image Animation

Source code of the CVPR'2022 paper "Thin-Plate Spline Motion Model for Image Animation"

Example animation

PS: The paper trains the model for 100 epochs for a fair comparison. You can use more data and train for more epochs to get better performance.

Web demo for animation

Try the web demo for animation here:
Google Colab:

Pre-trained models

Installation

We support python3.(Recommended version is Python 3.9). To install the dependencies run:

pip install -r requirements.txt

YAML configs

There are several configuration files one for each dataset in the config folder named as config/dataset_name.yaml.

See description of the parameters in the config/taichi-256.yaml.

Datasets

MGif. Follow Monkey-Net.
TaiChiHD and VoxCeleb. Follow instructions from video-preprocessing.
TED-talks. Follow instructions from MRAA.

Training

To train a model on specific dataset run:

CUDA_VISIBLE_DEVICES=0,1 python run.py --config config/dataset_name.yaml --device_ids 0,1

A log folder named after the timestamp will be created. Checkpoints, loss values, reconstruction results will be saved to this folder.

Training AVD network

To train a model on specific dataset run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode train_avd --checkpoint '{checkpoint_folder}/checkpoint.pth.tar' --config config/dataset_name.yaml

Checkpoints, loss values, reconstruction results will be saved to {checkpoint_folder}.

Evaluation on video reconstruction

To evaluate the reconstruction performance run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode reconstruction --config config/dataset_name.yaml --checkpoint '{checkpoint_folder}/checkpoint.pth.tar'

The reconstruction subfolder will be created in {checkpoint_folder}. The generated video will be stored to this folder, also generated videos will be stored in png subfolder in loss-less '.png' format for evaluation. To compute metrics, follow instructions from pose-evaluation.

Image animation demo

notebook: demo.ipynb, edit the config cell and run for image animation.
python:

CUDA_VISIBLE_DEVICES=0 python demo.py --config config/vox-256.yaml --checkpoint checkpoints/vox.pth.tar --source_image ./source.jpg --driving_video ./driving.mp4

Acknowledgments

The main code is based upon FOMM and MRAA

Thanks for the excellent works!

Thanks iperov, this work has been integrated in DeepFaceLive

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Related tags

Overview

[CVPR2022] Thin-Plate Spline Motion Model for Image Animation

Example animation

Web demo for animation

Pre-trained models

Installation

YAML configs

Datasets

Training

Training AVD network

Evaluation on video reconstruction

Image animation demo

Acknowledgments

Owner

yoyo-nb

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 short.

Code for all the Advent of Code'21 challenges mostly written in python

Unofficial Tensorflow 2 implementation of the paper Implicit Neural Representations with Periodic Activation Functions

This framework implements the data poisoning method found in the paper Adversarial Examples Make Strong Poisons

[CVPR 2022 Oral] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Moving Object Segmentation in 3D LiDAR Data: A Learning-based Approach Exploiting Sequential Data

Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization".

[CVPR 2021] Forecasting the panoptic segmentation of future video frames

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

A deep learning framework for historical document image analysis

Learning Lightweight Low-Light Enhancement Network using Pseudo Well-Exposed Images

Memory efficient transducer loss computation

Deploy pytorch classification model using Flask and Streamlit

GPU-Accelerated Deep Learning Library in Python

Code for unmixing audio signals in four different stems "drums, bass, vocals, others". The code is adapted from "Jukebox: A Generative Model for Music"

Snscrape-jsonl-urls-extractor - Extracts urls from jsonl produced by snscrape

Subpopulation detection in high-dimensional single-cell data

Like a cowsay but without cows!

rastrainer is a QGIS plugin to training remote sensing semantic segmentation model based on PaddlePaddle.