Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Last update: Oct 12, 2022

Related tags

Deep Learning deep-3dmask

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Kai-En Lin¹, Lei Xiao², Feng Liu², Guowei Yang¹, Ravi Ramamoorthi¹

¹University of California, San Diego, ²Facebook Reality Labs

Requirements

Install required packages

Make sure you have up-to-date NVIDIA drivers supporting CUDA 11.1 (10.2 could work but need to change cudatoolkit package accordingly)

Run

conda env create -f environment.yml
conda activate video_viewsynth

Usage

Rendering

Download our pretrained checkpoint and testing data. Extract the content to [path_to_data_directory]. It contains frames and background folders, as well as poses_bounds.npy.
In configs, setup data path by changing render_video.txt

root_dir should point to the frames folder mentioned in 1. and bg_dir should point to background folder.

out_dir can be your desired output folder.

ckpt_path should be the pretrained checkpoint path.
Run python render_llff_video.py --config [config_file_path]

e.g. python render_llff_video.py --config ../configs/render_video.txt

(Optional) For your own data, please run prepare_data.sh

sh render.sh [frame_folder] [starting_frame] [ending_frame] [output_folder_name]

Make sure your data is in this structure before running
```
[frame_folder] --- cam00 --- 00000.jpg
                |         |- 00001.jpg
                |         ...
                |- cam01
                |- cam02
                ...
                |- poses_bounds.npy
```
e.g. sh render.sh ~/deep_3d_data/frames 0 20 qual

Training

Train MPI

Download RealEstate10K dataset and extract the frames. There are scripts in preprocessing folder which can be used to generate the data.

The order should be download_data.py -> extract_frames.py -> compress_data.py.

Remember to change the path in compress_data.py.
Change the paths in config file train_realestate10k.txt

Run

cd train_mpi
python train.py --config ../configs/train_realestate10k.txt

Train Mask

Once MPI is trained, we can use the checkpoint to train 3D mask network.

Download dataset
Change the paths in config file train_mask.txt

Run

cd train_mask
python train.py --config ../configs/train_mask.txt

Citation

@inproceedings {lin2021deep,
    title = {Deep 3D Mask Volume for View Synthesis of Dynamic Scenes},
    author = {Kai-En Lin and Lei Xiao and Feng Liu and Guowei Yang and Ravi Ramamoorthi},
    booktitle = {ICCV},
    year = {2021},
}

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Related tags

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Requirements

Install required packages

Usage

Rendering

Training

Train MPI

Train Mask

Citation

Owner

Ken Lin

GLANet - The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

Relative Positional Encoding for Transformers with Linear Complexity

TAug :: Time Series Data Augmentation using Deep Generative Models

a reimplementation of LiteFlowNet in PyTorch that matches the official Caffe version

Heart Arrhythmia Classification

3D-Transformer: Molecular Representation with Transformer in 3D Space

Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

Codebase for ECCV18 "The Sound of Pixels"

Toolbox of models, callbacks, and datasets for AI/ML researchers.

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.

This is an official PyTorch implementation of Task-Adaptive Neural Network Search with Meta-Contrastive Learning (NeurIPS 2021, Spotlight).

Interpolation-based reduced-order models

Unsupervised Representation Learning by Invariance Propagation

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

Official PyTorch implementation for paper "Efficient Two-Stage Detection of Human–Object Interactions with a Novel Unary–Pairwise Transformer"

Deep Learning ❤️ OneFlow