NeRViS: Neural Re-rendering for Full-frame Video Stabilization

Last update: Jun 17, 2022

Related tags

Deep Learning NeRViS

Overview

NeRViS: Neural Re-rendering for Full-frame Video Stabilization

Project Page | Video | Paper | Google Colab

Setup

Setup environment for [Yu and Ramamoorthi 2020].

cd CVPR2020CODE_yulunliu_modified
conda create --name NeRViS_CVPR2020 python=3.6
conda activate NeRViS_CVPR2020
pip install -r requirements_CVPR2020.txt
./install.sh

Download pre-trained checkpoints of [Yu and Ramamoorthi 2020].

wget https://www.cmlab.csie.ntu.edu.tw/~yulunliu/NeRViS/CVPR2020_ckpts.zip
unzip CVPR2020_ckpts.zip
cd ..

Setup environment for NeRViS.

conda deactivate
conda create --name NeRViS python=3.6
conda activate NeRViS
conda install pytorch=1.6.0 torchvision=0.7.0 cudatoolkit=10.1 -c pytorch
conda install matplotlib
conda install tensorboard
conda install scipy
conda install opencv
conda install -c conda-forge cupy cudatoolkit=10.1
pip install PyMaxflow

Running code

Calculate smoothed flow using [Yu and Ramamoorthi 2020].

conda activate NeRViS_CVPR2020
cd CVPR2020CODE_yulunliu_modified
python main.py [input_frames_path] [output_frames_path] [output_warping_field_path]

e.g.

python main.py ../../NUS/Crowd/0/ NUS_results/Crowd/0/ CVPR2020_warping_field/

Run NeRViS video stabilization.

conda deactivate
conda activate NeRViS
cd ..
python run_NeRViS.py --load [model_checkpoint_path] --input_frames_path [input_frames_path] --warping_field_path [warping_field_path] --output_path [output_frames_path] --temporal_width [temporal_width] --temporal_step [temporal_step]

e.g.

python run_NeRViS.py --load NeRViS_model/checkpoint/model_epoch050.pth --input_frames_path ../NUS/Crowd/0/ --warping_field_path CVPR2020CODE_yulunliu_modified/CVPR2020_warping_field/ --output_path output/ --temporal_width 41 --temporal_step 4

Citation

@inproceedings{Liu-NeRViS-2021,
    author    = {Liu, Yu-Lun and Lai, Wei-Sheng and Yang, Ming-Hsuan and Chuang, Yung-Yu and Huang, Jia-Bin}, 
    title     = {Neural Re-rendering for Full-frame Video Stabilization}, 
    journal   = {arXiv preprint},
    year      = {2021}
}

Acknowledgements

Parts of the code were based on from AdaCoF-pytorch. Some functions are borrowed from softmax-splatting, RAFT, and [Yu and Ramamoorthi 2020]

NeRViS: Neural Re-rendering for Full-frame Video Stabilization

Related tags

Overview

NeRViS: Neural Re-rendering for Full-frame Video Stabilization

Project Page | Video | Paper | Google Colab

Setup

Running code

Citation

Acknowledgements

Owner

Yu-Lun Liu

A motion tracking system for any arbitaray points in a video frame.

MegEngine implementation of YOLOX

Mixed Transformer UNet for Medical Image Segmentation

Lite-HRNet: A Lightweight High-Resolution Network

style mixing for animation face

SAS output to EXCEL converter for Cornell/MIT Language and acquisition lab

This is the codebase for the ICLR 2021 paper Trajectory Prediction using Equivariant Continuous Convolution

Breast cancer is been classified into benign tumour and malignant tumour.

LegoDNN: a block-grained scaling tool for mobile vision systems

DeLag: Detecting Latency Degradation Patterns in Service-based Systems

Code for the Shortformer model, from the paper by Ofir Press, Noah A. Smith and Mike Lewis.

[ICRA 2022] An opensource framework for cooperative detection. Official implementation for OPV2V.

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification

This repository stores the code to reproduce the results published in "TiWS-iForest: Isolation Forest in Weakly Supervised and Tiny ML scenarios"

This repository contains the DendroMap implementation for scalable and interactive exploration of image datasets in machine learning.

Use of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation

Transfer Learning for Pose Estimation of Illustrated Characters

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

MassiveSumm: a very large-scale, very multilingual, news summarisation dataset

Post-Training Quantization for Vision transformers.