an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Last update: Dec 22, 2022

Overview

revisiting-sepconv

This is a reference implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation [1] using PyTorch. Given two frames, it will make use of adaptive convolution [2] in a separable manner [3] to interpolate the intermediate frame. Should you be making use of our work, please cite our paper [1].

For the original SepConv, see: https://github.com/sniklaus/sepconv-slomo
For softmax splatting, please see: https://github.com/sniklaus/softmax-splatting

setup

The separable convolution layer is implemented in CUDA using CuPy, which is why CuPy is a required dependency. It can be installed using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

If you plan to process videos, then please also make sure to have pip install moviepy installed.

usage

To run it on your own pair of frames, use the following command.

python run.py --model paper --one ./images/one.png --two ./images/two.png --out ./out.png

To run in on a video, use the following command.

python run.py --model paper --video ./videos/car-turn.mp4 --out ./out.mp4

For a quick benchmark using examples from the Middlebury benchmark for optical flow, run python benchmark.py. You can use it to easily verify that the provided implementation runs as expected.

video

license

Please refer to the appropriate file within this repository.

references

[1]  @inproceedings{Niklaus_WACV_2021,
         author = {Simon Niklaus and Long Mai and Oliver Wang},
         title = {Revisiting Adaptive Convolutions for Video Frame Interpolation},
         booktitle = {IEEE Winter Conference on Applications of Computer Vision},
         year = {2021}
     }

[2]  @inproceedings{Niklaus_ICCV_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Separable Convolution},
         booktitle = {IEEE International Conference on Computer Vision},
         year = {2017}
     }

[3]  @inproceedings{Niklaus_CVPR_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Convolution},
         booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
         year = {2017}
     }

an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Related tags

Overview

revisiting-sepconv

setup

usage

video

license

references

Owner

Simon Niklaus

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

Code for pre-training CharacterBERT models (as well as BERT models).

Knowledge Management for Humans using Machine Learning & Tags

LaneDetectionAndLaneKeeping - Lane Detection And Lane Keeping

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

A tight inclusion function for continuous collision detection

Leaderboard and Visualization for RLCard

Deploy tensorflow graphs for fast evaluation and export to tensorflow-less environments running numpy.

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

Tensorflow implementation of "Learning Deconvolution Network for Semantic Segmentation"

PyMatting: A Python Library for Alpha Matting

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

Code release for NeRF (Neural Radiance Fields)

Self-Supervised Learning for Domain Adaptation on Point-Clouds

PyTorch implementation of "Optimization Planning for 3D ConvNets"

Fast RFC3339 compliant Python date-time library

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Using a Seq2Seq RNN architecture via TensorFlow to predict future Bitcoin prices