VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

Last update: Nov 13, 2022

VSR-Transformer

By Jiezhang Cao, Yawei Li, Kai Zhang, Luc Van Gool

This paper proposes a new Transformer for video super-resolution (called VSR-Transformer). The VSR-Transformer block contains a spatial-temporal convolutional self-attention layer and a bidirectional optical flow-based feed-forward layer, and it is designed to improve VSR performance. This repository is the official implementation of "Video Super-Resolution Transformer".
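To make the idea of attending across space and time concrete, here is a minimal sketch of plain scaled dot-product self-attention over tokens pooled from several frames. It is not the layer implemented in this repository: it assumes identity query/key/value projections instead of convolutional ones, it omits the optical flow-based feed-forward layer entirely, and the names `spatio_temporal_tokens` and `self_attention` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def spatio_temporal_tokens(frames):
    """Flatten frames[t][p] (feature vector of patch p in frame t) into one token list."""
    return [patch for frame in frames for patch in frame]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections (an assumption).

    Every token attends to every other token, so a patch in one frame can
    draw on patches from all frames.
    """
    dim = len(tokens[0])
    scale = 1.0 / math.sqrt(dim)
    output = []
    for query in tokens:
        scores = [scale * sum(q * k for q, k in zip(query, key)) for key in tokens]
        weights = softmax(scores)
        output.append(
            [sum(w * value[d] for w, value in zip(weights, tokens)) for d in range(dim)]
        )
    return output


# Two frames, two patches each, two feature channels per patch (toy values).
frames = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 0.0], [0.0, 1.0]],
]
attended = self_attention(spatio_temporal_tokens(frames))
```

Each output vector is a weighted average of all input tokens, which is why matching patches in different frames end up with the same result in this toy example.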

Dependencies and Installation

Python >= 3.7 (Anaconda or Miniconda recommended)
PyTorch >= 1.3
NVIDIA GPU + CUDA
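The requirements above can be checked before installing anything else. The following is a small sketch, not part of the repository; `check_environment` and `parse_version` are hypothetical helper names, and the version thresholds are copied from the list above. PyTorch is imported only if it is installed, so the script still runs on a fresh environment.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 7)  # from the requirement list
MIN_TORCH = (1, 3)   # from the requirement list


def parse_version(text):
    """Turn a version string such as '1.13.1+cu117' into a tuple like (1, 13)."""
    parts = []
    for piece in text.split("+")[0].split(".")[:2]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


def check_environment():
    """Return requirement name -> True/False, or None when it cannot be checked."""
    report = {
        "python": sys.version_info[:2] >= MIN_PYTHON,
        "torch": None,
        "cuda": None,
    }
    if importlib.util.find_spec("torch") is None:
        report["torch"] = False
        return report
    import torch  # lazy import so the check works without PyTorch

    report["torch"] = parse_version(torch.__version__) >= MIN_TORCH
    report["cuda"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {ok}")
```

A `cuda` value of `None` means PyTorch was not found, so GPU availability could not be queried.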

Clone repository

```
git clone https://github.com/caojiezhang/VSR-Transformer.git
```

Install dependent packages

```
cd VSR-Transformer
pip install -r requirements.txt
```

Compile environment
```
python setup.py develop
```

Dataset Preparation

Please refer to DatasetPreparation.md for more details.
The descriptions of currently supported datasets (torch.utils.data.Dataset classes) are in Datasets.md.

Training

Please refer to the training configuration for more details and pretrained models.

```
# Train on REDS
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/train.py -opt options/train/train_vsrTransformer_x4_REDS.yml --launcher pytorch

# Train on Vimeo-90K
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/train.py -opt options/train/train_vsrTransformer_x4_Vimeo.yml --launcher pytorch
```
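The commands above hard-code eight GPUs. As a rough sketch, the same command line can be assembled for a different GPU count; `build_launch_command` and `as_shell_line` are hypothetical helpers that are not part of this repository, and they only reuse the flags shown above. Whether the provided YAML options work unchanged with fewer GPUs is an assumption you should verify (batch size settings in particular may need adjusting).

```python
import shlex


def build_launch_command(script, config, num_gpus=8, master_port=4321):
    """Return (env, argv) mirroring the distributed launch commands in this README."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    # Expose GPUs 0..num_gpus-1, one process per GPU.
    env = {"CUDA_VISIBLE_DEVICES": ",".join(str(i) for i in range(num_gpus))}
    argv = [
        "python", "-m", "torch.distributed.launch",
        f"--nproc_per_node={num_gpus}",
        f"--master_port={master_port}",
        script, "-opt", config,
        "--launcher", "pytorch",
    ]
    return env, argv


def as_shell_line(env, argv):
    """Render the command as a single line that can be pasted into a shell."""
    prefix = " ".join(f"{key}={shlex.quote(value)}" for key, value in env.items())
    return prefix + " " + " ".join(shlex.quote(arg) for arg in argv)


env, argv = build_launch_command(
    "basicsr/train.py",
    "options/train/train_vsrTransformer_x4_REDS.yml",
    num_gpus=4,
)
print(as_shell_line(env, argv))
```

To run the command from Python instead of printing it, one option is `subprocess.run(argv, env={**os.environ, **env}, check=True)`.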

Testing

Please refer to the testing configuration for more details.

```
# Test on REDS
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/test.py -opt options/test/test_vsrTransformer_x4_REDS.yml --launcher pytorch

# Test on Vimeo-90K
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/test.py -opt options/test/test_vsrTransformer_x4_Vimeo.yml --launcher pytorch

# Test on Vid4
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/test.py -opt options/test/test_vsrTransformer_x4_Vid4.yml --launcher pytorch
```

Citation

If you use this code or our paper, please cite:

```
@article{cao2021vsrt,
  title={Video Super-Resolution Transformer},
  author={Cao, Jiezhang and Li, Yawei and Zhang, Kai and Van Gool, Luc},
  journal={arXiv},
  year={2021}
}
```

Acknowledgments

This repository is implemented based on BasicSR. If you use the repository, please consider citing BasicSR.
