Using image super resolution models with vapoursynth and speeding them up with TensorRT

Last update: Aug 23, 2022

Overview

vs-RealEsrganAnime-tensorrt-docker

Using image super resolution models with VapourSynth and speeding them up with TensorRT. A Docker image is provided because TensorRT is hard to install. In my tests on a 1070 Ti, TensorRT was ~70% faster than plain PyTorch at 480p. With the 2x model and 848x480 input, TensorRT ran at 0.517x realtime speed for 24 fps video.
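
To put those numbers in perspective, here is a small back-of-the-envelope sketch in Python. The function name `realtime_stats` is made up for illustration, and the PyTorch figure is only an estimate: it assumes the ~70% speedup applies to the same 2x model and 848x480 input, which was not measured separately here.

```python
def realtime_stats(speed_factor, source_fps, speedup_over_baseline):
    """Convert a realtime speed factor into frame rates.

    speed_factor: processing speed relative to playback (0.517 = 0.517x realtime)
    source_fps: frame rate of the video being upscaled
    speedup_over_baseline: fractional speedup of TensorRT over PyTorch (0.70 = 70%)
    """
    # Frames upscaled per second with TensorRT.
    processed_fps = speed_factor * source_fps
    # Estimated PyTorch frame rate, ASSUMING the speedup holds for this setup.
    baseline_fps = processed_fps / (1 + speedup_over_baseline)
    # Seconds of processing needed per second of video.
    seconds_per_video_second = 1 / speed_factor
    return processed_fps, baseline_fps, seconds_per_video_second


fps, base, cost = realtime_stats(0.517, 24, 0.70)
print(f"TensorRT: {fps:.1f} fps")            # TensorRT: 12.4 fps
print(f"PyTorch (estimated): {base:.1f} fps")  # PyTorch (estimated): 7.3 fps
print(f"{cost:.2f} s of processing per second of video")  # 1.93 s ...
```

In other words, a 24-minute episode would take roughly 46 minutes to upscale on this card.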

I had to use onnx/onnx-tensorrt instead of NVIDIA/Torch-TensorRT because of conversion errors with PyTorch. The only disadvantage should be that a new ONNX model has to be created for each input resolution, which takes a bit of time.
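
Since each input resolution needs its own ONNX file, one way to avoid repeating the conversion is to store the exported model under a name that includes the scale and resolution, and only export when that file is missing. The sketch below is not what this repo does; `onnx_path_for`, `needs_export`, the `models` directory and the file naming scheme are all hypothetical.

```python
from pathlib import Path


def onnx_path_for(model_dir, scale, width, height):
    """Return the (hypothetical) file name for a fixed-shape ONNX export."""
    return Path(model_dir) / f"realesrgan_{scale}x_{width}x{height}.onnx"


def needs_export(model_dir, scale, width, height):
    """True if no ONNX file exists yet for this scale and input resolution."""
    return not onnx_path_for(model_dir, scale, width, height).exists()


# Example: 2x model with 848x480 input, as in the benchmark above.
target = onnx_path_for("models", 2, 848, 480)
print(target.name)  # realesrgan_2x_848x480.onnx
if needs_export("models", 2, 848, 480):
    print("no cached model for this resolution, a new export is required")
```

A different resolution such as 1280x720 maps to a different file name, so it would trigger a fresh export instead of reusing a model built for another input size.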

This repo uses a lot of code from HolyWu/vs-realesrgan and xinntao/Real-ESRGAN. The models are from here.

Usage:

# install docker (command for Arch Linux)
yay -S docker nvidia-docker nvidia-container-toolkit
# put the Dockerfile in a directory and run this command inside that directory
docker build -t realsr_tensorrt:latest .
# run with a mounted folder
docker run --privileged --gpus all -it --rm -v /home/Desktop/tensorrt:/workspace/tensorrt realsr_tensorrt:latest
# inside the container you can use it in various ways, for example with ffmpeg
vspipe --y4m inference.py - | ffmpeg -i pipe: example.mkv
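
If you would rather drive the same pipe from a Python script (for example to process several files), a minimal sketch could look like this. It assumes `vspipe` and `ffmpeg` are on the PATH inside the container and reuses exactly the flags from the command above; the helper names `build_commands` and `run_pipeline` are made up for this example.

```python
import subprocess


def build_commands(script="inference.py", output="example.mkv"):
    """Build the two halves of: vspipe --y4m <script> - | ffmpeg -i pipe: <output>"""
    vspipe_cmd = ["vspipe", "--y4m", script, "-"]
    ffmpeg_cmd = ["ffmpeg", "-i", "pipe:", output]
    return vspipe_cmd, ffmpeg_cmd


def run_pipeline(script="inference.py", output="example.mkv"):
    """Run vspipe and feed its stdout into ffmpeg's stdin."""
    vspipe_cmd, ffmpeg_cmd = build_commands(script, output)
    producer = subprocess.Popen(vspipe_cmd, stdout=subprocess.PIPE)
    consumer = subprocess.Popen(ffmpeg_cmd, stdin=producer.stdout)
    # Close our copy of the pipe so vspipe notices if ffmpeg exits early.
    producer.stdout.close()
    consumer_rc = consumer.wait()
    producer_rc = producer.wait()
    return producer_rc, consumer_rc
```

Calling `run_pipeline()` inside the container should behave like the shell one-liner; both return codes are 0 when the encode succeeds.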

If Docker does not start, try this first:

# fixing docker errors
systemctl start docker
sudo chmod 666 /var/run/docker.sock

If you don't want to use docker, vapoursynth install commands are here and a TensorRT example is here.

Set the input video path in inference.py and access videos with the mounted folder. You can also choose between the 4x and 2x model.

It is also possible to pipe the video directly into mpv. Change the mounted folder path to your own video folder and use the mpv dockerfile instead. Only tested on Manjaro.

yay -S pulseaudio

# I am not sure if this is needed, but go into the PulseAudio settings, check "make pulseaudio network audio devices discoverable in the local network" and reboot

# start docker
docker run --rm -i -t \
    --network host \
    -e DISPLAY \
    -v /home/Schreibtisch/test/:/home/mpv/media \
    --ipc=host \
    --privileged \
    --gpus all \
    -e PULSE_COOKIE=/run/pulse/cookie \
    -v ~/.config/pulse/cookie:/run/pulse/cookie \
    -e PULSE_SERVER=unix:${XDG_RUNTIME_DIR}/pulse/native \
    -v ${XDG_RUNTIME_DIR}/pulse/native:${XDG_RUNTIME_DIR}/pulse/native \
    realsr_tensorrt:latest
    
# run mpv
vspipe --y4m inference.py - | mpv -
