[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Last update: Dec 28, 2022

Related tags

Overview

TTSR

Official PyTorch implementation of the paper Learning Texture Transformer Network for Image Super-Resolution accepted in CVPR 2020.

Introduction
Requirements and dependencies
Model
Quick test
Dataset prepare
Evaluation
Train
Citation
Contact

Introduction

We proposed an approach named TTSR for RefSR task. Compared to SISR, RefSR has an extra high-resolution reference image whose textures can be utilized to help super-resolve low-resolution input.

Contribution

We are one of the first to introduce the transformer architecture into image generation tasks. More specifically, we propose a texture transformer with four closely-related modules for image SR which achieves significant improvements over SOTA approaches.
We propose a novel cross-scale feature integration module for image generation tasks which enables our approach to learn a more powerful feature representation by stacking multiple texture transformers.

Approach overview

Main results

Requirements and dependencies

python 3.7 (recommend to use Anaconda)
python packages: pip install opencv-python imageio
pytorch >= 1.1.0
torchvision >= 0.4.0

Model

Pre-trained models can be downloaded from onedrive, baidu cloud(0u6i), google drive.

TTSR-rec.pt: trained with only reconstruction loss
TTSR.pt: trained with all losses

Quick test

Clone this github repo

git clone https://github.com/FuzhiYang/TTSR.git
cd TTSR

Download pre-trained models and modify "model_path" in test.sh
Run test

sh test.sh

The results are in "save_dir" (default: ./test/demo/output)

Dataset prepare

Download CUFED train set and CUFED test set
Make dataset structure be:

CUFED
- train
  - input
  - ref
- test
  - CUFED5

Evaluation

Prepare CUFED dataset and modify "dataset_dir" in eval.sh
Download pre-trained models and modify "model_path" in eval.sh
Run evaluation

sh eval.sh

The results are in "save_dir" (default: ./eval/CUFED/TTSR)

Train

Prepare CUFED dataset and modify "dataset_dir" in train.sh
Run training

sh train.sh

The training results are in "save_dir" (default: ./train/CUFED/TTSR)

Citation

@InProceedings{yang2020learning,
author = {Yang, Fuzhi and Yang, Huan and Fu, Jianlong and Lu, Hongtao and Guo, Baining},
title = {Learning Texture Transformer Network for Image Super-Resolution},
booktitle = {CVPR},
year = {2020},
month = {June}
}

Contact

If you meet any problems, please describe them in issues or contact:

Fuzhi Yang: [email protected]

[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Related tags

Overview

TTSR

Contents

Introduction

Contribution

Approach overview

Main results

Requirements and dependencies

Model

Quick test

Dataset prepare

Evaluation

Train

Citation

Contact

Owner

Multimedia Research

PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud, CVPR 2019.

A hybrid SOTA solution of LiDAR panoptic segmentation with C++ implementations of point cloud clustering algorithms. ICCV21, Workshop on Traditional Computer Vision in the Age of Deep Learning

This repository contains the code for the binaural-detection model used in the publication arXiv:2111.04637

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Learning to Self-Train for Semi-Supervised Few-Shot

A stable algorithm for GAN training

A Pytorch implementation of "Manifold Matching via Deep Metric Learning for Generative Modeling" (ICCV 2021)

The Easy-to-use Dialogue Response Selection Toolkit for Researchers

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Graph Neural Networks with Keras and Tensorflow 2.

Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CCT)

Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".

Image Lowpoly based on Centroid Voronoi Diagram via python-opencv and taichi

LoL Runes Recommender With Python

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

[CVPR 2021] NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning

Multi-modal Vision Transformers Excel at Class-agnostic Object Detection

Official repository for ABC-GAN

Multi-Stage Spatial-Temporal Convolutional Neural Network (MS-GCN)

Personal project about genus-0 meshes, spherical harmonics and a cow