PyTorch Implementation of "Light Field Image Super-Resolution with Transformers"

Last update: Nov 28, 2022

Related tags

Deep Learning LFT

Overview

LFT

PyTorch implementation of "Light Field Image Super-Resolution with Transformers", arXiv 2021. [pdf].

Contributions:

We make the first attempt to adapt Transformers to LF image processing, and propose a Transformer-based network for LF image SR.
We propose a novel paradigm (i.e., angular and spatial Transformers) to incorporate angular and spatial information in an LF.
With a small model size and low computational cost, our LFT achieves superior SR performance than other state-of-the-art methods.

Codes and Models:

Requirement

PyTorch 1.3.0, torchvision 0.4.1. The code is tested with python=3.6, cuda=9.0.
Matlab (For training/test data generation and performance evaluation)

Datasets

We used the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and test. Please first download our dataset via Baidu Drive (key:7nzy) or OneDrive, and place the 5 datasets to the folder ./datasets/.

Train

Run Generate_Data_for_Training.m to generate training data. The generated data will be saved in ./data_for_train/ (SR_5x5_2x, SR_5x5_4x).

Run train.py to perform network training. Example for training LFT on 5x5 angular resolution for 4x/2xSR:

$ python train.py --model_name LFT --angRes 5 --scale_factor 4 --batch_size 4
$ python train.py --model_name LFT --angRes 5 --scale_factor 2 --batch_size 8

Checkpoint will be saved to ./log/.

Test

Run Generate_Data_for_Test.m to generate test data. The generated data will be saved in ./data_for_test/ (SR_5x5_2x, SR_5x5_4x).

Run test.py to perform network inference. Example for test LFT on 5x5 angular resolution for 4x/2xSR:

python test.py --model_name LFT --angRes 5 --scale_factor 4 \ 
--use_pre_pth True --path_pre_pth './pth/LFT_5x5_4x_epoch_50_model.pth

python test.py --model_name LFT --angRes 5 --scale_factor 2 \ 
--use_pre_pth True --path_pre_pth './pth/LFT_5x5_2x_epoch_50_model.pth

The PSNR and SSIM values of each dataset will be saved to ./log/.

Results:

Quantitative Results

Efficiency

Visual Comparisons

Angular Consistency

Spatial-Aware Angular Modeling

Citiation

If you find this work helpful, please consider citing:

@Article{LFT,
    author    = {Liang, Zhengyu and Wang, Yingqian and Wang, Longguang and Yang, Jungang and Zhou, Shilin},
    title     = {Light Field Image Super-Resolution with Transformers},
    journal   = {arXiv preprint},
    month     = {August},
    year      = {2021},   
}

Contact

Any question regarding this work can be addressed to [email protected].

PyTorch Implementation of "Light Field Image Super-Resolution with Transformers"

Related tags

Overview

LFT

PyTorch implementation of "Light Field Image Super-Resolution with Transformers", arXiv 2021. [pdf].

Contributions:

Codes and Models:

Requirement

Datasets

Train

Test

Results:

Citiation

Contact

Owner

Squidward

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

Python package provinding tools for artistic interactive applications using AI

Reverse engineering Rosetta 2 in M1 Mac

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

MatchGAN: A Self-supervised Semi-supervised Conditional Generative Adversarial Network

Reviatalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

How Effective is Incongruity? Implications for Code-mix Sarcasm Detection.

Source Code of NeurIPS21 paper: Recognizing Vector Graphics without Rasterization

Implementations of paper Controlling Directions Orthogonal to a Classifier

Official repository for the ISBI 2021 paper Transformer Assisted Convolutional Neural Network for Cell Instance Segmentation

A comprehensive and up-to-date developer education platform for Urbit.

Locationinfo - A script helps the user to show network information such as ip address

“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品

Tensorflow implementation of Character-Aware Neural Language Models.

OpenCVのGrabCut()を利用したセマンティックセグメンテーション向けアノテーションツール(Annotation tool using GrabCut() of OpenCV. It can be used to create datasets for semantic segmentation.)

Collect super-resolution related papers, data, repositories

Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.

https://sites.google.com/cornell.edu/recsys2021tutorial

Use deep learning, genetic programming and other methods to predict stock and market movements