Detail-Preserving Transformer for Light Field Image Super-Resolution

Last update: Jan 01, 2023

Related tags

Overview

DPT

Official Pytorch implementation of the paper "Detail-Preserving Transformer for Light Field Image Super-Resolution" accepted by AAAI 2022 .

Updates

2022.01: Our method is available at the newly-released repository BasicLFSR, an open-source and easy-to-use toolbox for LF image SR.
2022.01: The code is released.

Requirements

Python 3.7.7
Pytorch=1.5.0
torchvision=0.6.0
h5py=2.8.0
Matlab

Dataset

We use the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and testing. You can download the above dataset from Baidu Drive (key:912V).

Download the visual results

We share the super-resolved results generated by our DPT. Then, researchers can compare their methods to our DPT without performing inference. Results are available at Baidu Drive (key:912V).

Prepare the datasets

To generate the training data,

 Using Matlab to run `GenerateTrainingData.m`

To generate the testing data,

 Using Matlab to run `GenerateTestData.m`

We also provide the processed datasets we used in the paper. The processed datasets are avaliable at Baidu Drive (key:912V).

Train

To perform DPT training, please run

python train.py

Checkpoint will be saved to ./log/.

Test

To evaluate DPT performance, please run

python test.py

The performance of DPT on five datasets will be printed on the screen. The visual result of each scene will be saved in ./Results/. The PSNR and SSIM values of each scene will aslo be saved in ./PSNRSSIM/.

Generate visual results

To generate the visual super-resolved results,

Using Matlab to run `GenerateResultImages.m`

The '.mat' files in ./Results/ will be converted to '.png' images to ./SRimages/.

To generate the visual gradient results, please run

python generate_visual_gradient_map.py

Gradient results will be saved to ./GRAimages/.

Citation

If you find this work helpful, please consider citing the following paper:

@article{wang2022detail,
  title={Detail Preserving Transformer for Light Field Image Super-Resolution},
  author={Wang, Shunzhou and Zhou, Tianfei and Lu, Yao and Di, Huijun},
  journal={arXiv preprint arXiv:2201.00346},
  year={2022}
}

Acknowledgements

This code is heavily based on LF-DFNet. We also refer to the codes in VSR-Transformer, COLA-Net, and SPSR. We thank the authors for sharing the codes. We would like to thank Yingqian Wang for his help with LFSR. We would also like to thank Zhengyu Liang for adding our DPT to the repository BasicLFSR.

Contact

If you have any question about this work, feel free to concat with me via [email protected].

Detail-Preserving Transformer for Light Field Image Super-Resolution

Related tags

Overview

DPT

Updates

Requirements

Dataset

Download the visual results

Prepare the datasets

Train

Test

Generate visual results

Citation

Acknowledgements

Contact

Owner

Applying PVT to Semantic Segmentation

This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' published at ECIR'22.

💡 Learnergy is a Python library for energy-based machine learning models.

ML-Decoder: Scalable and Versatile Classification Head

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

SberSwap Video Swap base on deep learning

Generating Anime Images by Implementing Deep Convolutional Generative Adversarial Networks paper

Low-code/No-code approach for deep learning inference on devices

Invariant Causal Prediction for Block MDPs

This repository contains demos I made with the Transformers library by HuggingFace.

DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)

LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation (NeurIPS2021 Benchmark and Dataset Track)

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Automated Evidence Collection for Fake News Detection

[CVPR 2022] Official code for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration"

GeoTransformer - Geometric Transformer for Fast and Robust Point Cloud Registration

GLODISMO: Gradient-Based Learning of Discrete Structured Measurement Operators for Signal Recovery

Semantic-aware Grad-GAN for Virtual-to-Real Urban Scene Adaption

Code of U2Fusion: a unified unsupervised image fusion network for multiple image fusion tasks, including multi-modal, multi-exposure and multi-focus image fusion.