Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Last update: Sep 26, 2022

Related tags

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral) : Official Project Page

This repository provides the official PyTorch implementation of the following paper:

Unsupervised Image Denoising with Frequency Domain Knowledge

Nahyun Kim* (KAIST), Donggon Jang* (KAIST), Sunhyeok Lee (KAIST), Bomi Kim (KAIST), and Dae-Shik Kim (KAIST) (*The authors have equally contributed.)

BMVC 2021, Accepted as Oral Paper.

Abstract: Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on high-frequency bands, justifying the use of low-pass filters as part of conventional image preprocessing steps. However, most learning-based denoising methods utilize only one-sided information from the spatial domain without considering frequency domain information. To address this limitation, in this study we propose a frequency-sensitive unsupervised denoising method. To this end, a generative adversarial network (GAN) is used as a base structure. Subsequently, we include spectral discriminator and frequency reconstruction loss to transfer frequency knowledge into the generator. Results using natural and synthetic datasets indicate that our unsupervised learning method augmented with frequency information achieves state-of-the-art denoising performance, suggesting that frequency domain information could be a viable factor in improving the overall performance of unsupervised learning-based methods.

Requirements

To install requirements:

conda env create -n [your env name] -f environment.yaml
conda activate [your env name]

To train the model

Synthetic Noise (AWGN)

Download DIV2K dataset for training in here
Randomly split the DIV2K dataset into Clean/Noisy set. Please refer the .txt files in split_data.
Place the splitted dataset(DIV2K_C and DIV2K_N) in ./dataset directory.

dataset
└─── DIV2K_C
└─── DIV2K_N
└─── test

Use gen_dataset_synthetic.py to package dataset in the h5py format.
After that, run this command:

sh ./scripts/train_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/train_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/train_awgn_sigma50.sh # AWGN with a noise level = 50

After finishing the training, .pth file is stored in ./exp/[exp_name]/[seed_number]/saved_models/ directory.

Real-World Noise

Download SIDD-Medium Dataset for training in here
Radnomly split the SIDD-Medium Dataset into Clean/Noisy set. Please refer the .txt files in split_data.
Place the splitted dataset(SIDD_C and SIDD_N) in ./dataset directory.

dataset
└─── SIDD_C
└─── SIDD_N
└─── test

Use gen_dataset_real.py to package dataset in the h5py format.
After that, run this command:

sh ./scripts/train_real.sh

After finishing the training, .pth file is stored in ./exp/[exp_name]/[seed_number]/saved_models/ directory.

To evaluate the model

Synthetic Noise (AWGN)

Download CBSD68 dataset for evaluation in here
Place the dataset in ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/test_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/test_awgn_sigma50.sh # AWGN with a noise level = 50

Real-World Noise

Download the SIDD test dataset for evaluation in here
Place the dataset in ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_real.sh

Pre-trained model

We provide pre-trained models in ./checkpoints directory.

checkpoints
|   AWGN_sigma15.pth # pre-trained model (AWGN with a noise level = 15)
|   AWGN_sigma25.pth # pre-trained model (AWGN with a noise level = 25)
|   AWGN_sigma50.pth # pre-trained model (AWGN with a noise level = 50)
|   SIDD.pth # pre-trained model (Real-World noise)

Acknowledgements

This code is built on U-GAT-IT,CARN, SSD-GAN. We thank the authors for sharing their codes.

Contact

If you have any questions, feel free to contact me ([email protected])

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Related tags

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral) : Official Project Page

Requirements

To train the model

Synthetic Noise (AWGN)

Real-World Noise

To evaluate the model

Synthetic Noise (AWGN)

Real-World Noise

Pre-trained model

Acknowledgements

Contact

Owner

Donggon Jang

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

Implemented fully documented Particle Swarm Optimization algorithm (basic model with few advanced features) using Python programming language

ByteTrack(Multi-Object Tracking by Associating Every Detection Box)のPythonでのONNX推論サンプル

ADSPM: Attribute-Driven Spontaneous Motion in Unpaired Image Translation

my graduation project is about live human face augmentation by projection mapping by using CNN

[AI6101] Introduction to AI & AI Ethics is a core course of MSAI, SCSE, NTU, Singapore

Functional deep learning

This code is for eCaReNet: explainable Cancer Relapse Prediction Network.

Latent Network Models to Account for Noisy, Multiply-Reported Social Network Data

A library of scripts that interact with the PythonTurtle module to create games, drawings, and more

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

git《Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser》(2021) GitHub: [fig5]

Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training of neural networks"

code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"

null

End-To-End Optimization of LiDAR Beam Configuration

Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch

A Streamlit component to render ECharts.

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Scripts and outputs related to the paper Prediction of Adverse Biological Effects of Chemicals Using Knowledge Graph Embeddings.