MIMO-UNet - Official Pytorch Implementation

Last update: Jan 02, 2023

Related tags

Overview

MIMO-UNet - Official Pytorch Implementation

This repository provides the official PyTorch implementation of the following paper:

Rethinking Coarse-to-Fine Approach in Single Image Deblurring

Sung-Jin Cho *, Seo-Won Ji *, Jun-Pyo Hong, Seung-Won Jung, Sung-Jea Ko

In ICCV 2021. (* indicates equal contribution)

Paper: https://arxiv.org/abs/2108.05054

Abstract: Coarse-to-fine strategies have been extensively used for the architecture design of single image deblurring networks. Conventional methods typically stack sub-networks with multi-scale input images and gradually improve sharpness of images from the bottom sub-network to the top sub-network, yielding inevitably high computational costs. Toward a fast and accurate deblurring network design, we revisit the coarse-to-fine strategy and present a multi-input multi-output U-net (MIMO-UNet). The MIMO-UNet has three distinct features. First, the single encoder of the MIMO-UNet takes multi-scale input images to ease the difficulty of training. Second, the single decoder of the MIMO-UNet outputs multiple deblurred images with different scales to mimic multi-cascaded U-nets using a single U-shaped network. Last, asymmetric feature fusion is introduced to merge multi-scale features in an efficient manner. Extensive experiments on the GoPro and RealBlur datasets demonstrate that the proposed network outperforms the state-of-the-art methods in terms of both accuracy and computational complexity.

The contents of this repository are as follows:

Dependencies
Dataset
Train
Test
Performance
Model

Dependencies

Python
Pytorch (1.4)
- Different versions may cause some errors.
scikit-image
opencv-python
Tensorboard

Dataset

Download deblur dataset from the GoPro dataset .
Unzip files dataset folder.
Preprocess dataset by running the command below:

python data/preprocessing.py

After preparing data set, the data folder should be like the format below:

GOPRO
├─ train
│ ├─ blur    % 2103 image pairs
│ │ ├─ xxxx.png
│ │ ├─ ......
│ │
│ ├─ sharp
│ │ ├─ xxxx.png
│ │ ├─ ......
│
├─ test    % 1111 image pairs
│ ├─ ...... (same as train)

Train

To train MIMO-UNet+ , run the command below:

python main.py --model_name "MIMO-UNetPlus" --mode "train" --data_dir "dataset/GOPRO"

or to train MIMO-UNet, run the command below:

python main.py --model_name "MIMO-UNet" --mode "train" --data_dir "dataset/GOPRO"

Model weights will be saved in results/model_name/weights folder.

Test

To test MIMO-UNet+ , run the command below:

python main.py --model_name "MIMO-UNetPlus" --mode "test" --data_dir "dataset/GOPRO" --test_model "MIMO-UNetPlus.pkl"

or to test MIMO-UNet, run the command below:

python main.py --model_name "MIMO-UNet" --mode "test" --data_dir "dataset/GOPRO" --test_model "MIMO-UNet.pkl"

Output images will be saved in results/model_name/result_image folder.

Performance

Method	MIMO-UNet	MIMO-UNet+	MIMO-UNet++
PSNR (dB)	31.73	32.45	32.68
SSIM	0.951	0.957	0.959
Runtime (s)	0.008	0.017	0.040

Model

We provide our pre-trained models. You can test our network following the instruction above.

MIMO-UNet - Official Pytorch Implementation

Related tags

Overview

MIMO-UNet - Official Pytorch Implementation

Contents

Dependencies

Dataset

Train

Test

Performance

Model

Owner

Sungjin Cho

This is a Deep Leaning API for classifying emotions from human face and human audios.

Code and project page for ICCV 2021 paper "DisUnknown: Distilling Unknown Factors for Disentanglement Learning"

Code for paper [ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot] (ICCV 2021, oral))

yolov5目标检测模型的知识蒸馏（基于响应的蒸馏）

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

CodeContests is a competitive programming dataset for machine-learning

Progressive Coordinate Transforms for Monocular 3D Object Detection

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

Unofficial implementation of Pix2SEQ

Code for ICCV2021 paper SPEC: Seeing People in the Wild with an Estimated Camera

Algebraic effect handlers in Python

VideoGPT: Video Generation using VQ-VAE and Transformers

PyTorch implementation for COMPLETER: Incomplete Multi-view Clustering via Contrastive Prediction (CVPR 2021)

Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Source code and dataset of the paper "Contrastive Adaptive Propagation Graph Neural Networks forEfficient Graph Learning"

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Challenge on Spectral Reconstruction from RGB)

TensorFlow (Python API) implementation of Neural Style