[CVPR2021] Invertible Image Signal Processing

Last update: Dec 31, 2022

Related tags

Overview

Invertible Image Signal Processing

This repository includes official codes for "Invertible Image Signal Processing (CVPR2021)".

Figure: Our framework

Unprocessed RAW data is a highly valuable image format for image editing and computer vision. However, since the file size of RAW data is huge, most users can only get access to processed and compressed sRGB images. To bridge this gap, we design an Invertible Image Signal Processing (InvISP) pipeline, which not only enables rendering visually appealing sRGB images but also allows recovering nearly perfect RAW data. Due to our framework's inherent reversibility, we can reconstruct realistic RAW data instead of synthesizing RAW data from sRGB images, without any memory overhead. We also integrate a differentiable JPEG compression simulator that empowers our framework to reconstruct RAW data from JPEG images. Extensive quantitative and qualitative experiments on two DSLR demonstrate that our method obtains much higher quality in both rendered sRGB images and reconstructed RAW data than alternative methods.

Invertible Image Signal Processing
Yazhou Xing*, Zian Qian*, Qifeng Chen (* indicates joint first authors)
HKUST

[Paper] [Project Page] [Technical Video (Coming soon)]

Figure: Our results

Installation

Clone this repo.

git clone https://github.com/yzxing87/Invertible-ISP.git 
cd Invertible-ISP/

We have tested our code on Ubuntu 18.04 LTS with PyTorch 1.4.0, CUDA 10.1 and cudnn7.6.5. Please install dependencies by

conda env create -f environment.yml

Preparing datasets

We use MIT-Adobe FiveK Dataset for training and evaluation. To reproduce our results, you need to first download the NIKON D700 and Canon EOS 5D subsets from their website. The images (DNG) can be downloaded by

cd data/
bash data_preprocess.sh

The downloading may take a while. After downloading, we need to prepare the bilinearly demosaiced RAW and white balance parameters as network input, and ground truth sRGB (in JPEG format) as supervision.

python data_preprocess.py --camera="NIKON_D700"
python data_preprocess.py --camera="Canon_EOS_5D"

The dataset will be organized into

Path	Size	Files	Format	Description
data	585 GB	1		Main folder
├ Canon_EOS_5D	448 GB	1		Canon sub-folder
├ NIKON_D700	137 GB	1		NIKON sub-folder
├ DNG	2.9 GB	487	DNG	In-the-wild RAW.
├ RAW	133 GB	487	NPZ	Preprocessed RAW.
├ RGB	752 MB	487	JPG	Ground-truth RGB.
├ NIKON_D700_train.txt	1 KB	1	TXT	Training data split.
├ NIKON_D700_test.txt	5 KB	1	TXT	Test data split.

Training networks

We specify the training arguments into train.sh. Simply run

cd ../
bash train.sh

The checkpoints will be saved into ./exps/{exp_name}/checkpoint/.

Test and evaluation

To reconstruct the RAW from JPEG RGB, we need to first save the rendered RGB into disk then do test to recover RAW. Original RAW images are too huge to be directly tested on one 2080 Ti GPU. We provide two ways to test the model.

Subsampling the RAW for visualization purpose:

python test_rgb.py --task=EXPERIMENT_NAME \
                --data_path="./data/" \
                --gamma \
                --camera=CAMERA_NAME \
                --out_path=OUTPUT_PATH \
                --ckpt=CKPT_PATH

After finish, run

python test_raw.py --task=EXPERIMENT_NAME \
                --data_path="./data/" \
                --gamma \
                --camera=CAMERA_NAME \
                --out_path=OUTPUT_PATH \
                --ckpt=CKPT_PATH

Spliting the RAW data into patches, for quantitatively evaluation purpose. Turn on the --split_to_patch argument. See test.sh. The PSNR and SSIM metrics can be obtained by

python cal_metrics.py --path=PATH_TO_SAVED_PATCHES

Citation

@inproceedings{xing21invertible,
  title     = {Invertible Image Signal Processing},
  author    = {Xing, Yazhou and Qian, Zian and Chen, Qifeng},
  booktitle = {CVPR},
  year      = {2021}
}

Acknowledgement

Part of the codes benefit from DiffJPEG and Invertible-Image-Rescaling.

Contact

Free feel to contact me if there is any question. (Yazhou Xing, [email protected])

[CVPR2021] Invertible Image Signal Processing

Related tags

Overview

Invertible Image Signal Processing

Installation

Preparing datasets

Training networks

Test and evaluation

Citation

Acknowledgement

Contact

Owner

Yazhou XING

Code accompanying the paper "ProxyFL: Decentralized Federated Learning through Proxy Model Sharing"

How Do Adam and Training Strategies Help BNNs Optimization? In ICML 2021.

A scikit-learn compatible neural network library that wraps PyTorch

Implicit Graph Neural Networks

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Nest - A flexible tool for building and sharing deep learning modules

Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)

clustering moroccan stocks time series data using k-means with dtw (dynamic time warping)

Implementation of trRosetta and trDesign for Pytorch, made into a convenient package

Official Code for ICML 2021 paper "Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline"

the official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Codebase for Amodal Segmentation through Out-of-Task andOut-of-Distribution Generalization with a Bayesian Model

Non-Attentive-Tacotron - This is Pytorch Implementation of Google's Non-attentive Tacotron.

Explaining Deep Neural Networks - A comparison of different CAM methods based on an insect data set

Code for CVPR2019 paper《Unequal Training for Deep Face Recognition with Long Tailed Noisy Data》

Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge.

A simple pygame dino game which can also be trained and played by a NEAT KI

Attack on Confidence Estimation algorithm from the paper "Disrupting Deep Uncertainty Estimation Without Harming Accuracy"

Omnidirectional camera calibration in python

Continuous Query Decomposition for Complex Query Answering in Incomplete Knowledge Graphs