Intrinsic Image Harmonization

Last update: Dec 21, 2022

Related tags

Deep Learning IntrinsicHarmony

Overview

Intrinsic Image Harmonization [Paper]

Zonghui Guo, Haiyong Zheng, Yufeng Jiang, Zhaorui Gu, Bing Zheng

Here we provide PyTorch implementation and the trained model of our framework.

Prerequisites

Linux
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Train/Test

Download iHarmony4 dataset, and our HVIDIT dataset Google Drive or BaiduCloud (access code: akbi).
Train a model:

CUDA_VISIBLE_DEVICES=0 python train.py --model retinexltifpm  --name retinexltifpm_allihd  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Test the model

CUDA_VISIBLE_DEVICES=0 python test.py --model retinexltifpm  --name retinexltifpm_allihd  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Apply a pre-trained model

Download the pretrained model from Google Drive or BaiduCloud (access code: 20m6), and put net_G.pth in the directory checkpoints/experiment. Run:

CUDA_VISIBLE_DEVICES=0 python test.py --model retinexltifpm  --name experiment  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Evaluation

We provide the code in ih_evaluation.py. Run:

CUDA_VISIBLE_DEVICES=0 python evaluation/ih_evaluation.py --dataroot <dataset_dir> --result_root  results/experiment/test_latest/images/ --evaluation_type our --dataset_name ALL

Quantitative Result

Dataset	Metrics	Composite	Ours (iHarmony4)	Ours (iHarmony4+HVIDIT)
HCOCO	PSNR MSE fMSE	33.99 69.37 996.59	37.61 23.25 386.39	37.77 21.84 367.38
HAdobe5k	PSNR MSE fMSE	28.52 345.54 2051.61	36.20 42.21 296.76	36.49 39.53 266.49
HFlickr	PSNR MSE fMSE	28.43 264.35 1574.37	31.74 100.86 676.71	32.08 96.87 635.60
Hday2night	PSNR MSE fMSE	34.36 109.65 1409.98	36.48 50.64 755.88	36.60 50.37 763.33
HVIDIT	PSNR MSE fMSE	38.72 53.12 1604.41	- - -	41.83 22.49 691.06
ALL	PSNR MSE fMSE	32.07 167.39 1386.12	36.53 37.95 399.34	36.96 35.33 388.50

Bibtex

If you use this code for your research, please cite our papers.

@InProceedings{Guo_2021_CVPR,
    author    = {Guo, Zonghui and Zheng, Haiyong and Jiang, Yufeng and Gu, Zhaorui and Zheng, Bing},
    title     = {Intrinsic Image Harmonization},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {16367-16376}
}

Acknowledgement

For some of the data modules and model functions used in this source code, we need to acknowledge the repo of DoveNet and CycleGAN.

You might also like...

python library for invisible image watermark (blind image watermark)

invisible-watermark invisible-watermark is a python library and command line tool for creating invisible watermark over image.(aka. blink image waterm

572 Jan 7, 2023

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

AOT-GAN for High-Resolution Image Inpainting Arxiv Paper | AOT-GAN: Aggregated Contextual Transformations for High-Resolution Image Inpainting Yanhong

214 Jan 3, 2023

Code for Dual Contrastive Learning for Unsupervised Image-to-Image Translation, NTIRE, CVPRW 2021.

arXiv Dual Contrastive Learning Adversarial Generative Networks (DCLGAN) We provide our PyTorch implementation of DCLGAN, which is a simple yet powerf

119 Dec 4, 2022

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Deep Image Search - AI-Based Image Search Engine Deep Image Search is an AI-based image search engine that includes deep transfer learning features Ex

139 Jan 1, 2023

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

ImageProcessingTransformer Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

61 Jan 1, 2023

[CVPR 2021] Teachers Do More Than Teach: Compressing Image-to-Image Models (CAT)

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set —— PyTorch implementation This is an unofficial offici

833 Dec 28, 2022

Comments

Model Inference

Hello, is there a way to infer the model by reading an image and passing the image and its mask to the model and getting the harmonized output? Without the need to store the image's path in a text file and reading it from the text file then loading the image?

opened by AhmedHashish123 2
visdom interface is blank

first，thanks for your excellent work！ When I execute the training code, the visdom interface does not display the result picture and the training loss. it works when I execute the code of dovenet. could you tell me how to solve this problem? thanks again

opened by Ligouhi 0

Releases(v1.0)

v1.0(Feb 9, 2022)

Code version of our CVPR work [Paper].
Source code(tar.gz)
Source code(zip)

Intrinsic Image Harmonization

Related tags

Overview

Intrinsic Image Harmonization [Paper]

Prerequisites

Train/Test

Apply a pre-trained model

Evaluation

Quantitative Result

Bibtex

Acknowledgement

You might also like...

python library for invisible image watermark (blind image watermark)

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

Code for Dual Contrastive Learning for Unsupervised Image-to-Image Translation, NTIRE, CVPRW 2021.

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

[CVPR 2021] Teachers Do More Than Teach: Compressing Image-to-Image Models (CAT)

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Comments

Model Inference

visdom interface is blank

Releases(v1.0)

v1.0(Feb 9, 2022)

Owner

VISION @ OUC

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

This is the code related to "Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation" (ICCV 2021).

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Reference implementation for Deep Unsupervised Learning using Nonequilibrium Thermodynamics

Count GitHub Stars ⭐

TrackTech: Real-time tracking of subjects and objects on multiple cameras

Code and hyperparameters for the paper "Generative Adversarial Networks"

Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

A Moonraker plug-in for real-time compensation of frame thermal expansion

Latent Execution for Neural Program Synthesis

《Single Image Reflection Removal Beyond Linearity》(CVPR 2019)

Python script that analyses the given datasets and comes up with the best polynomial regression representation with the smallest polynomial degree possible

Code for Blind Image Decomposition (BID) and Blind Image Decomposition network (BIDeN).

VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

Pytorch implementation for RelTransformer

Registration Loss Learning for Deep Probabilistic Point Set Registration

Improving Object Detection by Estimating Bounding Box Quality Accurately

Efficiently computes derivatives of numpy code.

Code for CVPR2021 paper 'Where and What? Examining Interpretable Disentangled Representations'.

Setup and customize deep learning environment in seconds.