[AAAI 2021] EMLight: Lighting Estimation via Spherical Distribution Approximation and [ICCV 2021] Sparse Needlets for Lighting Estimation with Spherical Transport Loss

Last update: Jan 06, 2023

Related tags

Deep Learning Illumination-Estimation

Overview

EMLight: Lighting Estimation via Spherical Distribution Approximation (AAAI 2021)

Update

12/2021: We release our Virtual Object Relighting (VOR) Dataset for lighting estimation evaluation. Please refer to Virtual Object Insertion & Rendering section.
07/2021: Our new work Sparse Needlets for Lighting Estimation with Spherical Transport Loss is accepted to ICCV 2021. This work introduces a new Needlets basis for lighting representation which allows to represent illumination in both spatial and frequency domains. The implementation code is available in Needlets/ of this repository.

Prerequisites

Linux or macOS
Python3, PyTorch
CPU or NVIDIA GPU + CUDA CuDNN

Dataset Preparation

Laval Indoor HDR Dataset
Thanks to the intellectual property of Laval Indoor dataset, the original datasets and processed training data can not be released from me. Please get access to the dataset by contacting the dataset creator [email protected].

After getting the dataset, the raw illumination map can be processed to generate the training data of the regression network as below:

cd RegressionNetwork/representation/
python3 distribution_representation.py

Pretrained Models

The pretrained regression model of EMLight (96 anchor points, without depth branch) as well as pretrained densenet-121 can be downloaded from Google Drive. Saving the pretrained models in RegressionNetwork/checkpoints. The model parameters should be adjusted accordingly for inference.

Training

Then run the command

cd RegressionNetwork/
python3 train.py

Training tip1: you may overfit the model on a small subset first, then train the model on the full set, to avoid divergence during training.

Training tip2: you can try to reduce the number of anchor points (e.g., 96) in the model, which helps to converge during training.

Virtual Object Insertion & Rendering

To evaluate the performance of lighting estimation, we create a Virtual Object Relighting (VOR) dataset to conduct object insertion & rendering in Blender. The lighting estimaiton performance is evaluated by using the predicted illumination map as the environment light in Blender.

The background scenes of this set include images from Laval Indoor HDR, Fast Spatially-Varying Indoor, and some wild scenes. This dataset can be downloaded from Google Drive.

Citation

If you use this code for your research, please cite our papers.

@inproceedings{zhan2021emlight,
  title={EMLight: Lighting Estimation via Spherical Distribution Approximation},
  author={Zhan, Fangneng and Zhang, Changgong and Yu, Yingchen and Chang, Yuan and Lu, Shijian and Ma, Feiying and Xie, Xuansong},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  year={2021}
}

@inproceedings{zhan2021emlight,
  title={Sparse Needlets for Lighting Estimation with Spherical Transport Loss},
  author={Zhan, Fangneng and Zhang, Changgong and Hu, Wenbo and Lu, Shijian and Ma, Feiying and Xie, Xuansong and Shao, Ling},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

[AAAI 2021] EMLight: Lighting Estimation via Spherical Distribution Approximation and [ICCV 2021] Sparse Needlets for Lighting Estimation with Spherical Transport Loss

Related tags

Overview

EMLight: Lighting Estimation via Spherical Distribution Approximation (AAAI 2021)

Update

Prerequisites

Dataset Preparation

Pretrained Models

Training

Virtual Object Insertion & Rendering

Citation

Owner

Fangneng Zhan

IEGAN — Official PyTorch Implementation Independent Encoder for Deep Hierarchical Unsupervised Image-to-Image Translation

Bayesian Optimization Library for Medical Image Segmentation.

[peer review] An Arbitrary Scale Super-Resolution Approach for 3D MR Images using Implicit Neural Representation

Self-supervised spatio-spectro-temporal represenation learning for EEG analysis

Multi-Scale Progressive Fusion Network for Single Image Deraining

A collection of resources and papers on Diffusion Models, a darkhorse in the field of Generative Models

InvTorch: memory-efficient models with invertible functions

A machine learning project which can detect and predict the skin disease through image recognition.

CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image.

HistoKT: Cross Knowledge Transfer in Computational Pathology

ArtEmis: Affective Language for Art

prior-based-losses-for-medical-image-segmentation

Official Implementation of "Transformers Can Do Bayesian Inference"

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

Implementation of gaze tracking and demo

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals.

VGGFace2-HQ - A high resolution face dataset for face editing purpose

Suite of 500 procedurally-generated NLP tasks to study language model adaptability

Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)

Machine Learning Toolkit for Kubernetes