Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation

Last update: Jan 04, 2023

Overview

Official implementation of the paper

Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation
ICCV 2021 [oral]
Gwangbin Bae, Ignas Budvytis, and Roberto Cipolla
[arXiv]

The proposed method estimates a per-pixel probability distribution over surface normals, from which the expected angular error can be inferred to quantify the aleatoric uncertainty. We also introduce a novel decoder framework in which pixel-wise MLPs are trained on a subset of pixels selected according to their uncertainty. This uncertainty-guided sampling prevents training from being biased towards large planar surfaces, thereby improving the level of detail in the prediction.
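The two ideas above can be illustrated with a small, self-contained sketch. It assumes that each pixel's distribution has a concentration parameter `kappa` with density proportional to `exp(-kappa * theta)`, where `theta` is the angle from the predicted normal; this parameterization is an assumption made here and is not stated in this README. Under that assumption the expected angular error has a closed form. The function names, the `beta` split, and the toy values are hypothetical and are not taken from the repository.

```python
import math
import random


def expected_angular_error(kappa):
    """Expected angle (radians) under a density proportional to
    exp(-kappa * theta) on the unit sphere (assumed parameterization)."""
    decay = math.exp(-kappa * math.pi)
    return 2.0 * kappa / (kappa ** 2 + 1.0) + math.pi * decay / (1.0 + decay)


def sample_pixels(uncertainty, num_samples, beta=0.5, rng=None):
    """Pick pixel indices: a fraction `beta` with the highest uncertainty,
    the rest uniformly at random from the remaining pixels."""
    rng = rng or random.Random()
    num_samples = min(num_samples, len(uncertainty))
    num_top = int(beta * num_samples)
    # Indices sorted from most to least uncertain.
    order = sorted(range(len(uncertainty)),
                   key=lambda i: uncertainty[i], reverse=True)
    top = order[:num_top]
    rest = rng.sample(order[num_top:], num_samples - num_top)
    return top + rest


# Toy example: a flattened 8-pixel "image" of concentration values.
kappas = [50.0, 40.0, 0.5, 30.0, 1.0, 60.0, 2.0, 45.0]
errors = [expected_angular_error(k) for k in kappas]
picked = sample_pixels(errors, num_samples=4, beta=0.5, rng=random.Random(0))
```

In this toy example the low-concentration pixels (small `kappa`, large expected error) are always selected first, while the remaining samples are spread at random over the confident pixels, so large uniform regions do not dominate the selection.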

Getting Started

We recommend using a virtual environment.

python3.6 -m venv --system-site-packages ./venv
source ./venv/bin/activate

Install the necessary dependencies:

python3.6 -m pip install -r requirements.txt

Download the pre-trained model weights and sample images.

python download.py && cd examples && unzip examples.zip && cd ..

Running the above will download

./checkpoints/nyu.pt (model trained on NYUv2)
./checkpoints/scannet.pt (model trained on ScanNet)
./examples/*.png (sample images)
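Before running the demo, it can help to confirm that the download completed. The sketch below is a hypothetical helper (it is not part of the repository) that only checks for the paths listed above, relative to the repository root.

```python
from pathlib import Path


def missing_downloads(root="."):
    """Return the expected files or patterns that are absent under `root`."""
    root = Path(root)
    missing = []
    for rel in ("checkpoints/nyu.pt", "checkpoints/scannet.pt"):
        if not (root / rel).is_file():
            missing.append(rel)
    # At least one sample image should have been extracted.
    if not any((root / "examples").glob("*.png")):
        missing.append("examples/*.png")
    return missing


if __name__ == "__main__":
    print(missing_downloads() or "All expected files found.")
```

An empty list means both checkpoints and at least one sample image are in place.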

Run Demo

To test on your own images, add them under ./examples/. Images should be in .png or .jpg format.
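To see which files in the folder match the formats named above, a small check like the following may be useful. It is a hypothetical helper, not part of the repository, and it judges files by extension only; how the demo script itself filters files is not described here.

```python
from pathlib import Path

SUPPORTED = {".png", ".jpg"}  # formats named in the text above


def split_by_format(folder="./examples"):
    """Return (usable, skipped) file names in `folder`, judged by extension only."""
    usable, skipped = [], []
    for path in sorted(Path(folder).iterdir()):
        if not path.is_file():
            continue
        # Lower-casing the suffix is a choice made here; the demo script may differ.
        if path.suffix.lower() in SUPPORTED:
            usable.append(path.name)
        else:
            skipped.append(path.name)
    return usable, skipped
```

Files reported as skipped (for example the downloaded examples.zip) can be ignored or converted before running the demo.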

Test using the network trained on NYUv2. We used the ground truth and data split provided by GeoNet.

Please note that the ground truth for NYUv2 is only defined for the center crop of the image. The prediction is therefore not accurate outside the center. When testing on your own images, we recommend using the network trained on ScanNet.

python test.py --pretrained nyu --architecture GN

Test using the network trained on ScanNet. We used the ground truth and data split provided by FrameNet.

python test.py --pretrained scannet --architecture BN

Running the above will save the predicted surface normals and uncertainty under ./examples/results/. If successful, you will obtain images like the ones below.

The predictions in the figure above are obtained by the network trained only on ScanNet. The network generalizes well to objects unseen during training (e.g., humans, cars, animals). The last row shows interesting examples where the input image only contains edges.

Citation

If you find our work useful in your research, please consider citing our paper:

@InProceedings{Bae2021,
    title     = {Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation},
    author    = {Gwangbin Bae and Ignas Budvytis and Roberto Cipolla},
    booktitle = {International Conference on Computer Vision (ICCV)},
    year      = {2021}
}
