CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields

Last update: Nov 29, 2022

Related tags

Overview

CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields

Paper | Supplementary | Video | Poster

If you find our code or paper useful, please cite as

@inproceedings{CAMPARINiemeyer2021,
    author = {Niemeyer, Michael and Geiger, Andreas},
    title = {CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields},
    booktitle = {International Conference on 3D Vision (3DV)},
    year = {2021}
}

TL; DR - Quick Start

First you have to make sure that you have all dependencies in place. The simplest way to do so, is to use anaconda.

You can create an anaconda environment called campari using

conda env create -f environment.yml
conda activate campari

You can now test our code on the provided pre-trained models. For example, for creating short video clips, run simply run

python eval_video.py configs/celeba_pretrained.yaml

python eval_figures.py configs/celeba_pretrained.yaml

for creating respective figures.

This script should create a model output folder out/celeba_pretrained. The animations are then saved to the respective subfolders.

Usage

Datasets and Stats Files

To train a model from scratch, you have to download the respective dataset.

For this, please run

bash scripts/download_dataset.sh

and following the instructions. This script should download and unpack the data automatically into the data/ folder.

Note: For FID evaluation or creating figures containing the GT camera distributions, you need to download the "stats files" (select "4 - Camera stats files" for this).

Controllable Image Synthesis

To render short clips or figures from a trained model, run

python eval_video.py CONFIG.yaml

python eval_figures.py CONFIG.yaml

where you replace CONFIG.yaml with the correct config file. The easiest way is to use a pre-trained model. You can do this by using one of the config files which are indicated with *_pretrained.yaml.

For example, for our model trained on celebA, run

python eval_video.py configs/celeba_pretrained.yaml

Our script will automatically download the model checkpoints and render images. You can find the outputs in the out/*_pretrained folders.

Please note that the config files *_pretrained.yaml are only for evaluation or rendering, not for training new models: when these configs are used for training, the model will be trained from scratch, but during inference our code will still use the pre-trained model.

FID Evaluation

For evaluation of the models, we provide the script eval_fid.py. Make sure to have downloaded the stats files (see Usage - Datasets and Stats Files). You can run it using

python eval_fid.py CONFIG.yaml

The script generates 20000 images and calculates the FID score.

Training

Finally, to train a new network from scratch, run

python train.py CONFIG.yaml

where you replace CONFIG.yaml with the name of the configuration file you want to use.

You can monitor on http://localhost:6006 the training process using tensorboard:

cd OUTPUT_DIR
tensorboard --logdir ./logs

where you replace OUTPUT_DIR with the respective output directory. For available training options, please take a look at configs/default.yaml.

Futher Information

More Work on Coordinate-based Neural Representations

If you like the CAMPARI project, please check out related works on neural representions from our group:

CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields

Related tags

Overview

CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields

Paper | Supplementary | Video | Poster

TL; DR - Quick Start

Usage

Datasets and Stats Files

Controllable Image Synthesis

FID Evaluation

Training

Futher Information

More Work on Coordinate-based Neural Representations

Owner

An implementation of Fastformer: Additive Attention Can Be All You Need in TensorFlow

Dahua Camera and Doorbell Home Assistant Integration

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Deep Watershed Transform for Instance Segmentation

Code to generate datasets used in "How Useful is Self-Supervised Pretraining for Visual Tasks?"

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

GluonMM is a library of transformer models for computer vision and multi-modality research

Pytorch implementation of the paper: "A Unified Framework for Separating Superimposed Images", in CVPR 2020.

PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Implementation of Memory-Efficient Neural Networks with Multi-Level Generation, ICCV 2021

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

An end-to-end library for editing and rendering motion of 3D characters with deep learning [SIGGRAPH 2020]

Lepard: Learning Partial point cloud matching in Rigid and Deformable scenes

Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)

PySOT - SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Semantic Segmentation for Aerial Imagery using Convolutional Neural Network

Orange Chicken: Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation

The reference baseline of final exam for XMU machine learning course

PyTorch implementations of neural network models for keyword spotting