VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware image synthesis. We propose a novel framework, termed VolumeGAN, for synthesizing images under different camera views by explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves higher image quality and better 3D control than previous methods.
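
The step in which the feature field is accumulated into a 2D feature map can be illustrated with a small sketch. It assumes the standard NeRF-style volume-rendering quadrature applied to feature vectors instead of colors, for a single ray. The function name `accumulate_features` and its argument layout are hypothetical and are not taken from the VolumeGAN code, which has not been released yet.

```python
import math


def accumulate_features(densities, features, deltas):
    """Composite per-sample feature vectors along one ray into one vector.

    Hypothetical helper for illustration only (not the authors' code).

    densities: non-negative density sigma_i at each sample, ordered near to far
    features:  feature vector (list of floats) at each sample
    deltas:    length of the ray segment represented by each sample
    """
    channels = len(features[0])
    pixel_feature = [0.0] * channels
    transmittance = 1.0  # fraction of the ray not yet absorbed
    weights = []
    for sigma, feat, delta in zip(densities, features, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)  # opacity of this segment
        weight = transmittance * alpha          # contribution of this sample
        weights.append(weight)
        for c in range(channels):
            pixel_feature[c] += weight * feat[c]
        transmittance *= 1.0 - alpha            # attenuate for farther samples
    return pixel_feature, weights


# Three samples on one ray: empty space, a dense surface, then a faint sample.
pixel_feature, weights = accumulate_features(
    densities=[0.0, 50.0, 1.0],
    features=[[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    deltas=[0.1, 0.1, 0.1],
)
```

In this toy ray the dense second sample receives almost all of the weight, so the accumulated vector is dominated by that sample's feature. Repeating the accumulation for every pixel's ray would give the 2D feature map that the abstract describes as the input to the neural renderer.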

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  journal = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

Owner

GenForce: May Generative Force Be with You
