VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

Implementation of ConvMixer for "Patches Are All You Need? 🤷"

A neuroanatomy-based augmented reality experience powered by computer vision. Features 3D visuals of the Atlas Brain Map slices.

Show-attend-and-tell - TensorFlow Implementation of "Show, Attend and Tell"

Callable PyTrees and filtered JIT/grad transformations => neural networks in JAX.

[ICLR'21] FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

Automatic Number Plate Recognition using Contours and Convolution Neural Networks (CNN)

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Discovering Dynamic Salient Regions with Spatio-Temporal Graph Neural Networks

ICCV2021 Expert-Goal Trajectory Prediction

Random Forests for Regression with Missing Entries

POPPY (Physical Optics Propagation in Python) is a Python package that simulates physical optical propagation including diffraction

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

make ASCII Art by Deep Learning

NeuroFind - A solution to the to the Task given by the Oberseminar of Messtechnik Institute of TU Dresden in 2021

TuckER: Tensor Factorization for Knowledge Graph Completion

Code for weakly supervised segmentation of a single class

D-NeRF: Neural Radiance Fields for Dynamic Scenes

A framework for GPU based high-performance medical image processing and visualization

Code & Models for 3DETR - an End-to-end transformer model for 3D object detection

[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.