Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Last update: Jan 02, 2023

Overview

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

An efficient PyTorch library for Point Cloud Completion.

Project page | Paper | Video

Chulin Xie*, Chuxin Wang*, Bo Zhang, Hao Yang, Dong Chen, and Fang Wen. (*Equal contribution)

Abstract

We proposed a novel Style-based Point Generator with Adversarial Rendering (SpareNet) for point cloud completion. Firstly, we present the channel-attentive EdgeConv to fully exploit the local structures as well as the global shape in point features. Secondly, we observe that the concatenation manner used by vanilla foldings limits its potential of generating a complex and faithful shape. Enlightened by the success of StyleGAN, we regard the shape feature as style code that modulates the normalization layers during the folding, which considerably enhances its capability. Thirdly, we realize that existing point supervisions, e.g., Chamfer Distance or Earth Mover’s Distance, cannot faithfully reﬂect the perceptual quality of the reconstructed points. To address this, we propose to project the completed points to depth maps with a differentiable renderer and apply adversarial training to advocate the perceptual realism under different viewpoints. Comprehensive experiments on ShapeNet and KITTI prove the effectiveness of our method, which achieves state-of-the-art quantitative performance while offering superior visual quality.

Installation

Create a virtual environment via conda.

conda create -n sparenet python=3.7
conda activate sparenet

Install torch and torchvision.

conda install pytorch cudatoolkit=10.1 torchvision -c pytorch

Install requirements.
```
pip install -r requirements.txt
```
Install cuda
```
sh setup_env.sh
```

Dataset

Download the processed ShapeNet dataset generated by GRNet, and the KITTI dataset.

Update the file path of the datasets in configs/base_config.py:

__C.DATASETS.shapenet.partial_points_path = "/path/to/datasets/ShapeNetCompletion/%s/partial/%s/%s/%02d.pcd"
__C.DATASETS.shapenet.complete_points_path = "/path/to/datasets/ShapeNetCompletion/%s/complete/%s/%s.pcd"
__C.DATASETS.kitti.partial_points_path = "/path/to/datasets/KITTI/cars/%s.pcd"
__C.DATASETS.kitti.bounding_box_file_path = "/path/to/datasets/KITTI/bboxes/%s.txt"

# Dataset Options: ShapeNet, ShapeNetCars, KITTI
__C.DATASET.train_dataset = "ShapeNet"
__C.DATASET.test_dataset = "ShapeNet"

Get Started

Inference Using Pretrained Model

The pretrained models:

SpareNet for ShapeNet (316 MB)
PCN for ShapeNet
GRNet for ShapeNet (307 MB)
GRNet for KITTI (307 MB)
MSN for ShapeNet (8192 points)

run

python   --gpu ${GPUS}\
         --work_dir ${WORK_DIR} \
         --model ${network} \
         --weights ${path to checkpoint} \
         --test_mode ${mode}

example

python  test.py --gpu 0 --work_dir /path/to/logfiles --model sparenet --weights /path/to/cheakpoint --test_mode default

Train

All log files in the training process, such as log message, checkpoints, etc, will be saved to the work directory.

run

python   --gpu ${GPUS}\
         --work_dir ${WORK_DIR} \
         --model ${network} \
         --weights ${path to checkpoint}

example

python  train.py --gpu 0,1,2,3 --work_dir /path/to/logfiles --model sparenet --weights /path/to/cheakpoint

Differentiable Renderer

A fully differentiable point renderer that enables end-to-end rendering from 3D point cloud to 2D depth maps. See the paper for details.

Usage of Renderer

The inputs of renderer are pcd, views and radius, and the outputs of renderer are depth_maps.

example

# `projection_mode`: a str with value "perspective" or "orthorgonal"
# `eyepos_scale`: a float that defines the distance of eyes to (0, 0, 0)
# `image_size`: an int defining the output image size
renderer = ComputeDepthMaps(projection_mode, eyepos_scale, image_size)

# `data`: a tensor with shape [batch_size, num_points, 3]
# `view_id`: the index of selected view satisfying 0 <= view_id < 8
# `radius_list`: a list of floats, defining the kernel radius to render each point
depthmaps = renderer(data, view_id, radius_list)

License

The codes and the pretrained model in this repository are under the MIT license as specified by the LICENSE file.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

BibTex

If you like our work and use the codebase or models for your research, please cite our work as follows.

@inproceedings{xie2021stylebased,
      title={Style-based Point Generator with Adversarial Rendering for Point Cloud Completion}, 
      author={Chulin Xie and Chuxin Wang and Bo Zhang and Hao Yang and Dong Chen and Fang Wen},
      booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
      year={2021},
}

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Related tags

Overview

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Project page | Paper | Video

Abstract

Installation

Dataset

Get Started

Inference Using Pretrained Model

Train

Differentiable Renderer

Usage of Renderer

License

BibTex

Owner

Microsoft

ISNAS-DIP: Image Specific Neural Architecture Search for Deep Image Prior [CVPR 2022]

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Keras-retinanet - Keras implementation of RetinaNet object detection.

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

A minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose

League of Legends Reinforcement Learning Environment (LoLRLE) multiple training scenarios using PPO.

A fast Protein Chain / Ligand Extractor and organizer.

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.

Dynamica causal Bayesian optimisation

A deep learning based semantic search platform that computes similarity scores between provided query and documents

face2comics by Sxela (Alex Spirin) - face2comics datasets

The code for paper Efficiently Solve the Max-cut Problem via a Quantum Qubit Rotation Algorithm

Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW 2021

🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

An addernet CUDA version

SAMO: Streaming Architecture Mapping Optimisation

DECAF: Deep Extreme Classification with Label Features

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

How Effective is Incongruity? Implications for Code-mix Sarcasm Detection.

Deep Residual Learning for Image Recognition