Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

Last update: Dec 29, 2022

Related tags

Overview

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

This repository is for BAAF-Net introduced in the following paper:

"Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion"
Shi Qiu, Saeed Anwar, Nick Barnes
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021)

Paper and Citation

The paper can be downloaded from here (CVF) or here (arXiv).
If you find our paper/codes/results are useful, please cite:

@inproceedings{qiu2021semantic,
  title={Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion},
  author={Qiu, Shi and Anwar, Saeed and Barnes, Nick},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  pages={1757-1767},
  year={2021}
}

Updates

04/05/2021 Results for S3DIS dataset (mIoU: 72.2%, OA: 88.9%, mAcc: 83.1%) are available now.
04/05/2021 Test results (sequence 11-21: mIoU: 59.9%, OA: 89.8%) for SemanticKITTI dataset are available now.
04/05/2021 Validation results (sequence 08: mIoU: 58.7%, OA: 91.3%) for SemanticKITTI are available now.
28/05/2021 Pretrained models can be downloaded on all 6 areas of S3DIS dataset are available at google drive.
28/05/2021 codes released!

Settings

The project is tested on Python 3.6, Tensorflow 1.13.1 and cuda 10.0
Then install the dependencies: pip install -r helper_requirements.txt
And compile the cuda-based operators: sh compile_op.sh
(Note: may change the cuda root directory CUDA_ROOT in ./util/sampling/compile_ops.sh)

Dataset

Download S3DIS dataset from here.
Unzip and move the folder Stanford3dDataset_v1.2_Aligned_Version to ./data.
Run: python utils/data_prepare_s3dis.py
(Note: may specify other directory as dataset_path in ./util/data_prepare_s3dis.py)

Training/Test

Training:

python -B main_S3DIS.py --gpu 0 --mode train --test_area 5

(Note: specify the --test_area from 1~6)

Test:

python -B main_S3DIS.py --gpu 0 --mode test --test_area 5 --model_path 'pretrained/Area5/snap-32251'

(Note: specify the --test_area index and the trained model path --model_path)

6-fold Cross Validation

Conduct training and test on each area.
Extract all test results, Area_1_conferenceRoom_1.ply ... Area_6_pantry_1.ply (272 .ply files in total), to the folder ./data/results
Run: python utils/6_fold_cv.py
(Note: may change the target folder original_data_dir and the test results base_dir in ./util/6_fold_cv.py)

Pretrained Models and Results on S3DIS Dataset

BAAF-Net pretrained models on all 6 areas can be downloaded from google drive.
Download our results (ply files) via google drive for visualizations/comparisons.
More Functions about loading/writing/etc. ply files can be found from here.

Results on SemanticKITTI Dataset

Online test results (sequence 11-21): mIoU: 59.9%, OA: 89.8%
Download our test results (sequence 11-21 label files) via google drive for visualizations/comparisons.

Validation results (sequence 08): mIoU: 58.7%, OA: 91.3%
Download our validation results (sequence 08 label files) via google drive for visualizations/comparisons.
Visualization tools can be found from semantic-kitti-api.

Acknowledgment

The code is built on RandLA-Net. We thank the authors for sharing the codes.

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

Related tags

Overview

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

Paper and Citation

Updates

Settings

Dataset

Training/Test

6-fold Cross Validation

Pretrained Models and Results on S3DIS Dataset

Results on SemanticKITTI Dataset

Acknowledgment

Owner

👨‍💻 run nanosaur in simulation with Gazebo/Ingnition

Geometry-Aware Learning of Maps for Camera Localization (CVPR2018)

Reproduced Code for Image Forgery Detection papers.

This repository contains the code used for the implementation of the paper "Probabilistic Regression with HuberDistributions"

PyTorch deep learning projects made easy.

Implementation for "Manga Filling Style Conversion with Screentone Variational Autoencoder" (SIGGRAPH ASIA 2020 issue)

CTF challenges and write-ups for MicroCTF 2021.

Character-Input - Create a program that asks the user to enter their name and their age

[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer

HistoSeg : Quick attention with multi-loss function for multi-structure segmentation in digital histology images

Learned Token Pruning for Transformers

MoCoPnet - Deformable 3D Convolution for Video Super-Resolution

Official repository of the paper Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

Deep learning models for classification of 15 common weeds in the southern U.S. cotton production systems.

Zero-Cost Proxies for Lightweight NAS

Rot-Pro: Modeling Transitivity by Projection in Knowledge Graph Embedding

This repository holds the code for the paper "Deep Conditional Gaussian Mixture Model forConstrained Clustering".

PAMI stands for PAttern MIning. It constitutes several pattern mining algorithms to discover interesting patterns in transactional/temporal/spatiotemporal databases

[SIGGRAPH 2021 Asia] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning