Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

Last update: Dec 12, 2022

Related tags

Deep Learning One-Thing-One-Click

Overview

One Thing One Click

One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation (CVPR2021)

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

This code is based on PointGroup https://github.com/llijiang/PointGroup

Authors: Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu

Installation

Requirements

Python 3.7.0
Pytorch 1.3.0
CUDA 10.1

Virtual Environment

conda create -n pointgroup python==3.7
source activate pointgroup

Install `PointGroup`

(1) Clone the PointGroup repository.

git clone https://github.com/liuzhengzhe/One-Thing-One-Click --recursive 
cd One-Thing-One-Click

(2) Install the dependent libraries.

pip install -r requirements.txt
conda install -c bioconda google-sparsehash

(3) For the SparseConv, we apply the implementation of spconv. The repository is recursively downloaded at step (1). We use the version 1.0 of spconv.

Note: The author of PointGroup further modified spconv\spconv\functional.py to make grad_output contiguous. Make sure you use our modified spconv.

To compile spconv, firstly install the dependent libraries.

conda install libboost
conda install -c daleydeng gcc-5 # need gcc-5.4 for sparseconv

Add the $INCLUDE_PATH$ that contains boost in lib/spconv/CMakeLists.txt. (Not necessary if it could be found.)

include_directories($INCLUDE_PATH$)

Compile the spconv library.

cd lib/spconv
python setup.py bdist_wheel

Run cd dist and use pip to install the generated .whl file.

(4) Compile the pointgroup_ops library.

cd lib/pointgroup_ops
python setup.py develop

If any header files could not be found, run the following commands.

python setup.py build_ext --include-dirs=$INCLUDE_PATH$
python setup.py develop

$INCLUDE_PATH$ is the path to the folder containing the header files that could not be found.

Data Preparation

Download the ScanNet v2 dataset.
Put the data in the corresponding folders.
Put the file scannetv2-labels.combined.tsv in the data/ folder.
Change the path in prepare_data_otoc.py Line 20.

cd data/
python prepare_data_otoc.py

Split the generated files into the data/train_weakly and data/val_weakly folders according to the ScanNet v2 train/val split.

Pretrained Model

We provide a pretrained model trained on ScanNet v2 dataset. Download it here. Its performance on ScanNet v2 validation set is 71.94 mIoU.

Inference and Evaluation

(1) 3D U-Net Evaluation

set the data_root in config/pointgroup_run1_scannet.yaml

cd 3D-U-Net
python test.py --config config/pointgroup_run1_scannet.yaml --pretrain pointgroup_run1_scannet-000001250.pth

Its performance on ScanNet v2 validation set is 68.96 mIoU.

(2) Relation Net Evaluation

cd relation
python test.py --config config/pointgroup_run1_scannet.yaml --pretrain pointgroup_run1_scannet-000002891_weight.pth

(3) Overall Evaluation

cd merge
python test.py --config config/pointgroup_run1_scannet.yaml

Self Training

(1) Train 3D U-Net

set the data_root/dataset in config/pointgroup_run1_scannet.yaml

cd 3D-U-Net
CUDA_VISIBLE_DEVICES=0 python train.py --config config/pointgroup_run1_scannet.yaml

(2) Generate features and predictions of 3D U-Net

CUDA_VISIBLE_DEVICES=0 python test_train.py --config config/pointgroup_run1_scannet.yaml --pretrain $PATH_TO_THE_MODEL$.pth

(3) Train Relation Net

set the data_root/dataset in config/pointgroup_run1_scannet.yaml

cd relation
CUDA_VISIBLE_DEVICES=0 python train.py --config config/pointgroup_run1_scannet.yaml

(4) Generate features and predictions of Relation Net

CUDA_VISIBLE_DEVICES=0 python test_train.py --config config/pointgroup_run1_scannet.yaml --pretrain $PATH_TO_THE_MODEL$_weight.pth

(5) Merge the Results via Graph Propagation

cd merge
CUDA_VISIBLE_DEVICES=0 python test_train.py --config config/pointgroup_run1_scannet.yaml

(6) Repeat from (1) to (5) for self-training for 3 to 5 times

Acknowledgement

This repo is built upon several repos, e.g., PointGrouop, SparseConvNet, spconv and ScanNet.

Contact

If you have any questions or suggestions about this repo, please feel free to contact me ([email protected]).

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

Related tags

Overview

One Thing One Click

One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation (CVPR2021)

Installation

Requirements

Virtual Environment

Install `PointGroup`

Data Preparation

Pretrained Model

Inference and Evaluation

Self Training

Acknowledgement

Contact

Owner

Code for the IJCAI 2021 paper "Structure Guided Lane Detection"

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

public repo for ESTER dataset and modeling (EMNLP'21)

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

PyTorch implementation of the R2Plus1D convolution based ResNet architecture described in the paper "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

SciFive: a text-text transformer model for biomedical literature

Image Matching Evaluation

A pre-trained language model for social media text in Spanish

Understanding Convolutional Neural Networks from Theoretical Perspective via Volterra Convolution

Zalo AI challenge 2021 task hum to song

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

Auto HMM: Automatic Discrete and Continous HMM including Model selection

Data cleaning, missing value handle, EDA use in this project

Bag of Tricks for Natural Policy Gradient Reinforcement Learning

Efficient 3D human pose estimation in video using 2D keypoint trajectories

This program can detect your face and add an Christams hat on the top of your head

Finite difference solution of 2D Poisson equation. Can handle Dirichlet, Neumann and mixed boundary conditions.

[AAAI2022] Source code for our paper《Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning》

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

Related tags

Overview

One Thing One Click

One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation (CVPR2021)

Installation

Requirements

Virtual Environment

Install PointGroup

Data Preparation

Pretrained Model

Inference and Evaluation

Self Training

Acknowledgement

Contact

Owner

Code for the IJCAI 2021 paper "Structure Guided Lane Detection"

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

public repo for ESTER dataset and modeling (EMNLP'21)

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

PyTorch implementation of the R2Plus1D convolution based ResNet architecture described in the paper "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

SciFive: a text-text transformer model for biomedical literature

Image Matching Evaluation

A pre-trained language model for social media text in Spanish

Understanding Convolutional Neural Networks from Theoretical Perspective via Volterra Convolution

Zalo AI challenge 2021 task hum to song

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

Auto HMM: Automatic Discrete and Continous HMM including Model selection

Data cleaning, missing value handle, EDA use in this project

Bag of Tricks for Natural Policy Gradient Reinforcement Learning

Efficient 3D human pose estimation in video using 2D keypoint trajectories

This program can detect your face and add an Christams hat on the top of your head

Finite difference solution of 2D Poisson equation. Can handle Dirichlet, Neumann and mixed boundary conditions.

[AAAI2022] Source code for our paper《Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning》

Install `PointGroup`