[ICCV'21] PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

Last update: Nov 30, 2022

Overview

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

This is the official implementation of our ICCV 2021 paper

News

There maybe some bugs in the current public code and I am trying my best to solve them.

Contact me if you have any question.

TODO

Supplement 2D/3D visualization code.

Getting Started

Clone the repository:

git clone https://github.com/IceTTTb/PlaneTR3D.git

We use Python 3.6 and PyTorch 1.6.0 in our implementation, please install dependencies:

conda create -n planeTR python=3.6
conda activate planeTR
conda install pytorch=1.6.0 torchvision=0.7.0 torchaudio cudatoolkit=10.2 -c pytorch
pip install -r requirements.txt

Data Preparation

We train and test our network on the plane dataset created by PlaneNet. We follow PlaneAE to convert the .tfrecords to .npz files. Please refer to PlaneAE for more details.

We generate line segments using the state-of-the-art line segment detection algorithm HAWP with their pretrained model. The processed line segments data we used can be downloaded here.

The structure of the data folder should be

plane_data/
  --train/*.npz
  --train_img/*
  --val/*.npz
  --val_img/*
  --train.txt
  --val.txt

Training

Download the pretrained model of HRNet and place it under the 'ckpts/' folder.

Change the 'root_dir' in config files to the path where you save the data.

Run the following command to train our network on one GPU:

CUDA_VISIBLE_DEVICES=0 python train_planeTR.py

Run the following command to train our network on multiple GPUs:

CUDA_VISIBLE_DEVICES=0,1,2 python -m torch.distributed.launch --nproc_per_node=3 --master_port 295025 train_planeTR.py

Evaluation

Download the pretrained model here and place it under the 'ckpts/' folder.

Change the 'resume_dir' in 'config_planeTR_eval.yaml' to the path where you save the weight file.

Change the 'root_dir' in config files to the path where you save the data.

Run the following command to evaluate the performance:

CUDA_VISIBLE_DEVICES=0 python eval_planeTR.py

Citations

If you find our work useful in your research, please consider citing:

@inproceedings{tan2021planeTR,
title={PlaneTR: Structure-Guided Transformers for 3D Plane Recovery},
author={Tan, Bin and Xue, Nan and Bai, Song and Wu, Tianfu and Xia, Gui-Song},
booktitle = {International Conference on Computer Vision},
year={2021}
}

Contact

[email protected]

https://xuenan.net/

Acknowledgements

We thank the authors of PlaneAE, PlaneRCNN, interplane and DETR. Our implementation is heavily built upon their codes.

[ICCV'21] PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

Related tags

Overview

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

News

TODO

Getting Started

Data Preparation

Training

Evaluation

Citations

Contact

Acknowledgements

Owner

Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.

Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning"

chen2020iros: Learning an Overlap-based Observation Model for 3D LiDAR Localization.

SuMa++: Efficient LiDAR-based Semantic SLAM (Chen et al IROS 2019)

This repository includes the code of the sequence-to-sequence model for discontinuous constituent parsing described in paper Discontinuous Grammar as a Foreign Language.

A Python framework for developing parallelized Computational Fluid Dynamics software to solve the hyperbolic 2D Euler equations on distributed, multi-block structured grids.

CNN visualization tool in TensorFlow

Codes and pretrained weights for winning submission of 2021 Brain Tumor Segmentation (BraTS) Challenge

Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities.

QuanTaichi evaluation suite

CVPR 2021 Challenge on Super-Resolution Space

Azua - build AI algorithms to aid efficient decision-making with minimum data requirements.

Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021

All supplementary material used by me while TA-ing CS3244: Machine Learning

The implementation for paper Joint t-SNE for Comparable Projections of Multiple High-Dimensional Datasets.

Material for my PyConDE & PyData Berlin 2022 Talk "5 Steps to Speed Up Your Data-Analysis on a Single Core"

Deep motion generator collections

[ArXiv 2021] One-Shot Generative Domain Adaptation

Code for the Convolutional Vision Transformer (ConViT)