《Train in Germany, Test in The USA: Making 3D Object Detectors Generalize》(CVPR 2020)

Last update: Jan 02, 2023

Related tags

Overview

Train in Germany, Test in The USA: Making 3D Object Detectors Generalize

This paper has been accpeted by Conference on Computer Vision and Pattern Recognition (CVPR) 2020.

by Yan Wang*, Xiangyu Chen*, Yurong You, Li Erran, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao*

Dependencies

Usage

Prepare Datasets (Jupyter notebook)

We develop our method on these datasets:

Configure dataset_path in config_path.py.

Raw datasets will be organized as the following structure:

 dataset_path/
     | kitti/               # KITTI object detection 3D dataset
         | training/
         | testing/
     | argo/                # Argoverse dataset v1.1
         | train1/
         | train2/
         | train3/
         | train4/
         | val/
         | test/
     | nusc/                # nuScenes dataset v1.0
         | maps/
         | samples/
         | sweeps/
         | v1.0-trainval/
     | lyft/                # Lyft Level 5 dataset v1.02
         | v1.02-train/
     | waymo/               # Waymo dataset v1.0
         | training/
         | validation/

Download all datasets.

For KITTI, Argoverse and Waymo, we provide scripts for automatic download.
```
cd scripts/
python download.py [--datasets kitti+argo+waymo]
```
nuScenes and Lyft need to downloaded manually.

Convert all datasets to KITTI format.

cd scripts/
python -m pip install -r convert_requirements.txt
python convert.py [--datasets argo+nusc+lyft+waymo]

Split validation set

We provide the train/val split used in our experiments under split folder.
```
cd split/
python replace_split.py
```
Generate car subset

We filter scenes and only keep those with cars.
```
cd scripts/
python gen_car_split.py
```

Statistical Normalization (Jupyter notebook)

Compute car size statistics of each dataset. The computed statistics are stored as label_stats_{train/val/test}.json under KITTI format dataset root.
```
cd stat_norm/
python stat.py
```
Generate rescaled datasets according to car size statistics. The rescaled datasets are stored under $dataset_path/rescaled_datasets by default.
```
cd stat_norm/
python norm.py [--path $PATH]
```

Training (To be updated)

We use PointRCNN to validate our method.

Setup PointRCNN
```
cd pointrcnn/
./build_and_install.sh
```

Build datasets in PointRCNN format.

cd pointrcnn/tools/
python generate_multi_data.py
python generate_gt_database.py --root ...

Download the models pretrained on source domains from google drive using gdrive.
```
cd pointrcnn/tools/
gdrive download -r 14MXjNImFoS2P7YprLNpSmFBsvxf5J2Kw
```

Adapt to a new domain by re-training with rescaled data.

cd pointrcnn/tools/

python train_rcnn.py --cfg_file ...

Inference

cd pointrcnn/tools/
python eval_rcnn.py --ckpt /path/to/checkpoint.pth --dataset $dataset --output_dir $output_dir

Evaluation

We provide evaluation code with

old (based on bbox height) and new (based on distance) difficulty metrics
output transformation functions to locate domain gap

python evaluate/
python evaluate.py --result_path $predictions --dataset_path $dataset_root --metric [old/new]

Citation

@inproceedings{wang2020train,
  title={Train in germany, test in the usa: Making 3d object detectors generalize},
  author={Yan Wang and Xiangyu Chen and Yurong You and Li Erran and Bharath Hariharan and Mark Campbell and Kilian Q. Weinberger and Wei-Lun Chao},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={11713-11723},
  year={2020}
}

《Train in Germany, Test in The USA: Making 3D Object Detectors Generalize》(CVPR 2020)

Related tags

Overview

Train in Germany, Test in The USA: Making 3D Object Detectors Generalize

Dependencies

Usage

Prepare Datasets (Jupyter notebook)

Statistical Normalization (Jupyter notebook)

Training (To be updated)

Inference

Evaluation

Citation

Owner

Xiangyu Chen

Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions

A graph-to-sequence model for one-step retrosynthesis and reaction outcome prediction.

Improving Compound Activity Classification via Deep Transfer and Representation Learning

Deep High-Resolution Representation Learning for Human Pose Estimation

TLDR; Train custom adaptive filter optimizers without hand tuning or extra labels.

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

A whale detector design for the Kaggle whale-detector challenge!

TACTO: A Fast, Flexible and Open-source Simulator for High-Resolution Vision-based Tactile Sensors

PyTorch Code for "Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning"

Gesture Volume Control Using OpenCV and MediaPipe

Official Implementation of PCT

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Elastic weight consolidation technique for incremental learning.

A code repository associated with the paper A Benchmark for Rough Sketch Cleanup by Chuan Yan, David Vanderhaeghe, and Yotam Gingold from SIGGRAPH Asia 2020.

Code of TIP2021 Paper《SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition》. We provide both MxNet and Pytorch versions.

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Convert onnx models to pytorch.

TensorFlow tutorials and best practices.

The implementation of FOLD-R++ algorithm

Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning