This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Last update: Dec 01, 2022

Related tags

Deep Learning contrastive_association

Overview

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans.

The approach builds on top of an arbitrary single-scan Panoptic Segmentation network and extends it to the temporal domain by associating instances across time using our Contrastive Aggregation network that leverages the point-wise features from the panoptic network.

Requirements

Install this package: go to the root directory of this repo and run:

pip3 install -U -e .

Install packages in requirements.txt.
Install MinkowskiEngine.
Install spconv version 1.2.1.

Data preparation

Download the SemanticKITTI dataset inside the directory data/kitti/. The directory structure should look like this:

./
└── data/
    └── kitti
        └── sequences
            ├── 00/           
            │   ├── velodyne/	
            |   |	├── 000000.bin
            |   |	├── 000001.bin
            |   |	└── ...
            │   └── labels/ 
            |       ├── 000000.label
            |       ├── 000001.label
            |       └── ...
            ├── 08/ # for validation
            ├── 11/ # 11-21 for testing
            └── 21/
                └── ...

Pretrained models

Pretrained Panoptic Segmentation model.
Pretrained Contrastive Aggregation model.

Reproducing the results

Run the evaluation script, which will compute the metrics for the validation set:

python evaluate_4dpanoptic.py --ckpt_ps path/to/panoptic_weights --ckpt_ag path/to/aggregation_weights

Training

Create instances dataset

Since we use a frozen Panoptic Segmentation Network, to avoid running the forward pass during training, we save the instance predictions and the point features in advance running:

python save_panoptic_features.py --ckpt path/to/panoptic_weights

This will create a directory in cont_assoc/data/instance_features with the same structure as Kitti but containing, for each sequence of the train set, npy files containing the instance points, labels and features for each scan.

Save validation predictions

To get the 4D Panoptic Segmentation performance for the validation step during training, we save the full predictions for the validation set (sequence 08) running:

python save_panoptic_features.py --ckpt path/to/panoptic_weights --save_val_pred

This will create a directory in cont_assoc/data/validation_predictions with npy files for each scan of the validation sequence containing the semantic and instance predictions for each point.

Train Contrastive Aggregation Network

Once the instance dataset and the validation predictions are generated, we're ready to train the Contrastive Aggregation Network running:

python train_aggregation.py

All the configurations are in the config/contrastive_instances.yaml file.

Citation

If you use this repo, please cite as :

@article{marcuzzi2022ral,
  author = {Rodrigo Marcuzzi and Lucas Nunes and Louis Wiesmann and Ignacio Vizzo and Jens Behley and Cyrill Stachniss},
  title = {{Contrastive Instance Association for 4D Panoptic Segmentation \\ using Sequences of 3D LiDAR Scans}},
  journal = {IEEE Robotics and Automation Letters (RA-L)},
  year = 2022,
  volume={7},
  number={2},
  pages={1550-1557},
}

Acknowledgments

The Panoptic Segmentation Network used in this repo is DS-Net.

The loss function it's a modified version of SupContrast.

License

This project is free software made available under the MIT License. For details see the LICENSE file.

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Related tags

Overview

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Requirements

Data preparation

Pretrained models

Reproducing the results

Training

Create instances dataset

Save validation predictions

Train Contrastive Aggregation Network

Citation

Acknowledgments

License

Owner

Photogrammetry & Robotics Bonn

Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand

This repository will be a summary and outlook on all our open, medical, AI advancements.

[CVPR 2021] Released code for Counterfactual Zero-Shot and Open-Set Visual Recognition

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

🧮 Matrix Factorization for Collaborative Filtering is just Solving an Adjoint Latent Dirichlet Allocation Model after All

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

Experiments for Operating Systems Lab (ETCS-352)

Official Python implementation of the FuzionCoin protocol

A tensorflow implementation of GCN-LPA

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors

The source codes for TME-BNA: Temporal Motif-Preserving Network Embedding with Bicomponent Neighbor Aggregation.

A simple and lightweight genetic algorithm for optimization of any machine learning model

Analysis code and Latex source of the manuscript describing the conditional permutation test of confounding bias in predictive modelling.

[NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with ONNX, TensorRT, ncnn, and OpenVINO supported.

Code for "Unsupervised Source Separation via Bayesian inference in the latent domain"

Segmentation models with pretrained backbones. Keras and TensorFlow Keras.

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions