MonoRCNN is a monocular 3D object detection method for autonomous driving

Last update: Dec 27, 2022

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for autonomous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3

Please use the Detectron2 included in this project. To ignore fully occluded objects during training, build.py, rpn.py, and roi_heads.py have been modified.
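
To confirm that an environment matches the versions listed above, a small check along the following lines may help. This is only a sketch: the helper names find_mismatches and installed_versions are hypothetical and not part of this project, and it assumes that each package exposes a __version__ attribute.

```python
import sys

# Versions listed in the Installation section.
REQUIRED = {"python": "3.6", "torch": "1.5.0", "detectron2": "0.1.3"}


def find_mismatches(installed, required=REQUIRED):
    """Return {name: (found, wanted)} for every entry that does not match.

    A found version matches if it equals the wanted one or extends it with
    a further component or a local build tag (e.g. "1.5.0+cu101").
    """
    mismatches = {}
    for name, wanted in required.items():
        found = installed.get(name)
        ok = found is not None and (
            found == wanted
            or found.startswith(wanted + ".")
            or found.startswith(wanted + "+")
        )
        if not ok:
            mismatches[name] = (found, wanted)
    return mismatches


def installed_versions():
    """Collect the running Python version and any importable package versions."""
    versions = {"python": "{}.{}".format(*sys.version_info[:2])}
    for name in ("torch", "detectron2"):
        try:
            module = __import__(name)
            # Assumption: the package exposes __version__.
            versions[name] = getattr(module, "__version__", None)
        except ImportError:
            versions[name] = None  # package is not installed
    return versions
```

Calling find_mismatches(installed_versions()) returns an empty dict when everything lines up, and otherwise maps each offending name to the version found and the version expected.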

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...
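
Before running the test command, it can be worth checking that the downloaded files ended up where the tree above places them. The sketch below assumes only the two entries named in the tree (model and log.txt); the function name missing_outputs is hypothetical, and since the tree does not say whether model is a file or a directory, the check only tests for existence.

```python
from pathlib import Path

# Entries named in the tree above; anything under "..." is not checked.
EXPECTED_ENTRIES = ["model", "log.txt"]


def missing_outputs(repo_root="."):
    """Return the expected entries that are absent from projects/MonoRCNN/output."""
    output_dir = Path(repo_root) / "projects" / "MonoRCNN" / "output"
    return [name for name in EXPECTED_ENTRIES if not (output_dir / name).exists()]
```

An empty list from missing_outputs() means both entries are present relative to the repository root.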

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE to True to visualize the 3D object detection results, which are saved in output/evaluation/test/visualization.

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1
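
The test and training commands differ only in the --resume and --eval-only flags. For scripted runs, the two invocations could be wrapped as in the following sketch. The helpers build_command and run are hypothetical and not part of this project; they only assemble the commands shown in this README.

```python
import subprocess

CONFIG = "config/MonoRCNN_KITTI.yaml"


def build_command(eval_only, num_gpus=1):
    """Assemble the main.py invocation for training or evaluation."""
    cmd = ["./main.py", "--config-file", CONFIG, "--num-gpus", str(num_gpus)]
    if eval_only:
        # Evaluation resumes from the saved model and skips training.
        cmd += ["--resume", "--eval-only"]
    return cmd


def run(eval_only, project_dir="projects/MonoRCNN"):
    """Run the command from the project directory, raising on a non-zero exit.

    On POSIX, a relative executable such as ./main.py is resolved relative to cwd.
    """
    return subprocess.run(build_command(eval_only), cwd=project_dir, check=True)
```

For example, run(eval_only=True) corresponds to the Test command and run(eval_only=False) to the Training command, both launched from the repository root.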

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

Contact

[email protected]
