A planar RGB-D SLAM system that uses Manhattan World structure to estimate the camera pose trajectory while also providing a sparse reconstruction (points, lines and planes) and a dense surfel-based reconstruction.

Last update: Dec 28, 2022

ManhattanSLAM

Authors: Raza Yunus, Yanyan Li and Federico Tombari

ManhattanSLAM is a real-time SLAM library for RGB-D cameras that computes the camera pose trajectory, a sparse 3D reconstruction (containing point, line and plane features) and a dense surfel-based 3D reconstruction. Further details can be found in the related publication. The code is based on ORB-SLAM2.

Related Publication:

Raza Yunus, Yanyan Li and Federico Tombari, ManhattanSLAM: Robust Planar Tracking and Mapping Leveraging Mixture of Manhattan Frames, in 2021 IEEE International Conference on Robotics and Automation (ICRA).

1. License

ManhattanSLAM is released under a GPLv3 license. For a list of all code/library dependencies (and associated licenses), please see Dependencies.md.

If you use ManhattanSLAM in an academic work, please cite:

@inproceedings{yunus2021manhattanslam,
    author = {Raza Yunus and Yanyan Li and Federico Tombari},
    title = {ManhattanSLAM: Robust Planar Tracking and Mapping Leveraging Mixture of Manhattan Frames},
    year = {2021},
    booktitle = {2021 IEEE International Conference on Robotics and Automation (ICRA)},
}

2. Prerequisites

We have tested the library on Ubuntu 16.04, but it should be easy to compile on other platforms. A powerful computer (e.g. i7) will ensure real-time performance and provide more stable and accurate results. The following is the list of dependencies for ManhattanSLAM, with the versions we tested:

OpenCV: 3.3.0
PCL: 1.7.2
Eigen3: 3.3
DBoW2: Included in Thirdparty folder
g2o: Included in Thirdparty folder
Pangolin
tinyply

3. Building and testing

Clone the repository:

git clone https://github.com/razayunus/ManhattanSLAM

There is a script build.sh to build the Thirdparty libraries and ManhattanSLAM. Please make sure you have installed all required dependencies (see section 2). Execute:

cd ManhattanSLAM
chmod +x build.sh
./build.sh

This will create libManhattanSLAM.so in the lib folder and the executable manhattan_slam in the Example folder.

To test the system:

Download a sequence for one of the following datasets and uncompress it:
- TUM RGB-D: https://vision.in.tum.de/data/datasets/rgbd-dataset
- ICL-NUIM: https://www.doc.ic.ac.uk/~ahanda/VaFRIC/iclnuim.html
- TAMU RGB-D: http://telerobot.cs.tamu.edu/MFG/rgbd/livo/data.html

Associate RGB images and depth images using the Python script associate.py. You can generate an associations file by executing:

python associate.py PATH_TO_SEQUENCE/rgb.txt PATH_TO_SEQUENCE/depth.txt > associations.txt
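
To illustrate what the association step does: each of rgb.txt and depth.txt lists one "timestamp filename" pair per line, and the RGB and depth timestamps do not coincide exactly, so the frames must be paired by time. The sketch below is not the associate.py script itself. It is a minimal illustration that assumes a greedy closest-first match; the function names and the 0.02 s tolerance `max_diff` are hypothetical choices made for this example.

```python
def read_file_list(text):
    """Parse 'timestamp filename' lines, skipping blank lines and '#' comments."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        stamp, name = line.split()[:2]
        entries[float(stamp)] = name
    return entries


def associate(rgb, depth, max_diff=0.02):
    """Pair RGB and depth entries whose timestamps differ by less than max_diff.

    Candidate pairs are sorted by time difference and accepted greedily,
    so each RGB frame and each depth frame is used at most once.
    """
    candidates = sorted(
        (abs(a - b), a, b)
        for a in rgb
        for b in depth
        if abs(a - b) < max_diff
    )
    used_rgb, used_depth, matches = set(), set(), []
    for _, a, b in candidates:
        if a not in used_rgb and b not in used_depth:
            used_rgb.add(a)
            used_depth.add(b)
            matches.append((a, rgb[a], b, depth[b]))
    return sorted(matches)


if __name__ == "__main__":
    # Tiny made-up file lists, for illustration only.
    rgb = read_file_list("# color images\n1.00 rgb/1.00.png\n1.10 rgb/1.10.png\n")
    depth = read_file_list("1.01 depth/1.01.png\n1.50 depth/1.50.png\n")
    for t_rgb, f_rgb, t_depth, f_depth in associate(rgb, depth):
        print(t_rgb, f_rgb, t_depth, f_depth)
```

Each printed line has the form "rgb_timestamp rgb_file depth_timestamp depth_file". RGB frames with no depth frame inside the tolerance are dropped.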

Execute the following command. Change Config.yaml to ICL.yaml for ICL-NUIM sequences, TAMU.yaml for TAMU RGB-D sequences, or TUM1.yaml, TUM2.yaml or TUM3.yaml for the freiburg1, freiburg2 and freiburg3 sequences of TUM RGB-D respectively. Change PATH_TO_SEQUENCE_FOLDER to the uncompressed sequence folder. Change ASSOCIATIONS_FILE to the path to the corresponding associations file.

./Example/manhattan_slam Vocabulary/ORBvoc.txt Example/Config.yaml PATH_TO_SEQUENCE_FOLDER ASSOCIATIONS_FILE
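
When running several TUM RGB-D sequences, choosing the matching config file by hand is easy to get wrong. The helper below is a hypothetical convenience sketch, not part of ManhattanSLAM. It assumes the sequence folder name contains "freiburg1", "freiburg2" or "freiburg3", and that the config files sit next to Config.yaml in the Example folder, as the command above suggests. It only assembles the command line and does not run the executable.

```python
import shlex

# Assumed mapping from a substring of the sequence folder name to the config
# file named in the instructions; the substrings are an assumption.
CONFIG_BY_KEYWORD = [
    ("freiburg1", "TUM1.yaml"),
    ("freiburg2", "TUM2.yaml"),
    ("freiburg3", "TUM3.yaml"),
]


def pick_tum_config(sequence_dir):
    """Return the TUM config file whose keyword appears in the folder name."""
    for keyword, config in CONFIG_BY_KEYWORD:
        if keyword in sequence_dir:
            return config
    raise ValueError(f"cannot infer a TUM config from {sequence_dir!r}")


def build_command(sequence_dir, associations_file, config):
    """Assemble the manhattan_slam argument list in the order shown above."""
    return [
        "./Example/manhattan_slam",
        "Vocabulary/ORBvoc.txt",
        f"Example/{config}",
        sequence_dir,
        associations_file,
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    seq = "data/rgbd_dataset_freiburg3_long_office_household"
    cmd = build_command(seq, f"{seq}/associations.txt", pick_tum_config(seq))
    print(shlex.join(cmd))
```

For ICL-NUIM or TAMU RGB-D sequences, pass "ICL.yaml" or "TAMU.yaml" to build_command directly.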
