[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Last update: Jan 04, 2023

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Lukas Koestler¹* Nan Yang¹,²*,† Niclas Zeller²,³ Daniel Cremers¹,²

*equal contribution †corresponding author

¹Technical University of Munich ²Artisense
³Karlsruhe University of Applied Sciences

Conference on Robot Learning (CoRL) 2021, London, UK

3DV 2021 Best Demo Award

arXiv | Video | OpenReview | Project Page

Code and Data

📣 CVA-MVSNet released! Please check cva_mvsnet/.
📣 Replica training data released! Please check replica/.
The C++ code will be released before Christmas. Thank you for your patience!

Abstract

In this paper, we present TANDEM, a real-time monocular tracking and dense mapping framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth predictions. To predict the dense depth maps, we propose Cascade View-Aggregation MVSNet (CVA-MVSNet), which utilizes the entire active keyframe window by hierarchically constructing 3D cost volumes with adaptive view aggregation to balance the different stereo baselines between the keyframes. Finally, the predicted depth maps are fused into a consistent global map represented as a truncated signed distance function (TSDF) voxel grid. Our experimental results show that TANDEM outperforms other state-of-the-art traditional and learning-based monocular visual odometry (VO) methods in terms of camera tracking. Moreover, TANDEM shows state-of-the-art real-time 3D reconstruction performance.
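To make the last step of the abstract more concrete, here is a minimal NumPy sketch of fusing one depth map into a TSDF voxel grid with a weighted running average. It is not the code in this repository and says nothing about how TANDEM implements fusion: the function name `integrate_depth`, its parameters, the unit per-frame weight, and the nearest-pixel lookup are all assumptions chosen for illustration.

```python
import numpy as np


def integrate_depth(tsdf, weights, depth, K, T_cam_world, origin, voxel_size, trunc):
    """Fuse one depth map into a TSDF grid (hypothetical helper, illustration only).

    tsdf, weights : (nx, ny, nz) arrays with the current TSDF values and weights
    depth         : (h, w) depth map in meters, 0 marks invalid pixels
    K             : (3, 3) pinhole intrinsics
    T_cam_world   : (4, 4) transform from world to camera coordinates
    origin        : (3,) world position of the grid corner
    voxel_size    : edge length of a voxel in meters
    trunc         : truncation distance in meters
    Returns updated copies of tsdf and weights.
    """
    nx, ny, nz = tsdf.shape
    h, w = depth.shape

    # World coordinates of all voxel centers, shape (N, 3).
    idx = np.stack(
        np.meshgrid(np.arange(nx), np.arange(ny), np.arange(nz), indexing="ij"),
        axis=-1,
    ).reshape(-1, 3)
    pts_world = np.asarray(origin, dtype=float) + (idx + 0.5) * voxel_size

    # Move the voxel centers into the camera frame.
    R, t = T_cam_world[:3, :3], T_cam_world[:3, 3]
    pts_cam = pts_world @ R.T + t
    z = pts_cam[:, 2]
    valid = z > 1e-6

    # Project onto the image plane (nearest pixel); avoid dividing by z <= 0.
    z_safe = np.where(valid, z, 1.0)
    u = np.round(K[0, 0] * pts_cam[:, 0] / z_safe + K[0, 2]).astype(int)
    v = np.round(K[1, 1] * pts_cam[:, 1] / z_safe + K[1, 2]).astype(int)
    valid &= (u >= 0) & (u < w) & (v >= 0) & (v < h)

    # Look up the measured depth for every voxel that lands inside the image.
    d = np.zeros_like(z)
    d[valid] = depth[v[valid], u[valid]]
    valid &= d > 0

    # Signed distance along the viewing ray; skip voxels far behind the surface.
    sdf = d - z
    valid &= sdf >= -trunc
    tsdf_obs = np.clip(sdf / trunc, -1.0, 1.0)

    # Weighted running average with a unit weight per observation (assumption).
    new_tsdf = tsdf.reshape(-1).astype(float)
    new_w = weights.reshape(-1).astype(float)
    w_old = new_w[valid]
    new_tsdf[valid] = (w_old * new_tsdf[valid] + tsdf_obs[valid]) / (w_old + 1.0)
    new_w[valid] = w_old + 1.0
    return new_tsdf.reshape(tsdf.shape), new_w.reshape(weights.shape)
```

Calling this once per keyframe depth map, with the pose estimated by the tracker, accumulates the predictions into a single grid; a surface can then be extracted at the zero crossing of the TSDF.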

Owner

TUM Computer Vision Group
