Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Last update: Jan 03, 2023

Overview

Volumetric TSDF Fusion of RGB-D Images in Python

This is a lightweight python script that fuses multiple registered color and depth images into a projective truncated signed distance function (TSDF) volume, which can then be used to create high quality 3D surface meshes and point clouds. Tested on Ubuntu 16.04.

An older CUDA/C++ version can be found here.

Requirements

Python 2.7+ with NumPy, PyCUDA, OpenCV, Scikit-image and Numba. These can be quickly installed/updated by running the following:
```
pip install --user numpy opencv-python scikit-image numba
```
[Optional] GPU acceleration requires an NVIDA GPU with CUDA and PyCUDA:
```
pip install --user pycuda
```

Demo

This demo fuses 1000 RGB-D images from the 7-scenes dataset into a 405 x 264 x 289 projective TSDF voxel volume with 2cm resolution at about 30 FPS in GPU mode (0.4 FPS in CPU mode), and outputs a 3D mesh mesh.ply which can be visualized with a 3D viewer like Meshlab.

Note: color images are saved as 24-bit PNG RGB, depth images are saved as 16-bit PNG in millimeters.

python demo.py

Seen In

References

Citing

This repository is a part of 3DMatch Toolbox. If you find this code useful in your work, please consider citing:

@inproceedings{zeng20163dmatch,
    title={3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions},
    author={Zeng, Andy and Song, Shuran and Nie{\ss}ner, Matthias and Fisher, Matthew and Xiao, Jianxiong and Funkhouser, Thomas},
    booktitle={CVPR},
    year={2017}
}

Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Related tags

Overview

Volumetric TSDF Fusion of RGB-D Images in Python

Requirements

Demo

Seen In

References

Citing

Owner

Andy Zeng

Human Pose Detection on EdgeTPU

Official repository for "Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems"

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

So-ViT: Mind Visual Tokens for Vision Transformer

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

Implements pytorch code for the Accelerated SGD algorithm.

Data manipulation and transformation for audio signal processing, powered by PyTorch

This is the official implementation for the paper "Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization" in NeurIPS 2021.

The codes of paper 'Active-LATHE: An Active Learning Algorithm for Boosting the Error exponent for Learning Homogeneous Ising Trees'

This project aims at building a real-time wide band channel sounder using USRPs

Official implementation of UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

Create time-series datacubes for supervised machine learning with ICEYE SAR images.

Create and implement a deep learning library from scratch.

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.

BoxInst: High-Performance Instance Segmentation with Box Annotations

[NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"