KinectFusion implemented in Python with PyTorch

Last update: Jan 03, 2023

Overview

This is a lightweight Python implementation of KinectFusion. All the core functions (TSDF volume, frame-to-model tracking, point-to-plane ICP, raycasting, TSDF fusion, etc.) are implemented using pure PyTorch, i.e. no custom CUDA kernels.
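As an illustration of how one of these components can be written with plain tensor operations, here is a minimal sketch of a weighted running-average TSDF update. It is not the code from this repository: the function name fuse_tsdf, its arguments, the per-frame weight of 1 and the max_weight cap are all assumptions made for the example.

```python
import torch


def fuse_tsdf(tsdf, weight, sdf_obs, trunc, max_weight=64.0):
    """Fuse one frame into a TSDF volume (hypothetical helper, not the repo's API).

    tsdf, weight: current volume state, tensors of identical shape.
    sdf_obs: per-voxel signed distance to the observed surface in metres,
             NaN where the voxel has no valid depth measurement.
    trunc: truncation distance in metres.
    """
    # Skip voxels without a measurement or far behind the observed surface.
    valid = torch.isfinite(sdf_obs) & (sdf_obs > -trunc)
    # Normalise and truncate the observation to [-1, 1].
    tsdf_obs = torch.clamp(sdf_obs / trunc, -1.0, 1.0)
    # Running weighted average, assuming each new observation has weight 1.
    new_weight = weight + 1.0
    fused = (tsdf * weight + tsdf_obs) / new_weight
    tsdf = torch.where(valid, fused, tsdf)
    weight = torch.where(valid, torch.clamp(new_weight, max=max_weight), weight)
    return tsdf, weight


# Tiny usage example on a 2 x 2 x 2 volume with one unobserved voxel.
tsdf = torch.ones(2, 2, 2)
weight = torch.zeros(2, 2, 2)
sdf_obs = torch.full((2, 2, 2), 0.03)
sdf_obs[0, 0, 0] = float("nan")
tsdf, weight = fuse_tsdf(tsdf, weight, sdf_obs, trunc=0.06)
```

Because every step is an element-wise tensor operation, the same code runs on CPU or GPU depending on where the tensors live, which is the property the paragraph above relies on.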

Even without custom CUDA kernels, the system runs reasonably fast: the demo reconstructs the TUM fr1_desk sequence into a 225 x 171 x 111 TSDF volume at 2 cm resolution at around 17 FPS on a single RTX 2080 GPU (about 1.5 FPS in CPU mode).
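To put the volume size in perspective, the short calculation below derives the voxel count and physical extent from the figures quoted above. The memory estimate assumes one float32 TSDF value and one float32 weight per voxel, which is an assumption for illustration rather than a statement about how this project stores its volume.

```python
# Volume figures quoted above: 225 x 171 x 111 voxels at 2 cm per voxel.
dims = (225, 171, 111)
voxel_size_m = 0.02

num_voxels = dims[0] * dims[1] * dims[2]
extent_m = tuple(round(d * voxel_size_m, 2) for d in dims)

# Assumption: 4 bytes of TSDF plus 4 bytes of weight per voxel
# (colour or other channels would add to this).
bytes_per_voxel = 4 + 4
memory_mib = num_voxels * bytes_per_voxel / 2**20

print(f"voxels: {num_voxels}")
print(f"extent in metres: {extent_m}")
print(f"approx. memory: {memory_mib:.1f} MiB")
```

That is roughly 4.27 million voxels covering a 4.5 m x 3.42 m x 2.22 m region, small enough to fit comfortably in GPU memory under the stated assumption.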

Note that this project is mainly intended for study purposes and is not fully optimized for accurate camera tracking.

Requirements

The core functionality is implemented in PyTorch (1.10). Open3D (0.14.0) is used for visualisation. Other important dependencies include:

numpy==1.21.2
opencv-python==4.5.5
imageio==2.14.1
scikit-image==0.19.1
trimesh==3.9.43

You can create an anaconda environment called kinfu with the required dependencies by running:

conda env create -f environment.yml
conda activate kinfu

Data Preparation

The code was tested on the TUM RGB-D dataset. After downloading the raw sequences, you will need to run the pre-processing script under dataset/. For example:

python dataset/preprocess.py --config configs/fr1_desk.yaml

There are some example config files under configs/, each corresponding to a different sequence. Set data_root in the config to your own sequence directory before running the script. After the script finishes, a new directory processed/ will appear under your sequence directory.
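For context, raw TUM RGB-D sequences record colour and depth frames with separate timestamps, and depth PNGs store depth scaled by a factor of 5000. Pre-processing such a sequence therefore usually involves pairing frames by timestamp and converting depth to metres. The sketch below shows that idea only; whether dataset/preprocess.py works this way is an assumption, and the function name associate and the 0.02 s threshold are chosen for the example.

```python
def associate(rgb_stamps, depth_stamps, max_diff=0.02):
    """Greedily pair RGB and depth timestamps closer than max_diff seconds."""
    candidates = sorted(
        (abs(r - d), r, d)
        for r in rgb_stamps
        for d in depth_stamps
        if abs(r - d) < max_diff
    )
    used_rgb, used_depth, pairs = set(), set(), []
    # Take the closest pairs first and use each frame at most once.
    for _, r, d in candidates:
        if r not in used_rgb and d not in used_depth:
            used_rgb.add(r)
            used_depth.add(d)
            pairs.append((r, d))
    return sorted(pairs)


# TUM depth images store depth in units of 1/5000 m.
DEPTH_SCALE = 5000.0

rgb = [1.00, 1.04, 1.10]
depth = [1.01, 1.05, 1.30]
pairs = associate(rgb, depth)

raw_depth_value = 5000
depth_m = raw_depth_value / DEPTH_SCALE
```

Here the third RGB frame has no depth frame within the threshold, so it is dropped rather than paired with a stale measurement.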

Run

After obtaining the processed sequence, you can simply run kinfu.py. For example:

python kinfu.py --config configs/fr1_desk.yaml --save_dir reconstruct/fr1_desk

This performs tracking and mapping headlessly and saves the results. If you want to visualize the tracking and reconstruction process on the fly, run the GUI version instead:

python kinfu_gui.py --config configs/fr1_desk.yaml

Acknowledgement

Part of the tracking code was borrowed and modified from DeepIC. Thanks also to Binbin Xu for implementing part of the TSDF volume code, which was inspired by Andy Zeng's tsdf-fusion-python.

Owner

Jingwen Wang
