Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Last update: Oct 20, 2022

Overview

TFLite-HITNET-Stereo-depth-estimation

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Stereo depth estimation on the cones images from the Middlebury dataset (https://vision.middlebury.edu/stereo/data/scenes2003/)

Requirements

OpenCV, imread-from-url and tensorflow==2.6.0 or tflite_runtime. Also, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

For the tflite runtime, you can either use tensorflow(make sure it is version 2.6.0 or above) pip install tensorflow==2.6.0 or the TensorFlow Runtime binary

Known issues

In computers with a GPU, the program would silently creash without any error during the inference, os.environ["CUDA_VISIBLE_DEVICES"]="-1" is added at the beginning of the script to force the program to run on the CPU. You can comment this line for other types of devices.

tflite model

The original models were converted to different formats (including .tflite) by PINTO0309, download the models from his repository and save them into the models folder.

Original Tensorflow model

The Tensorflow pretrained model was taken from the original repository.

Examples

Image inference:

python imageDepthEstimation.py

Video inference:

python videoDepthEstimation.py

DrivingStereo dataset inference:

python drivingStereoTest.py

Pytorch inference

For performing the inference in Tensorflow, check my other repository HITNET Stereo Depth estimation.

ONNX inference

For performing the inference in ONNX, check my other repository ONNX HITNET Stereo Depth estimation.

Inference video Example Raspberry Pi 4

References:

Hitnet model: https://github.com/google-research/google-research/tree/master/hitnet
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
DrivingStereo dataset: https://drivingstereo-dataset.github.io/
Original paper: https://arxiv.org/abs/2007.12140

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Related tags

Overview

TFLite-HITNET-Stereo-depth-estimation

Requirements

Installation

Known issues

tflite model

Original Tensorflow model

Examples

Pytorch inference

ONNX inference

Inference video Example Raspberry Pi 4

References:

Owner

Ibai Gorordo

LUKE -- Language Understanding with Knowledge-based Embeddings

EssentialMC2 Video Understanding

A Comparative Review of Recent Kinect-Based Action Recognition Algorithms (TIP2020, Matlab codes)

All public open-source implementations of convnets benchmarks

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Implementation of Convolutional LSTM in PyTorch.

Official repo for BMVC2021 paper ASFormer: Transformer for Action Segmentation

Pytorch implementation of Compressive Transformers, from Deepmind

SimpleDepthEstimation - An unified codebase for NN-based monocular depth estimation methods

Discord Multi Tool that focuses on design and easy usage

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

PyTorch code for the ICCV'21 paper: "Always Be Dreaming: A New Approach for Class-Incremental Learning"

This repository contains pre-trained models and some evaluation code for our paper Towards Unsupervised Dense Information Retrieval with Contrastive Learning

Annotate with anyone, anywhere.

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning using 🤗 transformers

Official implementation of the method ContIG, for self-supervised learning from medical imaging with genomics

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Semantic similarity computation with different state-of-the-art metrics

A copy of Ares that costs 30 fucking dollars.