Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Last update: Nov 30, 2022

Overview

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

This repository contains the code for MobileStereoNet, a lightweight stereo matching network based on MobileNetV1 and MobileNetV2.

Figure: example disparity predictions from 2D-MobileStereoNet and 3D-MobileStereoNet, each shown beside its error map.

Installation

Requirements

The code is tested on:

Ubuntu 18.04
Python 3.6
PyTorch 1.4.0
Torchvision 0.5.0
CUDA 10.0
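
To compare an existing environment against the tested versions listed above, a small check along the following lines may help. This is a minimal sketch, not part of the repository: the helper names `compare_versions` and `cuda_version` are hypothetical, and it assumes the packages import as `torch` and `torchvision` and expose a `__version__` attribute.

```python
import importlib

# Versions taken from the requirements list; the import names are assumptions.
TESTED_VERSIONS = {"torch": "1.4.0", "torchvision": "0.5.0"}


def compare_versions(tested=None):
    """Return {package: (tested_version, installed_version_or_None)}."""
    tested = TESTED_VERSIONS if tested is None else tested
    report = {}
    for name, wanted in tested.items():
        try:
            module = importlib.import_module(name)
            installed = getattr(module, "__version__", None)
        except ImportError:
            installed = None  # package is not installed in this environment
        report[name] = (wanted, installed)
    return report


def cuda_version():
    """Return the CUDA version PyTorch was built with, or None if unavailable."""
    try:
        import torch
    except ImportError:
        return None
    return torch.version.cuda


if __name__ == "__main__":
    for name, (wanted, installed) in compare_versions().items():
        # Installed builds may carry a suffix such as "+cu100", so compare prefixes.
        matches = installed is not None and installed.startswith(wanted)
        print(f"{name}: tested with {wanted}, found {installed}",
              "(ok)" if matches else "(differs)")
    print("CUDA (tested with 10.0):", cuda_version())
```

A mismatch does not necessarily mean the code will fail; it only indicates that the environment differs from the one the code was tested on.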

Setting up the environment

conda env create --file mobilestereonet.yml
conda activate mobilestereonet

Training

Set a variable (e.g. DATAPATH) to the dataset directory, for example DATAPATH="/Datasets/SceneFlow/" or DATAPATH="/Datasets/KITTI2015/". Then run train.py as shown below:

Pretraining on SceneFlow

python train.py --dataset sceneflow --datapath $DATAPATH --trainlist ./filenames/sceneflow_train.txt --testlist ./filenames/sceneflow_test.txt --epochs 20 --lrepochs "10,12,14,16:2" --batch_size 8 --test_batch_size 8 --model MSNet2D

Finetuning on KITTI

python train.py --dataset kitti --datapath $DATAPATH --trainlist ./filenames/kitti15_train.txt --testlist ./filenames/kitti15_val.txt --epochs 400 --lrepochs "200:10" --batch_size 8 --test_batch_size 8 --loadckpt ./checkpoints/pretrained.ckpt --model MSNet2D

The arguments in both commands (for example, the batch sizes or the model) can be adjusted to suit the model being trained and the available hardware.
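
The --lrepochs values in the commands above, such as "10,12,14,16:2" and "200:10", are not explained in this README. The sketch below shows one plausible reading, assuming the format follows the convention used in GwcNet, on which this code is based: a comma-separated list of epochs, then a colon, then the factor by which the learning rate is divided at each listed epoch. The function names, the base learning rate of 0.001, and the epoch indexing are assumptions for illustration only.

```python
def parse_lrepochs(lrepochs):
    """Split a string like "10,12,14,16:2" into ([10, 12, 14, 16], 2.0)."""
    epochs_part, rate_part = lrepochs.split(":")
    epochs = [int(e) for e in epochs_part.split(",")]
    return epochs, float(rate_part)


def learning_rate_at(epoch, base_lr, lrepochs):
    """Learning rate in effect at `epoch` under the assumed schedule.

    The rate is divided once for every listed epoch that has been reached.
    """
    milestones, rate = parse_lrepochs(lrepochs)
    lr = base_lr
    for milestone in milestones:
        if epoch >= milestone:
            lr /= rate
    return lr


if __name__ == "__main__":
    base_lr = 0.001  # hypothetical value, not taken from the README
    for epoch in (0, 10, 12, 14, 16):
        print(epoch, learning_rate_at(epoch, base_lr, "10,12,14,16:2"))
```

Under this reading, the SceneFlow schedule halves the learning rate four times between epochs 10 and 16, while the KITTI schedule divides it by 10 once at epoch 200. The train.py source remains the authority on the actual behaviour.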

Prediction

The following script creates disparity maps for a specified model:

python prediction.py --datapath $DATAPATH --testlist ./filenames/kitti15_test.txt --loadckpt ./checkpoints/finetuned.ckpt --dataset kitti --colored True --model MSNet2D
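
When prediction is launched from another script, it can be convenient to verify that the dataset directory, file list, and checkpoint exist before starting. The following is a minimal sketch under that assumption; `build_prediction_command` is a hypothetical helper, not part of the repository, and it uses only the flags shown in the command above.

```python
import os


def build_prediction_command(datapath, testlist, loadckpt,
                             dataset="kitti", model="MSNet2D", colored=True):
    """Return the argument list for prediction.py after checking input paths."""
    missing = [p for p in (datapath, testlist, loadckpt) if not os.path.exists(p)]
    if missing:
        raise FileNotFoundError("Missing path(s): " + ", ".join(missing))
    return [
        "python", "prediction.py",
        "--datapath", datapath,
        "--testlist", testlist,
        "--loadckpt", loadckpt,
        "--dataset", dataset,
        "--colored", str(colored),
        "--model", model,
    ]
```

The returned list can be passed to subprocess.run(cmd, check=True) from the repository root.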

Credits

This implementation is based on PSMNet and GwcNet. Thanks also to Matteo Poggi for the KITTI Python utilities.

Owner

Cognitive Systems Research Group
