Official PyTorch Implementation of Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity

Overview

UnRigidFlow

This is the official PyTorch implementation of UnRigidFlow (IJCAI 2019).

Here are two sample results (a ~10 MB GIF each) of our unsupervised models.

KITTI 15 | Cityscapes
[KITTI 15 result GIF] | [Cityscapes result GIF]

If you find this repo useful in your research, please consider citing:

@inproceedings{Liu:2019:unrigid, 
title = {Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity}, 
author = {Liang Liu and Guangyao Zhai and Wenlong Ye and Yong Liu}, 
booktitle = {International Joint Conference on Artificial Intelligence, IJCAI}, 
year = {2019}
}

Requirements

This codebase was developed and tested with Python 3.5, PyTorch >= 0.4.1, OpenCV 3.4, CUDA 9.0, and Ubuntu 16.04.

Most of the Python packages can be installed with:

pip3 install -r requirements.txt

In addition, the optimized correlation layer with a CUDA kernel should be compiled manually:

cd <correlation_package>
python3 setup.py install

and add <correlation_package> to $PYTHONPATH.
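
For example, on a bash-like shell you can export it for the current session (replace <correlation_package> with the actual path on your machine):

export PYTHONPATH=<correlation_package>:$PYTHONPATH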

Note that if you use PyTorch >= 1.0, you need to make a few changes; see NVIDIA/flownet2-pytorch#98.

Just replace #include <torch/torch.h> with #include <torch/extension.h>, add #include <ATen/cuda/CUDAContext.h>, and then replace all occurrences of at::globalContext().getCurrentCUDAStream() with at::cuda::getCurrentCUDAStream().
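
If you prefer not to edit the sources by hand, the following sketch applies the same changes with sed. The file names correlation_cuda.cc and correlation_cuda_kernel.cu are assumptions about the package layout; adjust them to the actual .cc/.cu sources in <correlation_package>.

cd <correlation_package>
# Switch to the PyTorch >= 1.0 extension header (file names are assumptions, see above)
sed -i 's|#include <torch/torch.h>|#include <torch/extension.h>|' correlation_cuda.cc
# Add the CUDAContext header right after the extension include
sed -i '/#include <torch\/extension.h>/a #include <ATen/cuda/CUDAContext.h>' correlation_cuda.cc
# Use the new stream accessor everywhere
sed -i 's|at::globalContext().getCurrentCUDAStream()|at::cuda::getCurrentCUDAStream()|g' correlation_cuda.cc correlation_cuda_kernel.cu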

Training and Evaluation

We mainly focus on the KITTI benchmark. You will need to download all of the KITTI raw data and calibration files to train the model. You will also need the training files of KITTI 2012 and KITTI 2015, with calibration files [1], [2], to validate the models.

The complete training consists of three steps:

  1. Train the flow model separately:

    python3 train.py -c configs/KITTI_flow.json
    
  2. Train the depth model separately:

    python3 train.py -c configs/KITTI_depth_stereo.json
    
  3. Train the flow and depth models jointly:

    python3 train.py -c configs/KITTI_rigid_flow_stereo.json
    

For evaluation, just add the --e option and modify the corresponding model path in the above commands.
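
For example, evaluating the separately trained flow model could look like the command below; where exactly the checkpoint path is set (in the config file or elsewhere) depends on train.py, so check the corresponding config before running it:

python3 train.py -c configs/KITTI_flow.json --e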

Pre-trained Models

You can download our pre-trained models; we provide the following models:

  • KITTI_flow: The separately trained optical flow network on KITTI raw data (from scratch).
  • KITTI_stereo_depth: The stereo depth network on KITTI raw data.
  • KITTI_flow_joint: The optical flow network jointly trained with stereo depth on KITTI raw data.

Acknowledgement

This repository borrows some snippets from several great works, including PWC-Net, monodepth, UnFlow, UnDepthFlow, and DF-Net. Although most of these are TensorFlow implementations, we are grateful that they were shared, which saved us a lot of time.
