UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning

By Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun

Megvii Technology, University of Electronic Science and Technology of China

[Preprint]

@inproceedings{luo2021upflow,
  title={Upflow: Upsampling pyramid for unsupervised optical flow learning},
  author={Luo, Kunming and Wang, Chuan and Liu, Shuaicheng and Fan, Haoqiang and Wang, Jue and Sun, Jian},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={1045--1054},
  year={2021}
}

Introduction

We present an unsupervised learning approach for optical flow estimation by improving the upsampling and learning of pyramid network. We design a self-guided upsample module to tackle the interpolation blur problem caused by bilinear upsampling between pyramid levels. Moreover, we propose a pyramid distillation loss to add supervision for intermediate levels via distilling the finest flow as pseudo labels. By integrating these two components together, our method achieves the best performance for unsupervised optical flow learning on multiple leading benchmarks, including MPI-SIntel, KITTI 2012 and KITTI 2015. In particular, we achieve EPE=1.4 on KITTI 2012 and F1=9.38% on KITTI 2015, which outperform the previous state-of-the-art methods by 22.2% and 15.7%, respectively.

This repository includes:

inferring scripts; and
pretrain model; and
Training losses

Usage

Please first install the environments following how_to_install.md.

Run python3 test.py to test our trained model on KITTI 2015 dataset. Note that Cuda is needed.

Acknowledgement

Part of our codes are adapted from IRR-PWC, UnFlow ARFlow and UFlow, we thank the authors for their contributions.

PyTorch implementation of UPFlow (unsupervised optical flow learning)

Related tags

Overview

UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning

Introduction

Usage

Acknowledgement

Owner

kunming luo

CAST: Character labeling in Animation using Self-supervision by Tracking

Exploring Visual Engagement Signals for Representation Learning

Its a Plant Leaf Disease Detection System based on Machine Learning.

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Video lie detector using xgboost - A video lie detector using OpenFace and xgboost

Implementation for paper LadderNet: Multi-path networks based on U-Net for medical image segmentation

Pytorch implementation of Nueral Style transfer

Sequence modeling benchmarks and temporal convolutional networks

The PyTorch implementation of paper REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

Densely Connected Convolutional Networks, In CVPR 2017 (Best Paper Award).

An SMPC companion library for Syft

PyTorch Connectomics: segmentation toolbox for EM connectomics

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

Unsupervised Feature Ranking via Attribute Networks.

An all-in-one application to visualize multiple different local path planning algorithms

This tutorial repository is to introduce the functionality of KGTK to first-time users

nnFormer: Interleaved Transformer for Volumetric Segmentation

Comp445 project - Data Communications & Computer Networks

Code for the tech report Toward Training at ImageNet Scale with Differential Privacy

a grammar based feedback fuzzer