Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)

Last update: Nov 29, 2022

TimeCycle

Code for Learning Correspondence from the Cycle-consistency of Time (CVPR 2019, Oral). The code was developed with PyTorch 0.4 and Python 2, and it also runs with PyTorch 1.0. This repo includes the training code for learning semi-dense correspondence from unlabeled videos, and the testing code for applying this correspondence to segmentation mask tracking in videos.

Citation

If you use our code in your research or wish to refer to the baseline results, please use the following BibTeX entry.

@inproceedings{CVPR2019_CycleTime,
    Author = {Xiaolong Wang and Allan Jabri and Alexei A. Efros},
    Title = {Learning Correspondence from the Cycle-Consistency of Time},
    Booktitle = {CVPR},
    Year = {2019},
}

Model and Result

Our trained model can be downloaded from here. The tracking performance on DAVIS-2017 for this model (without training on DAVIS-2017) is:

cropSize     J_mean   J_recall   J_decay   F_mean   F_recall   F_decay
320 x 320    0.419    0.409      0.272     0.394    0.336      0.328
400 x 400    0.430    0.437      0.296     0.426    0.413      0.356
480 x 480    0.464    0.500      0.332     0.500    0.480      0.379

Note that results can be improved at test time by increasing the input image size (cropSize in the script): in the table above, J_mean and F_mean both rise as cropSize grows from 320 x 320 to 480 x 480. The training and testing procedures for this model are described below.
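For readers unfamiliar with the J columns: in the DAVIS benchmark, J is the region similarity, i.e. the Jaccard index (intersection over union) between a predicted mask and the ground-truth mask. The snippet below is only a minimal sketch of that per-frame quantity, assuming masks are given as equally sized 2D lists of 0/1 values (a hypothetical format chosen for illustration). It is not the evaluation code used by run_test.sh, and the numbers in the table should be reproduced with the official evaluation code instead.

```python
def jaccard(pred, gt):
    """Intersection over union of two binary masks.

    pred and gt are equally sized 2D lists of 0/1 values
    (an illustrative format, not the one used by this repo).
    """
    inter = 0
    union = 0
    for row_p, row_g in zip(pred, gt):
        for p, g in zip(row_p, row_g):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    # Assumed convention: two empty masks count as a perfect match.
    if union == 0:
        return 1.0
    # float() keeps the division exact under Python 2 as well.
    return float(inter) / union


pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]
print(jaccard(pred, gt))  # 2 shared pixels / 4 pixels in the union = 0.5
```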

Converting Our Model to Standard PyTorch ResNet-50

See convert_model.ipynb for converting our model to the standard PyTorch ResNet-50 model format.
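The notebook is the authoritative procedure. As a rough illustration of what such a conversion often involves, the sketch below renames the keys of a checkpoint dictionary so that they match the parameter names a standard ResNet-50 expects. The prefix "module.encoder." and the toy keys are assumptions made up for this example and may not match the names in the released checkpoint.

```python
from collections import OrderedDict


def strip_prefix(state_dict, prefix):
    """Keep only the keys that start with prefix and remove the prefix."""
    converted = OrderedDict()
    for key, value in state_dict.items():
        if key.startswith(prefix):
            converted[key[len(prefix):]] = value
    return converted


# Toy stand-in for a checkpoint; real values would be tensors.
checkpoint = OrderedDict([
    ("module.encoder.conv1.weight", "w0"),
    ("module.encoder.layer1.0.conv1.weight", "w1"),
    ("module.head.fc.weight", "w2"),  # not part of the backbone, dropped
])

# "module.encoder." is a hypothetical prefix used only for this example.
resnet_weights = strip_prefix(checkpoint, "module.encoder.")
print(list(resnet_weights.keys()))
```

With a real checkpoint, the renamed dictionary would typically be passed to load_state_dict on a ResNet-50 instance. Check the notebook for the actual key names and for any layers that need special handling.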

Dataset Preparation

See DATASET.md for instructions on downloading and preparing the VLOG dataset for training and the DAVIS dataset for testing.

Training

In train_video_cycle_simple.py in the repository root, set the input list to:

    params['filelist'] = 'YOUR_DATASET_FOLDER/vlog_frames_12fps.txt'

Then run the following code:

    python train_video_cycle_simple.py --checkpoint pytorch_checkpoints/release_model_simple
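Before launching a long training run, it can be worth confirming that the path given in params['filelist'] exists and is not empty. The helper below is a minimal sketch that is not part of this repo. It only counts non-empty lines and makes no assumption about the per-line format, which is described in DATASET.md. The demo writes a temporary file with placeholder entries so the snippet runs on its own.

```python
import os
import tempfile


def count_entries(filelist_path):
    """Count non-empty lines in a file list; raises an error if the file is missing."""
    with open(filelist_path) as f:
        return sum(1 for line in f if line.strip())


# Demo on a temporary file. The entries are placeholders,
# not the real format of vlog_frames_12fps.txt.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as tmp:
    tmp.write("entry_a\nentry_b\n\n")
demo_path = tmp.name

n = count_entries(demo_path)
os.remove(demo_path)
print(n)  # 2 non-empty lines
```

In practice one would call count_entries on the same path assigned to params['filelist'] and stop if the result is zero.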

Testing

In test_davis.py in the repository root, set the input list to:

    params['filelist'] = 'YOUR_DATASET_FOLDER/davis/DAVIS/vallist.txt'

Set the dataset path YOUR_DATASET_FOLDER in run_test.sh. Then run the testing and evaluation code together:

    sh run_test.sh

Acknowledgements

weakalign by Ignacio Rocco, Relja Arandjelović and Josef Sivic.

inflated_convnets_pytorch by Yana Hasson.

pytorch-classification by Wei Yang.

Owner

Xiaolong Wang
