code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Last update: Dec 14, 2022

Overview

Video_Pace

This repository contains the code for the following paper:

Jiangliu Wang, Jianbo Jiao and Yunhui Liu, "Self-Supervised Video Representation Learning by Pace Prediction", In: ECCV (2020).

Main idea:

Framework:

Requirements

pytroch >= 1.3.0
tensorboardX
cv2
scipy

Usage

Data preparation

UCF101 dataset

Download the original UCF101 dataset from the official website. And then extarct RGB images from videos.
Or direclty download the pre-processed RGB data of UCF101 here provided by feichtenhofer.

Pre-train

Train with pace prediction task on S3D-G, the default clip length is 64 and input video size is 224 x 224.

python train.py --rgb_prefix RGB_DIR --gpu 0,1,2,3 --bs 32 --lr 0.001 --height 256 --width 256 --crop_sz 224 --clip_len 64

Train with pace prediction task on c3d/r3d/r21d, the default clip length is 16 and input video size is 112 x 112.

python train.py --rgb_prefix RGB_DIR --gpu 0 --bs 30 --lr 0.001 --model c3d/r3d/r21d --height 128 --width 171 --crop_sz 112 --clip_len 16

Evaluation

To be updated...

Citation

If you find this work useful or use our code, please consider citing:

@InProceedings{Wang20,
  author       = "Jiangliu Wang and Jianbo Jiao and Yunhui Liu",
  title        = "Self-Supervised Video Representation Learning by Pace Prediction",
  booktitle    = "European Conference on Computer Vision",
  year         = "2020",
}

Acknowlegement

Part of our codes are adapted from S3D-G HowTO100M, we thank the authors for their contributions.

code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Related tags

Overview

Video_Pace

Main idea:

Framework:

Requirements

Usage

Data preparation

Pre-train

Evaluation

Citation

Acknowlegement

Owner

Jiangliu Wang

Examples of how to create colorful, annotated equations in Latex using Tikz.

U^2-Net - Portrait matting This repository explores possibilities of using the original u^2-net model for portrait matting.

This repository contains the official MATLAB implementation of the TDA method for reverse image filtering

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

This is the official pytorch implementation of the BoxEL for the description logic EL++

Instance-wise Occlusion and Depth Orders in Natural Scenes (CVPR 2022)

TLDR: Twin Learning for Dimensionality Reduction

A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

JAX + dataclasses

A tiny, friendly, strong baseline code for Person-reID (based on pytorch).

PyTorch implementation for our NeurIPS 2021 Spotlight paper "Long Short-Term Transformer for Online Action Detection".

UV matrix decompostion using movielens dataset

Wordle-solver - Wordle answer generation program in python

GNEE - GAT Neural Event Embeddings

Unsupervised Representation Learning via Neural Activation Coding

AFLNet: A Greybox Fuzzer for Network Protocols

Efficient Online Bayesian Inference for Neural Bandits

A tutorial on training a DarkNet YOLOv4 model for the CrowdHuman dataset

PoolFormer: MetaFormer is Actually What You Need for Vision