This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Last update: Sep 23, 2022

Overview

Motion-Focused Contrastive Learning of Video Representations

Introduction

This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Requirements

torch == 1.5.1
torchvision == 0.6.1
liblinear
joblib

Data Preparation

You can refer to data_prepare

MCL Pretraining and Linear Evaluation

This implementation only supports multi-gpu, DistributedDataParallel training, which is faster and simpler; single-gpu or DataParallel training is not supported.

Following SeCo, try to download the weights MoCo v2 (200epochs) and put it into the pretrain folder, and run:

for UCF101 pretraining and linear evaluation
```
bash main_ucf101.sh
```
for Kinetics400 pretraining and linear evaluation
```
bash main_kinetics.sh
```

The checkpoint will be saved in the output/checkpoints entry defined in the configuration file. Besides, the linear evaluation result can be found in output/eval_output_linear.

Downstream task evaluation

finetune for UCF101

cd evaluate/downstream_finetune
bash run_ucf101.sh

finetune for HMDB51

cd evaluate/downstream_finetune
bash run_hmdb51.sh

The finetune result can be found in output/eval_output_finetune

This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Related tags

Overview

Motion-Focused Contrastive Learning of Video Representations

Introduction

Requirements

Data Preparation

MCL Pretraining and Linear Evaluation

Downstream task evaluation

Owner

A repository built on the Flow software package to explore cyber-security attacks on intelligent transportation systems.

[ICCV'21] NEAT: Neural Attention Fields for End-to-End Autonomous Driving

FewBit — a library for memory efficient training of large neural networks

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

Simple Python application to transform Serial data into OSC messages

Voice control for Garry's Mod

A python library to build Model Trees with Linear Models at the leaves.

This repository contains the database and code used in the paper Embedding Arithmetic for Text-driven Image Transformation

Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov Model Inference

The FIRST GANs-based omics-to-omics translation framework

RetinaFace: Deep Face Detection Library in TensorFlow for Python

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

Differentiable simulation for system identification and visuomotor control

Hierarchical Few-Shot Generative Models

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

This is the source code for generating the ASL-Skeleton3D and ASL-Phono datasets. Check out the README.md for more details.

Exploring Machine Learning Models for detecting anomalous behavior in credit-card transactions. It's crucial that credit-card companies are able to recognize fraudulent activity so that customers are not charged for items they didn't purchase.

Camview - A CLI-tool used to stream CCTV online footage based on URL params

[NeurIPS'21] Shape As Points: A Differentiable Poisson Solver