Self-Regulated Learning for Egocentric Video Activity Anticipation

Last update: Sep 23, 2022

Related tags

Overview

Self-Regulated Learning for Egocentric Video Activity Anticipation

Introduction

This is a Pytorch implementation of the model described in our paper:

Z. Qi, S. Wang, C. Su, L. Su, Q. Huang, and Q. Tian. Self-Regulated Learning for Egocentric Video Activity Anticipation. TPAMI 2021.

Dependencies

Pytorch >= 1.0.1
Cuda 9.0.176
Cudnn 7.4.2
Python 3.6.8

Data

EPIC-Kitchens dataset

For the raw data of the EPIC-Kitchens dataset, please refer to https://github.com/epic-kitchens/download-scripts to download.

For the three modality features (rgb, flow, obj), please refer to https://github.com/fpv-iplab/rulstm to download. After downloading, put them in the folder './data'.

EGTEA Gaze+ dataset

For the raw data of the EGTEA Gaze+ dataset, please refer to http://cbs.ic.gatech.edu/fpv/ to download.

For the extracted features, please refer to https://github.com/fpv-iplab/rulstm to download. After downloading, put them in the folder './data'.

50 Salads dataset

For the raw data of the 50 Salads dataset, please refer to http://cvip.computing.dundee.ac.uk/datasets/foodpreparation/50salads/ to download.

For the extracted features, please refer to https://github.com/colincsl/TemporalConvolutionalNetworks to download. After downloading, put them in the folder './data'.

Breakfast dataset

For the raw data of the Breakfast dataset, please refer to https://serre-lab.clps.brown.edu/resource/breakfast-actions-dataset/ to download.

For the extraced I3D features, please download from Baidu passward: 'wub3' or Google Drive. After downloading, put them in the folder './data'.

Train for Epic-Kitchen dataset

For rgb feature, python main.py --gpu_ids 0 --batch_size 128 --wd 1e-5 --lr 0.1 --reinforce_verb_weight 0.01 --reinforce_noun_weight 0.01 --revision_weight 0.8 --mode train --modality rgb --hidden 1024 --feat_in 1024

Silimar commonds can be used for flow or obj features.

Validation for Epic-Kitchen dataset

Please download the pre-trained model weigths from Baidu passward: 'wub3' or Google Drive, and put them in the folder './results/EPIC/base_srl/pre_trained/'.

For rgb feature, python main.py --gpu_ids 0 --batch_size 128 --mode validate --modality rgb --hidden 1024 --feat_in 1024 --resume_timestamp pre_trained

For flow feature, python main.py --gpu_ids 0 --batch_size 128 --mode validate --modality flow --hidden 1024 --feat_in 1024 --resume_timestamp pre_trained

For obj feature, python main.py --gpu_ids 0 --batch_size 128 --mode validate --modality obj --hidden 352 --feat_in 352 --resume_timestamp pre_trained

For three modality features, python main.py --gpu_ids 0 --batch_size 128 --mode validate --modality fusion --resume_timestamp pre_trained

Citation

Please cite our paper if you use this code in your own work:

@article{qi2021self,
  title={Self-Regulated Learning for Egocentric Video Activity Anticipation},
  author={Qi, Zhaobo and Wang, Shuhui and Su, Chi and Su, Li and Huang, Qingming and Tian, Qi},
  journal={IEEE Transactions on Pattern Analysis \& Machine Intelligence},
  number={01},
  pages={1--1},
  year={2021},
  publisher={IEEE Computer Society}
}

Concat

If you have any problem about our code, feel free to contact

[email protected]

Self-Regulated Learning for Egocentric Video Activity Anticipation

Related tags

Overview

Self-Regulated Learning for Egocentric Video Activity Anticipation

Introduction

Dependencies

Data

EPIC-Kitchens dataset

EGTEA Gaze+ dataset

50 Salads dataset

Breakfast dataset

Train for Epic-Kitchen dataset

Validation for Epic-Kitchen dataset

Citation

Concat

Owner

qzhb

Code for paper 'Hand-Object Contact Consistency Reasoning for Human Grasps Generation' at ICCV 2021

unet for image segmentation

Imaging, analysis, and simulation software for radio interferometry

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)

Deep Reinforcement Learning for Multiplayer Online Battle Arena

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

基于AlphaPose的TensorRT加速

Contrastive Multi-View Representation Learning on Graphs

Code base for reproducing results of I.Schubert, D.Driess, O.Oguz, and M.Toussaint: Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics. NeurIPS (2021)

Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

Semi-supervised Stance Detection of Tweets Via Distant Network Supervision

Collection of TensorFlow2 implementations of Generative Adversarial Network varieties presented in research papers.

A benchmark dataset for mesh multi-label-classification based on cube engravings introduced in MeshCNN

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Benchmark for Answering Existential First Order Queries with Single Free Variable

Y. Zhang, Q. Yao, W. Dai, L. Chen. AutoSF: Searching Scoring Functions for Knowledge Graph Embedding. IEEE International Conference on Data Engineering (ICDE). 2020

PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

FS2KToolbox FS2K Dataset Towards the translation between Face

Astrostatistics class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)