[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment

Last update: Dec 24, 2022

Related tags

Overview

CoRe

Created by Xumin Yu*, Yongming Rao*, Wenliang Zhao, Jiwen Lu, Jie Zhou

This is the PyTorch implementation for ICCV paper Group-aware Contrastive Regression for Action Quality Assessment arXiv.

We present a new Contrastive Regression (CoRe) framework to learn the relative scores by pair-wise comparison, which highlights the differences between videos and guides the models to learn the key hints for action quality assessment.

Pretrained Model

Our pretrained CoRe model for MTL-AQA is available at [Tsinghua Cloud] [Google Drive]

Usage

Requirement

Python >= 3.6
Pytorch >= 1.4.0
torchvision >= 0.4.1
torch_videovision

pip install git+https://github.com/hassony2/torch_videovision

Download initial I3D

We use the Kinetics pretrained I3D model from the reposity kinetics_i3d_pytorch

Dataset Preparation

MTL-AQA

Please download the dataset from the repository MTL-AQA. The data structure should be:

$DATASET_ROOT
├── MTL-AQA/
    ├── new
        ├── new_total_frames_256s
            ├── 01
            ...
            └── 09
    ├── info
        ├── final_annotations_dict_with_dive_number
        ├── test_split_0.pkl
        └── train_split_0.pkl
    └── model_rgb.pth

The processed annotations are already provided in this repo. You can download the prepared dataset [BaiduYun](code:smff). Download and unzip the four zip files under MTL-AQA/, then follow the structure. If you want to prepare the data by yourself, please see MTL_helper for some helps. We provide codes for processing the data from an online video to the frames data.

AQA-7

Download AQA-7 Dataset:

mkdir AQA-Seven & cd AQA-Seven
wget http://rtis.oit.unlv.edu/datasets/AQA-7.zip
unzip AQA-7.zip

The data structure should be:

$DATASET_ROOT
├── Seven/
    ├── diving-out
        ├── 001
            ├── img_00001.jpg
            ...
        ...
        └── 370
    ├── gym_vault-out
        ├── 001
            ├── img_00001.jpg
            ...
    ...

    └── Split_4
        ├── split_4_test_list.mat
        └── split_4_train_list.mat

You can download he prepared dataset [BaiduYun](code:65rl). Unzip the file under Seven/

JIGSAWS

Please download the dataset from JIASAWS. You are required to complete a form before you use this dataset for academic research.

The training and test code for JIGSAWS is on the way.

Training and Evaluation

To train a CoRe model:

bash ./scripts/train.sh <GPUIDS>  <MTL/Seven> <exp_name>  [--resume]

For example,

# train a model on MTL
bash ./scripts/train.sh 0,1 MTL try 

# train a model on Seven
bash ./scripts/train.sh 0,1 Seven try --Seven_cls 1

To evaluate a pretrained model:

bash ./scripts/test.sh <GPUIDS>  <MTL/Seven> <exp_name>  --ckpts <path> [--Seven_cls <int>]

For example,

# test a model on MTL
bash ./scripts/test.sh 0 MTL try --ckpts ./MTL_CoRe.pth

# test a model on Seven
bash ./scripts/test.sh 0 Seven try --Seven_cls 1 --ckpts ./Seven_CoRe_1.pth

Visualizatin Results

Citation

If you find our work useful in your research, please consider citing:

@misc{yu2021groupaware,
      title={Group-aware Contrastive Regression for Action Quality Assessment}, 
      author={Xumin Yu and Yongming Rao and Wenliang Zhao and Jiwen Lu and Jie Zhou},
      year={2021},
      eprint={2108.07797},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment

Related tags

Overview

CoRe

Pretrained Model

Usage

Requirement

Download initial I3D

Dataset Preparation

MTL-AQA

AQA-7

JIGSAWS

Training and Evaluation

Visualizatin Results

Citation

Owner

Xumin Yu

Tf alloc - Simplication of GPU allocation for Tensorflow2

PyTorch common framework to accelerate network implementation, training and validation

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

Implementation of Multistream Transformers in Pytorch

Roadmap to becoming a machine learning engineer in 2020

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping

Official respository for "Modeling Defocus-Disparity in Dual-Pixel Sensors", ICCP 2020

Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Pytorch code for "Text-Independent Speaker Verification Using 3D Convolutional Neural Networks".

PINN Burgers - 1D Burgers equation simulated by PINN

Pytorch implementation of PTNet for high-resolution and longitudinal infant MRI synthesis

Quickly comparing your image classification models with the state-of-the-art models (such as DenseNet, ResNet, ...)

OBG-FCN - implementation of 'Object Boundary Guided Semantic Segmentation'

Look Who’s Talking: Active Speaker Detection in the Wild

[ACM MM 2021] TSA-Net: Tube Self-Attention Network for Action Quality Assessment

Measuring Coding Challenge Competence With APPS

DeepHyper: Scalable Asynchronous Neural Architecture and Hyperparameter Search for Deep Neural Networks

a practicable framework used in Deep Learning. So far UDL only provide DCFNet implementation for the ICCV paper (Dynamic Cross Feature Fusion for Remote Sensing Pansharpening)