Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)

Last update: Apr 16, 2022

This repository provides a PyTorch implementation of the paper Efficient Two-Step Networks for Temporal Action Segmentation.

Requirements

* Python 3.8.5
* PyTorch 1.8.1

You can install the required packages with requirements.txt:
pip install -r requirements.txt

Datasets

Download the data provided by MS-TCN (~30 GB), which contains the I3D features (without fine-tuning) and the ground truth labels for the three datasets.
Extract it so that you have the data folder in the same directory as train.py.

Directory structure

├── config
│   ├── 50salads
│   ├── breakfast
│   └── gtea
├── csv
│   ├── 50salads
│   ├── breakfast
│   └── gtea
├── dataset
│   ├── 50salads/...
│   ├── breakfast/...
│   └── gtea
│       ├── features/
│       ├── groundTruth/
│       ├── splits/
│       └── mapping.txt
├── libs
├── result
├── utils
├── requirements.txt
├── train.py
├── eval.py
└── README.md
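
Before running the preparation scripts, it can help to confirm that the extracted data matches the layout above. The following is a minimal sketch, not part of the repository: the function name missing_entries is hypothetical, and it assumes that 50salads and breakfast contain the same four entries shown for gtea.

```python
from pathlib import Path

# Dataset names and per-dataset entries taken from the tree above.
# Assumption: 50salads and breakfast mirror the gtea layout.
DATASETS = ("50salads", "breakfast", "gtea")
ENTRIES = ("features", "groundTruth", "splits", "mapping.txt")


def missing_entries(dataset_dir):
    """Return the expected paths (relative to dataset_dir) that do not exist."""
    root = Path(dataset_dir)
    missing = []
    for name in DATASETS:
        for entry in ENTRIES:
            if not (root / name / entry).exists():
                missing.append(f"{name}/{entry}")
    return missing


if __name__ == "__main__":
    # Report anything that is absent under ./dataset.
    for path in missing_entries("./dataset"):
        print("missing:", path)
```

An empty result means every expected folder and mapping file was found.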

Training and Testing of ETSN

Setting

First, convert the ground truth files into NumPy arrays.

python utils/generate_gt_array.py ./dataset
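
To illustrate what this conversion involves, here is a minimal sketch rather than the repository's script. It assumes that mapping.txt holds one "<id> <action_name>" pair per line and that each ground truth file holds one action name per frame; the helper names load_mapping and labels_to_array and the sample labels are hypothetical.

```python
import numpy as np


def load_mapping(lines):
    """Parse lines of the form '<id> <action_name>' into a name -> id dict."""
    mapping = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        idx, name = line.split(maxsplit=1)
        mapping[name] = int(idx)
    return mapping


def labels_to_array(label_lines, mapping):
    """Convert one action name per frame into an int64 array of class ids."""
    names = [ln.strip() for ln in label_lines if ln.strip()]
    return np.array([mapping[n] for n in names], dtype=np.int64)


# Illustrative input; real files would be read with open(...).readlines().
mapping = load_mapping(["0 take", "1 open", "2 pour"])
gt = labels_to_array(["take", "take", "open", "pour", "pour"], mapping)
# The array could then be stored with np.save for fast loading during training.
```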

Then run the script below to generate the CSV files for the data loader.

python utils/builda_dataset.py ./dataset
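
The CSV files give the data loader one row per video. The sketch below shows one way such a file could be built; it is an assumption-laden illustration, not the repository's script. The column names feature and label, the .npy extension for converted labels, and the helper names build_rows and rows_to_csv are all hypothetical.

```python
import csv
import io
import posixpath


def build_rows(dataset_dir, dataset, video_names):
    """Pair each video's feature file with its label file (hypothetical layout)."""
    base = posixpath.join(dataset_dir, dataset)
    rows = []
    for name in video_names:
        stem = posixpath.splitext(name)[0]
        rows.append({
            "feature": posixpath.join(base, "features", stem + ".npy"),
            "label": posixpath.join(base, "groundTruth", stem + ".npy"),
        })
    return rows


def rows_to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["feature", "label"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Illustrative video names, as they might be listed in a split file.
rows = build_rows("./dataset", "gtea", ["S1_Cheese_C1_1.txt", "S1_Coffee_C1_1.txt"])
csv_text = rows_to_csv(rows)
```

Writing one CSV per split keeps the train and test lists explicit and easy to inspect.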

Training

You can train a model by passing a configuration file to train.py. Edit the settings in that file to change the training setup.

python train.py ./config/xxx/xxx/config.yaml

Evaluation

After training finishes, you can evaluate the performance of the trained model.

python eval.py ./result/xxx/xxx/config.yaml test
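
MS-TCN-style evaluation commonly reports frame-wise accuracy and a segmental edit score, among other metrics. The sketch below shows how these two could be computed from per-frame labels. It is an illustration under that assumption, not the code in eval.py, and the function names are hypothetical.

```python
def segments(labels):
    """Collapse a per-frame label sequence into its ordered segment labels."""
    segs = []
    for lab in labels:
        if not segs or segs[-1] != lab:
            segs.append(lab)
    return segs


def frame_accuracy(pred, gt):
    """Percentage of frames whose predicted label matches the ground truth."""
    correct = sum(p == g for p, g in zip(pred, gt))
    return 100.0 * correct / len(gt)


def edit_score(pred, gt):
    """Normalized Levenshtein similarity between the two segment sequences."""
    p, g = segments(pred), segments(gt)
    prev = list(range(len(g) + 1))
    for i in range(1, len(p) + 1):
        cur = [i] + [0] * len(g)
        for j in range(1, len(g) + 1):
            cost = 0 if p[i - 1] == g[j - 1] else 1
            cur[j] = min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost)
        prev = cur
    return 100.0 * (1 - prev[len(g)] / max(len(p), len(g)))


gt_frames = [0, 0, 0, 1, 1, 2, 2, 2]
pred_frames = [0, 0, 1, 1, 1, 2, 0, 2]
acc = frame_accuracy(pred_frames, gt_frames)
edit = edit_score(pred_frames, gt_frames)
```

In this example the prediction has a spurious short segment near the end, which costs more in the edit score than in frame-wise accuracy. That is why segment-level metrics are reported alongside accuracy: they penalize over-segmentation.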

We also provide a trained ETSN model on Google Drive. Extract it so that the result folder is in the same directory as train.py.

Average cross-validation results

python utils/average_cv_results.py [result_dir]
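
Averaging over cross-validation splits amounts to taking the mean of each metric across the per-split results. A minimal sketch, assuming each split's results are available as a dictionary of metric values (the metric names, the numbers, and the function name average_results are illustrative only and not taken from the repository):

```python
from statistics import mean


def average_results(per_split):
    """Average each metric over a list of per-split result dictionaries."""
    keys = per_split[0].keys()
    return {k: mean(r[k] for r in per_split) for k in keys}


# Made-up numbers for four splits, purely to show the computation.
splits = [
    {"Acc": 80.0, "Edit": 70.0},
    {"Acc": 82.0, "Edit": 72.0},
    {"Acc": 84.0, "Edit": 74.0},
    {"Acc": 86.0, "Edit": 76.0},
]
averaged = average_results(splits)
```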

Citation

If you find our code useful, please cite our paper.

@article{LI2021373,
author = {Yunheng Li and Zhuben Dong and Kaiyuan Liu and Lin Feng and Lianyu Hu and Jie Zhu and Li Xu and Yuhan Wang and Shenglan Liu},
journal = {Neurocomputing},
title = {Efficient Two-Step Networks for Temporal Action Segmentation},
year = {2021},
volume = {454},
pages = {373-381},
issn = {0925-2312},
doi = {https://doi.org/10.1016/j.neucom.2021.04.121},
url = {https://www.sciencedirect.com/science/article/pii/S0925231221006998},
}

Contact

For any questions, please raise an issue or contact the authors.

Acknowledgement

We thank the MS-TCN authors for the extracted I3D features, backbone network, and evaluation code.

We also thank Yuchi Ishikawa for sharing the re-implementation of MS-TCN in PyTorch.
