Stochastic CSLR

This is the PyTorch implementation for the ECCV 2020 paper: Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Quick Start

1. Installation

pip install git+https://github.com/zheniu/stochastic-cslr

You also need to install sclite for evaluation; see step 2 for instructions.
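
To verify the installation, the package should be importable. A trivial check (the package name stochastic_cslr matches the module referenced in step 4):

import stochastic_cslr  # should import without error after the pip install above

print(stochastic_cslr.__file__)  # shows where the package was installed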

2. Prepare the dataset

  • Download the RWTH-PHOENIX-2014 dataset here.
  • Unzip it and note the path to the phoenix-2014-multisigner/ folder for later use.
  • Install sclite for evaluation. See phoenix-2014-multisigner/evaluation/NIST-sclite_sctk-2.4.0-20091110-0958.tar.bz2 for details.
  • After installing sclite, add it to your PATH (a quick check is sketched below).
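
Once installed, you can confirm that sclite is actually discoverable on your PATH with a quick, generic check (not part of this repo):

import shutil

# The evaluation step calls sclite, so it must be resolvable from PATH.
path = shutil.which("sclite")
print(f"sclite found at {path}" if path else "sclite is NOT on PATH")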

3. Run a quick test

You can use the script quick_test.py for a quick test.

python3 quick_test.py --data-root your_path_to/phoenix-2014-multisigner

By specifying the model type (--model sfl/dfl), the data split (--split dev/test), and whether to use a language model (--use-lm), you can obtain the following results:

Model    | WER (dev) | sub/del/ins (dev) | WER (test) | sub/del/ins (test)
---------|-----------|-------------------|------------|-------------------
DFL      | 27.1      | 12.7/7.4/7.0      | 27.7       | 13.8/7.3/6.6
SFL      | 26.2      | 12.7/6.9/6.7      | 26.6       | 13.7/6.5/6.4
DFL + LM | 25.6      | 11.5/9.2/4.9      | 26.4       | 12.4/9.3/4.7
SFL + LM | 24.3      | 11.4/8.5/4.4      | 25.3       | 12.4/8.5/4.3

Note that these results differ slightly from the paper because a different random seed was used.

You may also take a look at quick_test.py itself, as it shows how to load and use the pretrained models.
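
If you want to reproduce the whole results table in one go, the documented flags can be swept with a plain subprocess wrapper (a minimal sketch; only the flags shown above are assumed, nothing from the package's internal API):

import subprocess

data_root = "your_path_to/phoenix-2014-multisigner"  # replace with your path

# Sweep the documented flag combinations: --model, --split, and optional --use-lm.
for model in ("dfl", "sfl"):
    for split in ("dev", "test"):
        for use_lm in (False, True):
            cmd = ["python3", "quick_test.py", "--data-root", data_root,
                   "--model", model, "--split", split]
            if use_lm:
                cmd.append("--use-lm")
            subprocess.run(cmd, check=True)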

4. Train your own model

The configuration files for deterministic and stochastic fine-grained labeling (DFL and SFL) are located under config/. The training script is based on torchzq, a PyTorch experiment runner that automatically reads the hyperparameters from the YAML file and passes them to stochastic_cslr/runner.py.

Before running, change data_root in the YAML configurations to the path of your phoenix-2014-multisigner/ folder; you can edit the files by hand or script it as sketched below.
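
For example, a minimal PyYAML sketch that rewrites the key (assuming the config is plain YAML; note that re-dumping discards comments and key order, so hand-editing works just as well):

import yaml  # requires PyYAML (pip install pyyaml)

config_path = "config/dfl-fp16.yml"  # the same file passed to tzq below

with open(config_path) as f:
    config = yaml.safe_load(f)

# data_root is the key mentioned above; point it at your dataset copy.
config["data_root"] = "your_path_to/phoenix-2014-multisigner"

with open(config_path, "w") as f:
    yaml.safe_dump(config, f)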

Train (for instance, dfl):

tzq config/dfl-fp16.yml train

Test the trained model:

tzq config/dfl-fp16.yml test

Citation

You may cite this work as:

@inproceedings{niu2020stochastic,
  title={Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition},
  author={Niu, Zhe and Mak, Brian},
  booktitle={European Conference on Computer Vision},
  pages={172--186},
  year={2020},
  organization={Springer}
}