Non-Autoregressive Predictive Coding

This repository contains the implementation of Non-Autoregressive Predictive Coding (NPC) as described in the preprint paper submitted to ICASSP 2021.

A quick example for training NPC

python main.py --config config/self_supervised/npc_example.yml \
               --task self-learning

For more complete examples including downstream tasks, please see the example script.
For preparing data, please visit preprocess.
For detailed hyperparameters setting and description, please checkout example config file of NPC.
For all run-time options, use -h flag.
Implementation of Autoregressive Predictive Coding (APC, 2019, Chung et al.) and Vector-Quantized APC (VQ-APC, 2020, Chung et al.) are also available using similar training/downstream execution with example config files here.

Some notes

We found the unmasked feature produced by the last ConvBlock layer a better representation. In the phone classification tasks, switching to the unmasked feature (PER 25.6%) provided a 1.6% improvement over the masked feature (PER 27.2%). Currently, this is not included in the preprint version and will be updated to the paper in the future. Please refer to downstream examples to activate this option.
APC/VQ-APC are implemented with the following modifications for improvement (for the unmodified version, please visit the official implementation of APC / VQAPC)
- Multi-group VQ available for VQ-APC, but with VQ on last layer only
- Using utterance-wised CMVN surface feature（just as NPC did)
- Using Gumbel Softmax from official API of pytorch
See package requirement for toolkits used, tensorboard can be used to access log files in --logdir.

Contact

Feel free to contact me for questions or feedbacks, my email can be found in the paper or my personal page.

Citation

If you find our work and/or this repository helpful, please do consider citing us

@article{liu2020nonautoregressive,
  title   = {Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies},
  author  = {Liu, Alexander and Chung, Yu-An and Glass, James},
  journal = {arXiv preprint arXiv:2011.00406},
  year    = {2020}
}

Non-Autoregressive Predictive Coding

Related tags

Overview

Non-Autoregressive Predictive Coding

Some notes

Contact

Citation

Owner

Alexander H. Liu

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

Tool which allow you to detect and translate text.

This repository serves as a place to document a toy attempt on how to create a generative text model in Catalan, based on GPT-2

NLP topic mdel LDA - Gathered from New York Times website

ChainKnowledgeGraph, 产业链知识图谱包括A股上市公司、行业和产品共3类实体

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

CCF BDCI 2020 房产行业聊天问答匹配赛道 A榜47/2985

Arabic-Phonetic-Output - You can input the phonetic version of any Arabic text here. This software will show you output in Arabic (with vowels)

RoNER is a Named Entity Recognition model based on a pre-trained BERT transformer model trained on RONECv2

Making text a first-class citizen in TensorFlow.

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

Converts text into a PDF of handwritten notes

Snowball compiler and stemming algorithms

Transformers implementation for Fall 2021 Clinic

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

A library for finding knowledge neurons in pretrained transformer models.

Code for evaluating Japanese pretrained models provided by NTT Ltd.

profile tools for pytorch nn models

Chinese Grammatical Error Diagnosis