ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Last update: Dec 02, 2022

Related tags

Text Data & NLP ConferencingSpeech2022

Overview

ConferencingSpeech 2022 challenge

This repository contains the datasets list and scripts required for the ConferencingSpeech 2022 challenge. For more details about the challenge, please see our website.

Details

baseline, this folder contains baseline system include inference model exported by inference scripts;
eval, this folder contains evaluation scripts to calculate PLCC, RMSE and SRCC;
data-sets, this folder contains training and development test data-sets provied to the participant;
- Tencent Corpus, this dataset includes about 14,000 speech chinese speech clips with simulated (e.g. codecs, packet-loss, background noise) and live conditions.
- NISQA Corpus, the NISQA Corpus includes more than 14,000 speech samples with simulated (e.g. codecs, packet-loss, background noise) and live (e.g. mobile phone, Zoom, Skype, WhatsApp) conditions.
- IU Bloomington Corpus, there are 10,000 speech signals extracted from COSINE and VOiCESdatasets, each truncated between 3 to 6 seconds long.
- PSTN Corpus, there are about 80,000 speech clips through classic public switched telephone networks, each truncated 10 seconds long.

Requirements

To install requirements install Anaconda and then use:

conda env create -f envs.yml

This will create a new environment with the name "conferencingSpeech". Activate this environment to go on:

conda activate conferencingSpeech

Code license

Apache 2.0

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Related tags

Overview

ConferencingSpeech 2022 challenge

Details

Requirements

Code license

Owner

Code for hyperboloid embeddings for knowledge graph entities

New Modeling The Background CodeBase

A unified tokenization tool for Images, Chinese and English.

An attempt to map the areas with active conflict in Ukraine using open source twitter data.

Simple Python script to scrape youtube channles of "Parity Technologies and Web3 Foundation" and translate them to well-known braille language or any language

This program do translate english words to portuguese

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Various Algorithms for Short Text Mining

Shared code for training sentence embeddings with Flax / JAX

基于GRU网络的句子判断程序/A program based on GRU network for judging sentences

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Outreachy TFX custom component project

HF's ML for Audio study group

VMD Audio/Text control with natural language

华为商城抢购手机的Python脚本 Python script of Huawei Store snapping up mobile phones

EasyTransfer is designed to make the development of transfer learning in NLP applications easier.

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Twitter bot that uses NLP models to summarize news articles referenced in a user's twitter timeline