wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit is to...

Small-footprint keyword spotting (KWS), or more specifically wake-up word (WuW) detection, is a typical and important module in Internet of Things (IoT) devices. It gives users a hands-free way to control IoT devices. A WuW detection system usually runs locally and persistently on the device, so it requires low power consumption, few model parameters, and low computational complexity. It must also detect predefined keywords in a streaming fashion, i.e., with low latency.
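To make the streaming requirement concrete, here is a minimal sketch of frame-by-frame detection. It is not the wenet-kws API: the function name `stream_detect`, its parameters, and the assumption that some acoustic model already emits one keyword probability per audio frame are all hypothetical. The sketch smooths the per-frame scores with a short moving average and triggers when the smoothed score crosses a threshold, keeping only a fixed-size window in memory.

```python
from collections import deque
from typing import Iterable, Iterator, Tuple


def stream_detect(
    posteriors: Iterable[float],
    window: int = 3,
    threshold: float = 0.8,
) -> Iterator[Tuple[int, float]]:
    """Yield (frame_index, smoothed_score) each time the keyword fires.

    `posteriors` is a hypothetical stream of per-frame keyword
    probabilities; the model that produces them is out of scope here.
    """
    recent = deque(maxlen=window)  # memory is bounded by `window` frames
    armed = True  # avoids re-triggering while the score stays high
    for i, p in enumerate(posteriors):
        recent.append(p)
        score = sum(recent) / len(recent)  # moving-average smoothing
        if armed and score >= threshold:
            armed = False
            yield i, score
        elif score < threshold:
            armed = True  # re-arm once the score drops below threshold


if __name__ == "__main__":
    # Made-up scores for illustration only.
    frames = [0.1, 0.2, 0.9, 0.95, 0.9, 0.9, 0.1, 0.1, 0.1]
    for index, score in stream_detect(frames):
        print(f"keyword detected at frame {index} (score {score:.2f})")
```

Because the detector consumes one frame at a time and stores only the last `window` scores, its latency and memory use do not grow with the length of the audio stream. The window size and threshold shown are placeholder values, not tuned settings.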

Typical Scenario

We are going to support the following typical wake-up word applications:

Single wake-up word
Multiple wake-up words
Customizable wake-up word
Personalized wake-up word, i.e., a combination of wake-up word detection and voiceprint recognition

Dataset

We plan to support a variety of open-source wake-up word datasets, including but not limited to:

All well-trained models on these datasets will be made publicly available.

Runtime

We plan to support a variety of hardware and platforms, including:

Web browser
x86
Android
Raspberry Pi
