Recognition of 38 speech commands in Russian. Based on the Yandex Cup 2021 ML Challenge: ASR

Last update: May 05, 2022

Overview

Speech_38_ru_commands

The program recognizes 38 Russian keywords spoken into a microphone. The supported keywords are:

дальше, вперед, назад, вверх, вниз, выше, ниже, домой, громче, тише, лайк, дизлайк, следующий, предыдущий, сначала, перемотай, выключи, стоп, хватит, замолчи, заткнись, останови, пауза, включи, смотреть, продолжи, играй, запусти, ноль, один, два, три, четыре, пять, шесть, семь, восемь, девять.

The model was prepared for the Yandex Cup 2021 ML Challenge: ASR competition. It took 3rd place among 54 participants, with an accuracy score of 92.01.

Download the model from https://disk.yandex.ru/d/L053qF-0OPKlog

Example of running the program:

python speech_38_ru_commands.py --porog 1.2

Here 1.2 is the confidence threshold for accepting a command. It can be set anywhere in the range 0.0 to 7.9999.
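
The sketch below illustrates how a threshold such as `--porog` could be parsed and applied. It is not taken from `speech_38_ru_commands.py`: the default value, the helper names `parse_porog` and `pick_command`, and the `scores` dictionary are all assumptions made for illustration. Only the flag name and the 0.0 to 7.9999 range come from the description above.

```python
import argparse

# Range as documented in the README; how the real script checks it is unknown.
POROG_MIN, POROG_MAX = 0.0, 7.9999


def parse_porog(argv=None):
    """Parse --porog and reject values outside the documented range."""
    parser = argparse.ArgumentParser()
    # The default of 1.2 is an assumption borrowed from the README example.
    parser.add_argument("--porog", type=float, default=1.2)
    args = parser.parse_args(argv)
    if not POROG_MIN <= args.porog <= POROG_MAX:
        parser.error(f"--porog must be between {POROG_MIN} and {POROG_MAX}")
    return args.porog


def pick_command(scores, porog):
    """Return the best-scoring command, or None if it is below the threshold.

    `scores` is a hypothetical dict mapping a command word to its confidence.
    """
    if not scores:
        return None
    best = max(scores, key=scores.get)
    return best if scores[best] >= porog else None
```

With this sketch, a higher threshold makes the program ignore more uncertain detections, while a lower one accepts more commands at the cost of more false triggers.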

Owner

Andrey
