STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Last update: Oct 18, 2021

Related tags

Text Data & NLP st3

Overview

st3

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Currently it supports converting pbmm models to pt scripts with integrated beam search.

Check out the first pre-release: https://github.com/proger/st3/releases

PyTorch impelementations of BERT-based Spelling Error Correction Models

59 Jun 29, 2021

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

VAENAR-TTS - PyTorch Implementation PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

67 Nov 14, 2022

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

WaveGlow A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis Quick Start: Install requirements: pip install

204 Jul 14, 2022

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Deepvoice3_pytorch PyTorch implementation of convolutional networks-based text-to-speech synthesis models: arXiv:1710.07654: Deep Voice 3: Scaling Tex

1.8k Dec 30, 2022

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

730 Jan 9, 2023

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

Transformer Embedder A Word Level Transformer layer based on PyTorch and 🤗 Transformers. How to use Install the library from PyPI: pip install transf

27 Nov 20, 2022

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

CIF-PyTorch This is a PyTorch based implementation of continuous integrate-and-fire (CIF) module for end-to-end (E2E) automatic speech recognition (AS

24 Dec 29, 2022

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

pl_prompt_sst An example project using OpenPrompt under the framework of pytorch-lightning for a training prompt-based text classification model on SS

5 Oct 21, 2022

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding This repository contains the official PyTorch implementation of th

26 Dec 14, 2022

Releases(english1)

english1(Sep 13, 2021)
This is a conversion of Coqui English STT v0.9.3 model to TorchScript, allowing to deploy a speech recognizer as a single file. The TorchScript bundle is self-contained and runs DeepSpeech frontend and beam search returning 10 best results. LM Scorer is not supported at the moment.

To run, download the pt file and save the following code to recognize.py and make sure you have torchaudio installed using pip3 install torchaudio:

import torch, torchaudio, sys waveform, sr = torchaudio.load(sys.argv[1], normalize=True) assert sr == 16000 model = torch.jit.load('coqui-stt-0.9.3-models.pt') for transcript, scores in model(waveform.squeeze()): print(transcript, scores)

Now you can run the model on English recordings like below. Any format supported by TorchAudio backend should work.

python3 recognize.py sample.wav
Source code(tar.gz)
Source code(zip)
coqui-stt-0.9.3-models.pt(180.26 MB)

Owner

Vlad Ki

GitHub Repository

Installation, test and evaluation of Scribosermo speech-to-text engine

Scribosermo STT Setup Scribosermo is a LGPL licensed, open-source speech recognition engine to "Train fast Speech-to-Text networks in different langua

3 Jun 20, 2022

Leon is an open-source personal assistant who can live on your server.

Leon Your open-source personal assistant. Website :: Documentation :: Roadmap :: Contributing :: Story 👋 Introduction Leon is an open-source personal

11.7k Dec 30, 2022

ChatterBot is a machine learning, conversational dialog engine for creating chat bots

ChatterBot ChatterBot is a machine-learning based conversational dialog engine build in Python which makes it possible to generate responses based on

12.8k Jan 03, 2023

A CRM department in a local bank works on classify their lost customers with their past datas. So they want predict with these method that average loss balance and passive duration for future.

Rule-Based-Classification-in-a-Banking-Case. A CRM department in a local bank works on classify their lost customers with their past datas. So they wa

4 Mar 20, 2022

neural network based speaker embedder

Content What is deepaudio-speaker? Installation Get Started Model Architecture How to contribute to deepaudio-speaker? Acknowledge What is deepaudio-s

20 Dec 29, 2022

A list of NLP(Natural Language Processing) tutorials built on Tensorflow 2.0.

335 Jan 04, 2023

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

Japanese-LUW-Tokenizer Japanese Long-Unit-Word (国語研長単位) Tokenizer for Transformers based on 青空文庫 Basic Usage from transformers import RemBertToken

3 Dec 22, 2021

This is a MD5 password/passphrase brute force tool

CROWES-PASS-CRACK-TOOl This is a MD5 password/passphrase brute force tool How to install: Do 'git clone https://github.com/CROW31/CROWES-PASS-CRACK-TO

9 Mar 02, 2022

Experiments in converting wikidata to ftm

FollowTheMoney / Wikidata mappings This repo will contain tools for converting Wikidata entities into FtM schema. Prefixes: https://www.mediawiki.org/

2 Nov 12, 2021

Bot to connect a real Telegram user, simulating responses with OpenAI's davinci GPT-3 model.

AI-BOT Bot to connect a real Telegram user, simulating responses with OpenAI's davinci GPT-3 model.

2 Dec 21, 2022

texlive expressions for documents

tex2nix Generate Texlive environment containing all dependencies for your document rather than downloading gigabytes of texlive packages. Installation

70 Dec 26, 2022

Large-scale pretraining for dialogue

A State-of-the-Art Large-scale Pretrained Response Generation Model (DialoGPT) This repository contains the source code and trained model for a large-

1.8k Jan 07, 2023

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

LipGAN Generate realistic talking faces for any human speech and face identity. [Paper] | [Project Page] | [Demonstration Video] Important Update: A n

438 Dec 31, 2022

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Related tags

Overview

st3

You might also like...

PyTorch impelementations of BERT-based Spelling Error Correction Models

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Releases(english1)

english1(Sep 13, 2021)

Owner

Vlad Ki

Installation, test and evaluation of Scribosermo speech-to-text engine

Leon is an open-source personal assistant who can live on your server.

ChatterBot is a machine learning, conversational dialog engine for creating chat bots

A CRM department in a local bank works on classify their lost customers with their past datas. So they want predict with these method that average loss balance and passive duration for future.

neural network based speaker embedder

A list of NLP(Natural Language Processing) tutorials built on Tensorflow 2.0.

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

This is a MD5 password/passphrase brute force tool

Experiments in converting wikidata to ftm

Bot to connect a real Telegram user, simulating responses with OpenAI's davinci GPT-3 model.

texlive expressions for documents

Large-scale pretraining for dialogue

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

TextFlint is a multilingual robustness evaluation platform for natural language processing tasks,

基于pytorch_rnn的古诗词生成

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

Russian words synonyms and antonyms

Sequence modeling benchmarks and temporal convolutional networks

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

Train 🤗-transformers model with Poutyne.