BART aids transcription tasks by taking a source audio file and creating automatically repeated loops, allowing transcribers to listen to each fragment multiple times.

Last update: Feb 04, 2022

Overview

 ____   ____ ____  ______
|    \ /    |    \|      |
|  o  )  o  |  D  )      |
|     |     |    /|_|  |_|
|  O  |  _  |    \  |  |
|     |  |  |  .  \ |  |
|_____|__|__|__|\_| |__|

"Are you transcribing a conversation? BART can help!"

BART (Beyond Audio Replay Technology) aids transcription tasks by taking a source audio file and creating automatically repeated loops, allowing transcribers to listen to each fragment multiple times (with possible overlap between segments).

Installation

Make sure to clone this repository and install Python dependencies using:

pip install -r requirements.txt

BART relies on Pydub for processing audio. Pydub needs ffmpeg or libav to be available on your system. Read Pydub's notes on these dependencies carefully.

Installing ffmpeg on macOS

On my macOS system, I did the following.

Download the ffmpeg and ffprobe binaries from: https://evermeet.cx/ffmpeg/
Unarchive the downloaded files and copy them to a directory on your $PATH, e.g.:

sudo cp ffmpeg /usr/local/bin
sudo cp ffprobe /usr/local/bin

Double check they are in your $PATH:

which ffmpeg || echo "ffmpeg not found"
which ffprobe || echo "ffprobe not found"
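
The same check can be done from Python, which is handy if you want to fail early before Pydub tries to decode anything. This is a minimal sketch using only the standard library; the helper name find_audio_tools is hypothetical and not part of BART.

```python
import shutil


def find_audio_tools(names=("ffmpeg", "ffprobe")):
    """Map each tool name to its resolved path, or None if it is not on PATH.

    Hypothetical helper for illustration; BART itself does not ship this.
    """
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    for name, path in find_audio_tools().items():
        print(f"{name}: {path or 'not found'}")
```

A value of None for either tool means Pydub will most likely be unable to read or write compressed formats such as MP3.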

Usage

BART is very simple and expects all input files in the input/ directory.

Moreover, BART is very naive. It always writes output files to the output/ directory with exactly the same name as the input file. If the output file already exists, BART will overwrite it without warning.
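
Because of the silent overwrite, you may want to check for an existing output file before running BART. The sketch below mirrors the naming rule described above (same file name, output/ directory). The function names are hypothetical, and the directory name is taken from this README rather than from BART's source.

```python
from pathlib import Path


def output_path_for(input_name, output_dir="output"):
    """Return where the result would be written: same file name, output directory."""
    return Path(output_dir) / Path(input_name).name


def would_overwrite(input_name, output_dir="output"):
    """True if a file with the same name already exists in the output directory."""
    return output_path_for(input_name, output_dir).exists()


if __name__ == "__main__":
    name = "test.mp3"
    if would_overwrite(name):
        print(f"{output_path_for(name)} exists and would be overwritten")
    else:
        print(f"{output_path_for(name)} does not exist yet")
```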

BART is very chatty and will tell you when he adds a segment and when he adds a pause.

Example

python main.py --input test.mp3 --segment-length 10 --pause-length 1.0 --segment-step 5 --segment-repeat 2
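
To get a feel for what these options produce, here is a rough sketch of the resulting playback order. It is not BART's actual implementation and rests on assumptions: all lengths are in seconds, each segment is played --segment-repeat times, a pause follows every playback, and the next segment starts --segment-step seconds after the previous one. The function names are hypothetical.

```python
def segment_windows(total_s, segment_length, segment_step):
    """Return (start, end) windows in seconds that cover the whole recording."""
    if segment_length <= 0 or segment_step <= 0:
        raise ValueError("segment_length and segment_step must be positive")
    if total_s <= 0:
        return []
    windows = []
    start = 0
    while True:
        end = min(start + segment_length, total_s)
        windows.append((start, end))
        if end >= total_s:  # the last window reached the end of the audio
            break
        start += segment_step
    return windows


def playback_plan(total_s, segment_length, segment_step, segment_repeat, pause_length):
    """Expand the windows into an ordered list of segment and pause events."""
    plan = []
    for start, end in segment_windows(total_s, segment_length, segment_step):
        for _ in range(segment_repeat):
            plan.append(("segment", start, end))
            plan.append(("pause", pause_length))
    return plan


if __name__ == "__main__":
    # Same values as the example command, applied to a 20-second recording.
    for event in playback_plan(20, 10, 5, 2, 1.0):
        print(event)
```

With a step smaller than the segment length, as in the example, consecutive segments overlap (here by 5 seconds), so words cut off at the end of one segment are heard in full in the next.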

To list all available options:

python main.py -h
