Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Last update: Jan 02, 2023

Overview

RAVE: Realtime Audio Variational autoEncoder

Official implementation of RAVE: A variational autoencoder for fast and high-quality neural audio synthesis (article link)

Installation

RAVE requires Python 3.9. Install the dependencies with

pip install -r requirements.txt
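Since RAVE targets Python 3.9, it can be worth confirming the interpreter version before installing the dependencies. The following is a minimal sketch, not part of the repository: the helper name `is_supported` and the exact-match policy on major.minor are assumptions made for illustration.

```python
import sys

# Version stated in this README; the exact-match policy is an assumption.
REQUIRED = (3, 9)


def is_supported(version_info=sys.version_info, required=REQUIRED):
    """Return True when the interpreter's (major, minor) equals `required`."""
    return tuple(version_info[:2]) == tuple(required)


if __name__ == "__main__":
    if is_supported():
        print("Python version OK")
    else:
        found = f"{sys.version_info[0]}.{sys.version_info[1]}"
        print(f"Expected Python {REQUIRED[0]}.{REQUIRED[1]}, found {found}")
```

Running this with the interpreter you plan to use for `pip install` shows whether it matches the stated requirement.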

Training

Both RAVE and the prior model are available in this repo. For most users we recommend using the cli_helper.py script, which generates a set of instructions for training and exporting both RAVE and the prior model on a specific dataset.

python cli_helper.py

However, if you want to customize your training further, you can run the provided train_{rave, prior}.py and export_{rave, prior}.py scripts manually.
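The brace shorthand above stands for four separate scripts. As a rough sketch, the snippet below expands the shorthand into the individual file names and reports any that are absent from a given directory. The helpers `script_names` and `missing_scripts` are hypothetical and not part of the repository, and the command-line arguments of the scripts are deliberately not shown here because this README does not document them.

```python
from itertools import product
from pathlib import Path

# The two stages and two models named in the brace shorthand.
STAGES = ("train", "export")
MODELS = ("rave", "prior")


def script_names():
    """Expand {train, export}_{rave, prior}.py into four file names."""
    return [f"{stage}_{model}.py" for stage, model in product(STAGES, MODELS)]


def missing_scripts(repo_root="."):
    """Return the expected script names not found under `repo_root`."""
    root = Path(repo_root)
    return [name for name in script_names() if not (root / name).is_file()]


if __name__ == "__main__":
    for name in script_names():
        print(name)
```

Calling `missing_scripts()` from the repository root should return an empty list if all four scripts are present.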

Realtime usage

[NOT AVAILABLE YET]

RAVE and the prior model can be used in realtime inside Max/MSP, allowing creative interactions with both models. Code and details for this part of the project are not available yet; we are currently working on the corresponding article.

An audio example of the prior sampling patch is available in the docs/ folder.

Owner

Antoine Caillon
