TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Last update: Dec 01, 2022

Overview

TONet

Introduction

The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music", in ICASSP 2022

We propose TONet, a plug-and-play model that improves both tone and octave perceptions by leveraging a novel input representation and a novel network architecture. Any CFP-input-based Model can be settled in TONet and lead to possible better performance.

Main Results on Extraction Performance

Experiments are done to verify the capability of TONet with various baseline backbone models. Our results show that tone-octave fusion with Tone-CFP can significantly improve the singing voice extraction performance across various datasets -- with substantial gains in octave and tone accuracy.

Getting Started

Download Datasets

After downloading the data, use the txt files in the data folder, and process the CFP feature by feature_extraction.py.

Overwrite the Configuration

The config.py contains all configurations you need to change and set.

Train and Evaluation

python main.py train

python main.py test

Produce the Estimation Digram

Uncomment the write prediction in tonet.py

Model Checkpoints

We provide the best TO-FTANet checkpoints in this link. More checkpoints will be uploaded.

Citing

@inproceedings{tonet-ke2022,
  author = {Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov},
  title = {TONet: Tone-Octave Network for Singing Melody Extraction  from Polyphonic Music},
  booktitle = {{ICASSP} 2022}
}

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Related tags

Overview

TONet

Introduction

Main Results on Extraction Performance

Getting Started

Download Datasets

Overwrite the Configuration

Train and Evaluation

Produce the Estimation Digram

Model Checkpoints

Citing

Owner

Knut(Ke) Chen

Audio features extraction

Just-Music - Spotify API Driven Music Web app, that allows to listen and control and share songs

A collection of python scripts for extracting and analyzing acoustics from audio files.

Python implementation of the Short Term Objective Intelligibility measure

Library for Python 3 to communicate with the Google Chromecast.

A simple music player, powered by Python, utilising various libraries such as Tkinter and Pygame

Audio2midi - Automatic Audio-to-symbolic Arrangement

[Singing Log] Let your program learn to sing!

Tune in is a Collaborative Music Playing Systems where multiple guests can join a room and enjoy the song being played

live coding in python + supercollider

A Python 3 script for capturing and recording a SDR stream to a WAV file (or serving it to a HTTP audio stream).

Expressive Digital Signal Processing (DSP) package for Python

Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21

A Python port and library-fication of the midicsv tool by John Walker.

Music player - endlessly plays your music

This is my voice assistant Patric!

gentle forced aligner

Speech Algorithms Collections

A Simple Script that will help you to Play / Change Songs with just your Voice

This bot can stream audio or video files and urls in telegram voice chats