music-ai

Deep learning transformer model that generates unique music sequences.

Abstract

In 2017, a new state-of-the-art was published for natural language processing: the Transformer. Relying solely on attention mechanisms, the Transformer outperformed existing solutions based on recurrent and convolutional neural networks¹. However, recurrent neural networks, long short-term memory, and gated recurrent neural networks remain dominant in the field of generative music. I aim to introduce the Transformer into the field of music, with the goal of teaching the deep learning model to predict the second half of a composition given the first half. A Transformer equipped with 32 attention heads and sinusoidal positional encoding was trained on the Nottingham MIDI dataset for 5000 epochs over a period of 48 hours, optimized by stochastic gradient descent and measured with cross entropy loss, and regulated by an exponential learning rate decrease schedule. For the first thousand epochs, the model had noticeable improvement but lacked arrangement to the generated sequences. By five thousand epochs, the model clearly demonstrated the knowledge of general music trends used to better predict how classical composers write their pieces, and most tracks were melodic to the human ear. Future applications of this technique include generating tracks for various instruments, rating the quality of existing music tracks, and complete originality if combined with a generative network mapping melodies to latent space.

¹ Attention Is All You Need

Video

Hardware

Ubuntu

32 GB RAM
Intel Core i3-4170 CPU @3.70 GHz x4 (4 GB RAM)
NVIDIA GeForce GTX 1050 Ti

Deep learning transformer model that generates unique music sequences.

Related tags

Overview

music-ai

Abstract

Video

Hardware

Owner

xacer

A voice assistant which can handle your everyday task and allows you to book items from your favourite store!

Terminal-based audio-to-text converter

Spotipy - Player de música simples em Python

IDing the songs played on the do you radio show

Voicefixer aims at the restoration of human speech regardless how serious its degraded.

Python wrapper around sox.

XA Music Player - Telegram Music Bot

GNU Radio – the Free and Open Software Radio Ecosystem

The project aims to develop a personal-assistant for Windows & Linux-based systems

TwitterMusicBot - A Twitter bot with Spotify integration.

A python program to cut longer MP3 files (i.e. recordings of several songs) into the individual tracks.

Mopidy is an extensible music server written in Python

Okaeri-Music is a telegram music bot project, allow you to play music on voice chat group telegram.

Analysis of voices based on the Mel-frequency band

Python interface to the WebRTC Voice Activity Detector

Codes for "Efficient Long-Range Attention Network for Image Super-resolution"

This Is Telegram Music UserBot To Play Music Without Being Admin

PatrikZero's CS:GO Hearing protection

Mentos Music Bot With Python

?️ Open Source Audio Matching and Mastering