In this project we can see how we can generate automatic music using character RNN.

Overview

Automatic Music Genaration

Table of Contents

  1. Project Description
  2. Approach towards the problem
  3. Limitations
  4. Libraries Used
  5. Summary
  6. Applications
  7. References


1. Project Description

  • Natural Language Processing in Artificial Intelligence is the application of computational techniques to the analysis and synthesis of natural language and speech.

  • For a hypothetical scenario it was assumed that ABC music prod. pvt.ltd is a reknowned audio-video production house based out of Mumbai, India

  • As COVID-19 cases are increasing day by day it is almost impossible for the musicians to coop up with real time studio work.

  • Hence, the company wants you to make an AI based music genaration system.

  • The goal of this project is to make an AI based music genaration system.

  • The key contraint to the problem is accruracy.

  • You have been hired as a freelance data scientist for ABC music prod. pvt.ltd

  • The model should read a text file in abc format.

  • The model should genarate the corresponding music framed out of that note sequence.


2. Approach towards the problem

  • The approach to this project was to make a chat bot from scratch.

  • Initially all the necessary libraries were imported and installed.

  • Then, the data corpus was formed.

  • A data corpus is a collection of linguistic data.

  • Then while performing preprocessing we used lemitization technique in order to process the data into desirable format.

  • Then after preprocessing, we genarated various batches to fit into the model.

  • After that we build a charecter RNN model and trained the model for 100 epochs with a batch size of 16 and a sequence length of 64

  • For every 10 epochs the model would save the updated weights in the main directory

  • We recieved a validation accuracy of 92%

  • Now we genarated the music code which is nothing but the predicted vocabulary

  • After that we went here and we were successfully able to genarate music.

3. Limitation(s)

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • We have trained this model for only 100 epochs. As the number of epochs increase it is expected that the accuracy of the model will increase.

4. Libraries Used

Following are the list of libraries that were used for making this project.

  • Python was used as the general purposae programming language

  • Keras was used to perform all Deep Learning operation(s) such as model bulding, compliation and training.

  • numpy was used in order to calcucate numercal operations.

5. Summary

  • In this project an Automatic music genaration system was made from scratch.

  • Here, we recieved a validation accuracy of 92%

  • This project can be widely used for music production systems

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

6. Applications

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

7. References

Owner
Pronay Ghosh
A data-driven professional who grows day by day. Currently working as a Data Science Researcher at The International School of AI and Data Science.
Pronay Ghosh
Simple, hackable offline speech to text - using the VOSK-API.

Nerd Dictation Offline Speech to Text for Desktop Linux. This is a utility that provides simple access speech to text for using in Linux without being

Campbell Barton 844 Jan 07, 2023
Spotify Song Recommendation Program

Spotify-Song-Recommendation-Program Made by Esra Nur Özüm Written in Python The aim of this project was to build a recommendation system that recommen

esra nur özüm 1 Jun 30, 2022
Suyash More 111 Jan 07, 2023
:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

SpeechPy Official Project Documentation Table of Contents Documentation Which Python versions are supported Citation How to Install? Local Installatio

Amirsina Torfi 870 Dec 27, 2022
DCL - An easy to use diacritic library used for diacritic and accent manipulation.

Diacritics Library This library is used for adding, and removing diacritics from strings. Getting started Start by importing the module: import dcl DC

Kreus Amredes 6 Jun 03, 2022
A Python 3 script for capturing and recording a SDR stream to a WAV file (or serving it to a HTTP audio stream).

rfsoapyfile A Python 3 script for capturing and recording a SDR stream to a WAV file (or serving it to a HTTP audio stream). The script is threaded fo

4 Dec 19, 2022
Guide & Examples to create deeplearning gstreamer plugins and use them in your pipeline

upai-gst-dl-plugins Guide & Examples to create deeplearning gstreamer plugins and use them in your pipeline Introduction Thanks to the work done by @j

UPAI.IO 11 Dec 11, 2022
Speech recognition module for Python, supporting several engines and APIs, online and offline.

SpeechRecognition Library for performing speech recognition, with support for several engines and APIs, online and offline. Speech recognition engine/

Anthony Zhang 6.7k Jan 08, 2023
MelGAN test on audio decoding

Official repository for the paper MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis The original work URL: https://github.com

Jurio 1 Apr 29, 2022
SolidMusic rewrite version, need help

Telegram Streamer Bot This is rewrite version of solidmusic, but it can't be deployed now, help me to make this bot running fast and good. If anyone w

Shohih Abdul 63 Jan 06, 2022
Analysis of voices based on the Mel-frequency band

Speaker_partition_module Analysis of voices based on the Mel-frequency band. Goal: Identification of voices speaking (diarization) and calculation of

1 Feb 06, 2022
A python library for working with praat, textgrids, time aligned audio transcripts, and audio files.

praatIO Questions? Comments? Feedback? A library for working with praat, time aligned audio transcripts, and audio files that comes with batteries inc

Tim 224 Dec 19, 2022
Telegram Bot to play music in VoiceChat with Channel Support and autostarts Radio.

VCPlayerBot Telegram bot to stream videos in telegram voicechat for both groups and channels. Supports live streams, YouTube videos and telegram media

Abdisamad Omar Mohamed 1 Oct 15, 2021
Anaphones are like anagrams, but for sounds.

Anaphones Anaphones are like anagrams but for sounds (phonemes). Examples include: salami-awesomely, atari-tiara, and beefy-phoebe. Anaphones can be a

James Murphy 18 Nov 02, 2022
無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

Hiroshiba 0 Aug 29, 2022
Sequencer: Deep LSTM for Image Classification

Sequencer: Deep LSTM for Image Classification Created by Yuki Tatsunami Masato Taki This repository contains implementation for Sequencer. Abstract In

Yuki Tatsunami 111 Dec 16, 2022
Bot duniya Music Player

Bot duniya Music Player Requirements 📝 FFmpeg (Latest) NodeJS nodesource.com (NodeJS 17+) Python (3.10+) PyTgCalls (Lastest) 2nd Telegram Account (ne

Aman Vishwakarma 16 Oct 21, 2022
BART aids transcribe tasks by taking a source audio file and creating automatic repeated loops, allowing transcribers to listen to fragments multiple times

BART (Beyond Audio Replay Technology) aids transcribe tasks by taking a source audio file and creating automatic repeated loops, allowing transcribers to listen to fragments multiple times (with poss

2 Feb 04, 2022
A rofi-blocks script that searches youtube and plays the selected audio on mpv.

rofi-ytm A rofi-blocks script that searches youtube and plays the selected audio on mpv. To use the script, run the following command rofi -modi block

Cliford 26 Dec 21, 2022
Convert complex chord names to midi notes

ezchord Simple python script that can convert complex chord names to midi notes Prerequisites pip install midiutil Usage ./ezchord.py Dmin7 G7 C timi

Alex Zhang 2 Dec 20, 2022