A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Last update: May 25, 2022

Related tags

Text Data & NLP midi_language

Overview

MIDI Language

Introduction

Reference

Paper: Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions: code

This is a modified version with an extension of multi-instrumental support.

Function

Convert Midi into event sequence, and represented by mapped integer array.

This could send to NLP models for AI auto music composition.

Due to this project considers more about music structures as well as its chord and melody on higher level, including note, drum, tempo, musical instrument (program in midi) and its expressions (tempo and velocity), rather than digging into too much details like sound source & direction, instrumental performing techniques (such as, bend sound, piano sustain pedal, violin overtones), the language of MIDI is design this way (see chapter Details below).

Usage

See language.py, it contains procedures:

load w2i (word to integer) and i2w (integer to word), for not calculating it every time;
encode midi to iteger array, each object handle one mid file;
decode integer array to midi, each object handle many results and export to mid files;

The code language.py has arguments:

input: input file of audio file to encode/decode;
output: output file of audio file to encode;
train: if have, it will switch to training mode with variations (data augmentation);

MidiEncoder data augmentation:

pitch_variation_range: a random pitch shift within a range for whole midi;
velocity_scale_variation_range: a random note/drum velocity scale for whole midi;
velocity_noise_scale_variation_range: a random note/drum velocity scale for each element within midi;
tempo_scale_variation_range: a random tempo change for whole midi;

MidiDecoder needs numerator and denominator time signatures for reconstructing midi files.

Details

Event Structure

Required:

Bar
Position (0~split-1)

Optional:

note:
- Note
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
drum:
- Drum
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
chord:
- Chord (chroma_name:chord_name)
tempo:
- Tempo_Class (T0~Ti)
- Tempo_Value (0~59)

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Related tags

Overview

MIDI Language

Introduction

Reference

Function

Usage

Details

Event Structure

Required:

Optional:

Owner

Robert Bogan Kang

BERT-based Financial Question Answering System

Spert NLP Relation Extraction API deployed with torchserve for inference

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Fidibo.com comments Sentiment Analyser

My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensorflow

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Dé op-de-vlucht Pieton vertaler. Wereldwijd gebruikt door meer dan 1.000+ succesvolle bedrijven!

Repository for the paper "Optimal Subarchitecture Extraction for BERT"

PyTorch Implementation of "Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS Tagging" (Findings of ACL 2022)

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Repository of the Code to Chatbots, developed in Python

Task-based datasets, preprocessing, and evaluation for sequence models.

MMDA - multimodal document analysis

A unified tokenization tool for Images, Chinese and English.

Quantifiers and Negations in RE Documents

jiant is an NLP toolkit

Torchrecipes provides a set of reproduci-able, re-usable, ready-to-run RECIPES for training different types of models, across multiple domains, on PyTorch Lightning.

Two-stage text summarization with BERT and BART

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context