Implementing a simplified copy of the Shazam application from scratch using MinHashing and LSH.

Last update: Nov 17, 2022

Overview

Building Shazam from scratch

In this repository we implement a simplified copy of the Shazam application that can tell you the name of a song after listening to a short sample.

Overview

Converting the songs from mp3 to wav with Librosa and extracting the peaks
MinHashing with permutations on the shingles matrix
Locality-sensitive hashing to divide the songs into buckets
Shazam!
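
The MinHashing and LSH steps above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the code in function.py: the function names, the toy shingle sets, and the parameters (20 permutations, 10 bands) are all hypothetical, and each song is assumed to be already reduced to a non-empty set of shingle indices derived from its peaks.

```python
import random
from collections import defaultdict


def make_permutations(n_shingles, n_perm, seed=0):
    """Return n_perm random permutations of the shingle row indices."""
    rng = random.Random(seed)
    perms = []
    for _ in range(n_perm):
        perm = list(range(n_shingles))
        rng.shuffle(perm)  # perm[i] is the permuted position of shingle i
        perms.append(perm)
    return perms


def minhash_signature(shingles, perms):
    """For each permutation, keep the smallest permuted position among the
    shingles present in the song (shingles must be a non-empty set)."""
    return [min(perm[s] for s in shingles) for perm in perms]


def band_keys(signature, bands):
    """Cut a signature into `bands` equal slices, one bucket key per band."""
    rows = len(signature) // bands
    return [(b, tuple(signature[b * rows:(b + 1) * rows])) for b in range(bands)]


def build_buckets(signatures, bands):
    """Map every band key to the set of songs that produced it."""
    buckets = defaultdict(set)
    for name, sig in signatures.items():
        for key in band_keys(sig, bands):
            buckets[key].add(name)
    return buckets


def query_candidates(signature, buckets, bands):
    """Songs sharing at least one bucket with the query signature."""
    found = set()
    for key in band_keys(signature, bands):
        found |= buckets.get(key, set())
    return found


# Toy data (hypothetical): each song is a set of shingle indices in 0..29.
songs = {
    "song_a": {0, 1, 2, 3, 4, 5},
    "song_b": {10, 11, 12, 13, 14, 15},
    "song_c": {20, 21, 22, 23, 24, 25},
}
perms = make_permutations(n_shingles=30, n_perm=20, seed=42)
signatures = {name: minhash_signature(s, perms) for name, s in songs.items()}
buckets = build_buckets(signatures, bands=10)

# A short sample that shares most of its shingles with song_a.
sample = {0, 1, 2, 3, 4}
print(query_candidates(minhash_signature(sample, perms), buckets, bands=10))
```

The query has to be signed with the same permutations as the stored songs, otherwise the signatures are not comparable. More bands with fewer rows each make a match more likely for loosely similar songs; fewer, longer bands make the buckets stricter.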

Contents

pickle is a folder that contains the song peaks, the shingles array and the shingles matrix in pickle format.
ShazamLSH.ipynb is the main notebook; it contains only the explanation of the steps and some comments.
function.py contains all the functions needed to run the notebook.

Resources

This is the dataset we used and processed:

https://www.kaggle.com/dhrumil140396/mp3s32k

We also share some useful links that can help explain how MinHashing and LSH are used to recognise a song:

Owner

Arturo Ghinassi
