[제 13회 투빅스 컨퍼런스] OK Mugle! - 장르부터 멜로디까지, Content-based Music Recommendation

Last update: Oct 09, 2022

Related tags

Deep Learning OkMugle

Overview

Ok Mugle! 🎵

장르부터 멜로디까지, Content-based Music Recommendation

'Ok Mugle!'은 제13회 투빅스 컨퍼런스(2022.01.15)에서 진행한 음악 추천 프로젝트입니다.

Description 📖

본 프로젝트에서는 Kakao Arena에서 제공하는 Melon Playlist Continuation 데이터를 활용하여, 사용자가 검색한 노래와 유사한 노래 추천을 구현하였습니다.

[Model] '유사성'의 기준을 멜로디, 분위기, 상황, 장르 등으로 정의
- 해당 요소 반영하여 Music2Vec, Time Convolutional AutoEncoder, ConsineEmbeddingLoss Multimodal 등의 모델 Building
[Retrieval] Embedding의 Cosine Similarity를 구하여 Retrieval 구성
[Ranking] 다양한 Ranking Method 사용 → 추천 결과 Ensemble
[Serving] 최종적으로 Score Total Top 10 Ranking Method의 추천 결과 활용하여 Web 구현 & 모델 Serving

Usage ✔️

Windows Shell에 아래 명령을 입력하여 실행합니다.

set FLASK_APP=server
set FLASK_ENV=development
flask run

Result (Web) 💻

웹(ToBigs 13th Conference Music Recommendation) 바로가기
웹 메인화면

검색창에 '비투비 - 비밀 (Insane) (Acoustic Ver.)'를 검색한 결과 화면

Presentation 🙋

컨퍼런스 발표영상과 보고서입니다. 자세한 분석 내용은 아래 링크를 통해 확인해주세요!

Contributor 🧑‍🤝‍🧑

본 프로젝트에는 빅데이터 분석 및 인공지능 대표 연합동아리 ToBig's 멤버들이 참여하였습니다.

기수	이름
15기	이성범
16기	김권호
16기	박한나
16기	이승주
16기	이예림
16기	주지훈
7기	이광록(멘토)

File Directory 📂

Ok Mugle!
├── 1. preprocessig
│   ├── make_song_meta_and_playlist.ipynb       # 노래, 플레이리스트 데이터 전처리
│   ├── make_mel_data.ipynb                     # 멜 데이터 전처리
│   └── make_mel_batch_data.ipynb               # 멜 데이터 배치 단위로 전처리
│
├── 2. model
│   ├── genre_embedding_model.ipynb             # Music2Vec
│   ├── mel_embedding_model.ipynb               # Time Convolutional Autoencoder
│   └── genre_and_mel_embedding_model.ipynb     # CosineEmbeddingLoss Multimodal
│
├── 3. embedding-visualization
│   └── embedding_visualization_tsne.ipynb      # t-SNE를 활용한 각 임베딩별 시각화
│
├── 4. ranking
│   ├── make_ranking_data_preprocessig.ipynb    # 각 임베딩별 코사인 유사도 Top50 데이터 셋 제작 
│   ├── make_ranking_data_multiprocessig.py     # make_ranking_data_preprocessig의 multiprocessig을 위한 함수
│   ├── make_ranking_data.ipynb                 # 순위별 가중치 ranking, 각 임베딩 별 상위 Top3 ranking
│   └── cos_sim_music_serving.ipynb             # 각 임베딩, ranking 별 결과
│
└── 5. web
    ├── crawling                                # 결과창 구현을 위한 데이터 수집
    │   └── melon_crawling.py 
    │ 
    ├── data                                    # 웹 제작에 활용된 데이터
    │    ├── ranking_song_id2playlist.json
    │    ├── song_id2artist_name_basket.json
    │    ├── song_id2song_name.json
    │    └── song_name_artist_name2song_id.json
    │ 
    ├── static                                  # 웹 제작에 활용된 css, font, image, js
    │    ├── css
    │    ├── fonts
    │    ├── images
    │    └── js
    │ 
    ├── templates                               # 프론트 구현
    │    ├── about.html
    │    ├── index.html
    │    ├── people.html
    │    └── result.html
    │ 
    └── server.py                               # 백엔드 구현
    │
    └── requirements.txt                        # 필요 패키지 목록

[제 13회 투빅스 컨퍼런스] OK Mugle! - 장르부터 멜로디까지, Content-based Music Recommendation

Related tags

Overview

Ok Mugle! 🎵

장르부터 멜로디까지, Content-based Music Recommendation

Description 📖

Usage ✔️

Result (Web) 💻

Presentation 🙋

Contributor 🧑‍🤝‍🧑

File Directory 📂

Owner

SeongBeomLEE

Official implementation of NeurIPS'2021 paper TransformerFusion

[LREC] MMChat: Multi-Modal Chat Dataset on Social Media

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

A simple implementation of Kalman filter in single object tracking

SlideGraph+: Whole Slide Image Level Graphs to Predict HER2 Status in Breast Cancer

Spatial Intention Maps for Multi-Agent Mobile Manipulation (ICRA 2021)

Official Code for ICML 2021 paper "Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline"

Image Captioning using CNN and Transformers

Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

Swapping face using Face Mesh with TensorFlow Lite

Pre-trained NFNets with 99% of the accuracy of the official paper

LTR_CrossEncoder: Legal Text Retrieval Zalo AI Challenge 2021

This repository implements variational graph auto encoder by Thomas Kipf.

A toolkit for developing and comparing reinforcement learning algorithms.

Open AI's Python library

Ludwig is a toolbox that allows to train and evaluate deep learning models without the need to write code.

Face recognize system

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.