Rhyme with AI

Last update: Nov 21, 2022

Overview

Local development

Create a conda virtual environment and activate it:

conda env create --file environment.yml
conda activate rhyme-with-ai

Install the rhyme_with_ai package and all its dependencies:

pip install --editable .

Download the model weights (if you get any errors, make sure they match the models specified in app/app.py):

make download-data

Run the app:

make streamlit

Test the Docker containers by running:

make docker-build
make docker-serve

This project uses black for code formatting. To run it as part of your version control workflow, follow the instructions below (copied from black's own readme):

Use pre-commit. Once you have it installed, add this to the .pre-commit-config.yaml in your repository:

repos:
-   repo: https://github.com/ambv/black
    rev: stable
    hooks:
    - id: black
      language_version: python3.7

Then run pre-commit install and you're ready to go.

Deploy to App Engine

Follow Google's documentation to set up Custom Runtimes in the App Engine Flexible Environment. Deploy the app:

gcloud app deploy

And you're done!
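
For reference, App Engine selects a custom runtime through the app.yaml file in the directory you deploy from. The fragment below is a minimal sketch, assuming the Dockerfile sits next to app.yaml; it is an illustration and the repository's own app.yaml may contain further settings:

```yaml
# Minimal app.yaml sketch for a custom runtime (illustrative, not the project's actual file).
# "runtime: custom" tells App Engine to build the Dockerfile in the same directory.
runtime: custom
# "env: flex" selects the App Engine Flexible Environment.
env: flex
```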

Todo

Integrate TokenWeighter in the RhymeGenerator.
Don't block on model loading or rhyme mutations (use API?).

Owner

GoDataDriven
