squote

A semantic search engine that takes some input text and returns some (questionably) relevant (questionably) famous quotes.

Built with:

Quotes from https://thewebminer.com/.

setup

First, install the necessary dependencies into a python 3 environment of your choice. For instance, to install the deps into a venv, run

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

There are additional native dependencies for FAISS: libomp and libopenblas must be available (see the FAISS repo for install instructions). All other commands should be run from within the virtual environment.

A Makefile is provided to make things nice and easy.

make dirs
make data   # downloads the raw quote data
make model  # downloads ~350MB of BERT weights

running

Before we can run the app, we need embeddings of the quotes. To generate the embeddings and save them in a pickled pandas DataFrame, run the commands below. This will take some time (couple of hours) on CPU.

make serve  # this runs bert-as-a-service
make embed  # this computes the embeddings

Once the embeddings exist, we can run the streamlit app with:

make serve  # not needed if still running from above
make app

Have fun!

Semantic search for quotes.

Related tags

Overview

squote

setup

running

Owner

cjwallace

FewCLUE: 为中文NLP定制的小样本学习测评基准

Searching keywords in PDF file folders

The official repository of the ISBI 2022 KNIGHT Challenge

Simple Text-To-Speech Bot For Discord

This simple Python program calculates a love score based on your and your crush's full names in English

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Conversational-AI-ChatBot - Intelligent ChatBot built with Microsoft's DialoGPT transformer to make conversations with human users!

基于Transformer的单模型、多尺度的VAE模型

Python implementation of TextRank for phrase extraction and summarization of text documents

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

A paper list of pre-trained language models (PLMs).

Text to speech for Vietnamese, ez to use, ez to update

An easier way to build neural search on the cloud

A deep learning-based translation library built on Huggingface transformers

Tool to add main subject to items on Wikidata using a WMFs CirrusSearch for named entity recognition or a manually supplied list of QIDs

An automated program that helps customers of Pizza Palour place their pizza orders

Implementation of paper Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.

CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조