A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation of the paper A Structured Self-Attentive Sentence Embedding, published at ICLR 2017: https://arxiv.org/abs/1703.03130

USAGE:

For binary sentiment classification on the IMDB dataset, run: python classification.py "binary"

For multiclass classification on the Reuters dataset, run: python classification.py "multiclass"

You can change the model parameters in the model_params.json file. Other training parameters, such as the number of attention hops, can be configured in the config.json file.

To use pretrained GloVe embeddings, set the use_embeddings parameter to "True" (the default is False). Do not forget to download glove.6B.50d.txt and place it in the glove folder.
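For orientation, here is a minimal sketch of how a GloVe text file such as glove.6B.50d.txt can be parsed. Each line of the file holds a token followed by its vector components, separated by spaces. The function name load_glove and the tiny demo file are assumptions made for illustration; this is not the loader used in classification.py.

```python
import os
import tempfile


def load_glove(path, dim=50):
    """Parse a GloVe text file into a dict mapping word -> list of floats."""
    vectors = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            parts = line.rstrip().split(" ")
            if len(parts) != dim + 1:
                continue  # skip blank or malformed lines
            vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


# Demo on a tiny hypothetical file with 3-dimensional vectors.
demo_dir = tempfile.mkdtemp()
demo_path = os.path.join(demo_dir, "demo_vectors.txt")
with open(demo_path, "w", encoding="utf-8") as handle:
    handle.write("good 0.1 0.2 0.3\n")
    handle.write("bad -0.1 -0.2 -0.3\n")

demo_vectors = load_glove(demo_path, dim=3)
```

For the real file, the call would be load_glove("glove/glove.6B.50d.txt", dim=50), assuming the file sits in the glove folder as described above.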

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights

Instead of pruning, averaging over the sentence embeddings is used.
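The pieces listed above can be sketched in a few lines of NumPy. This follows the formulation in the paper: the attention matrix is A = softmax(W_s2 tanh(W_s1 H^T)), the sentence embedding matrix is M = A H, and the penalty is the squared Frobenius norm of A A^T - I. The function names, the shapes in the demo, and the reading of "averaging" as a mean over the attention hops are assumptions; the repository's own code may differ.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(H, W_s1, W_s2):
    """H: (n, 2u) hidden states, W_s1: (d_a, 2u), W_s2: (r, d_a).

    Returns A with shape (r, n) and M with shape (r, 2u).
    """
    A = softmax(W_s2 @ np.tanh(W_s1 @ H.T), axis=1)  # each hop sums to 1 over tokens
    M = A @ H
    return A, M


def frobenius_penalty(A):
    """Squared Frobenius norm of (A A^T - I); small when hops attend to different tokens."""
    diff = A @ A.T - np.eye(A.shape[0])
    return float(np.sum(diff ** 2))


def average_hops(M):
    """Assumed reading of 'averaging': mean over the r hop embeddings."""
    return M.mean(axis=0)


def clip_by_global_norm(grads, max_norm):
    """Rescale a list of gradient arrays so their joint L2 norm is at most max_norm."""
    total = np.sqrt(sum(float(np.sum(g ** 2)) for g in grads))
    scale = min(1.0, max_norm / (total + 1e-12))
    return [g * scale for g in grads]


# Demo with hypothetical sizes: 6 tokens, hidden size 8, d_a = 5, r = 3 hops.
rng = np.random.default_rng(0)
H = rng.normal(size=(6, 8))
W_s1 = rng.normal(size=(5, 8))
W_s2 = rng.normal(size=(3, 5))

A, M = self_attention(H, W_s1, W_s2)
sentence_vector = average_hops(M)
penalty = frobenius_penalty(A)
clipped = clip_by_global_norm([np.array([3.0, 4.0])], max_norm=1.0)
```

In training, the penalty would be added to the classification loss with a coefficient, and clipping would be applied to the gradients before each parameter update.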

Visualization:

After training, the model is tested on 100 test points. The attention weights for these 100 test points are retrieved and visualized over the text as heatmaps. A file visualization.html is saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.
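To illustrate the idea of a heatmap over text, here is a minimal sketch that turns tokens and their attention weights into HTML, with a stronger background color for higher weights. It is not the visualization code by Zhouhan Lin; the function names render_attention and render_page, the color scheme, and the sample data are assumptions.

```python
import html


def render_attention(tokens, weights):
    """Return one HTML line where each token's red background opacity tracks its weight."""
    if len(tokens) != len(weights):
        raise ValueError("tokens and weights must have the same length")
    top = max(weights) if weights else 0.0
    spans = []
    for token, weight in zip(tokens, weights):
        alpha = weight / top if top > 0 else 0.0  # scale so the top token is fully opaque
        spans.append(
            '<span style="background-color: rgba(255, 0, 0, {:.2f})">{}</span>'.format(
                alpha, html.escape(token)
            )
        )
    return " ".join(spans)


def render_page(examples):
    """examples: list of (tokens, weights) pairs, one paragraph per example."""
    rows = ["<p>{}</p>".format(render_attention(t, w)) for t, w in examples]
    return "<html><body>\n{}\n</body></html>".format("\n".join(rows))


# Hypothetical example: one short review with made-up weights.
sample_line = render_attention(["a", "great", "<movie>"], [0.1, 0.5, 0.4])
sample_page = render_page([(["a", "great", "<movie>"], [0.1, 0.5, 0.4])])
```

Writing sample_page to a file gives a page that can be opened in a browser, similar in spirit to the visualization.html described above.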

Below is a screenshot of the visualization on a few data points.

Training accuracy: 93.4%. Tested on 1000 points with 90.2% accuracy.

Owner

Kaushal Shetty
