NLP tool to extract emotional phrases from tweets 🤩

Last update: Oct 17, 2022

Overview

Emotional phrase extractor

Extracts the phrase in a given text that expresses its sentiment. Capturing sentiment in language matters at a time when decisions and reactions are formed and revised in seconds. But which words actually carry the sentiment? This project aims to answer that question.

Powered by PyTorch + Hugging Face 🤗

Try it out.

git clone https://github.com/shahules786/twitter-emotions.git

cd twitter-emotions

sudo docker build --tag twitter-emotions:api .

sudo docker run -p 9999:9999 -it twitter-emotions:api python twitteremotions/app.py

The server will start on port 9999 of localhost.
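To confirm the container is up before sending requests, a quick port check can help. This is a minimal sketch using only Python's standard library. The helper name `is_listening` is hypothetical and not part of twitter-emotions, and it only tests that something accepts TCP connections on the port, not that the API itself is healthy.

```python
import socket


def is_listening(host: str = "localhost", port: int = 9999, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    print("server reachable:", is_listening())
```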

Installation for development

git clone https://github.com/shahules786/twitter-emotions.git

cd twitter-emotions

pip install -r requirements.txt

Train the model on your own data

from twitteremotions.emotions import TwitterEmotions
emotions = TwitterEmotions()
emotions.train(train_path="data/train.csv", epochs=10, batch_size=32, max_len=168, test_size=0.25)
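The arguments to `train()` determine how much data is held out and how many batches make up an epoch. The sketch below is only arithmetic. It assumes `test_size` is the fraction of rows held out for validation and that the held-out count is rounded up, as scikit-learn's `train_test_split` does. Whether `TwitterEmotions.train` splits the data this way is an assumption, and `split_sizes` is a hypothetical helper, not part of the package.

```python
import math


def split_sizes(n_rows: int, test_size: float, batch_size: int) -> dict:
    """Estimate train/validation row counts and training batches per epoch."""
    # Assumption: held-out rows are rounded up, the rest go to training.
    n_valid = math.ceil(n_rows * test_size)
    n_train = n_rows - n_valid
    # The last batch may be smaller than batch_size, so round up.
    train_batches = math.ceil(n_train / batch_size)
    return {"train_rows": n_train, "valid_rows": n_valid, "train_batches": train_batches}


# With 1,000 rows and the settings used above (test_size=0.25, batch_size=32):
print(split_sizes(1000, test_size=0.25, batch_size=32))
# {'train_rows': 750, 'valid_rows': 250, 'train_batches': 24}
```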

Contributing

All contributions are welcome 👋

Comments

Avoid confusion: end_tokens instead of start_tokens

Replace start_tokens with end_tokens in the fourth argument passed to the loss function, to avoid confusion :)

While reviewing your amazing project, I noticed that the EmotionData class in dataloader.py returns:

{
    ...
    # start_tokens
    "start_tokens": torch.tensor(start_tokens, dtype=torch.long),
    # end_tokens
    "end_tokens": torch.tensor(end_tokens, dtype=torch.long),
}

But in engine.py, start_tokens is passed as both the third and the fourth argument of loss_fn():

loss = loss_fn(
    start,
    end,
    torch.argmax(data["start_tokens"], axis=1),
    torch.argmax(data["start_tokens"], axis=1)
)

The fourth argument should be end_tokens. This minor change will not affect the output of loss_fn(), since the two are equal in all cases [=1]. But to respect conventions and avoid confusion, it would be better to pass end_tokens as the fourth argument.
Opened by zekaouinoureddine
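Whether the change described in this issue is purely cosmetic depends on how the targets are encoded. The following is a minimal sketch in plain Python (no torch, so it runs anywhere), under loudly stated assumptions: start_tokens and end_tokens are one-hot position vectors, and the loss is the sum of a start and an end cross-entropy. The `loss_fn` below is a hypothetical stand-in, not the repository's implementation, and the logits are made up.

```python
import math


def argmax(values):
    """Index of the largest value (first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def cross_entropy(logits, target_index):
    """Cross-entropy for one example: -log softmax(logits)[target_index]."""
    peak = max(logits)  # subtract the max for numerical stability
    log_sum_exp = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_sum_exp - logits[target_index]


def loss_fn(start_logits, end_logits, start_target, end_target):
    """Hypothetical span loss: start cross-entropy plus end cross-entropy."""
    return cross_entropy(start_logits, start_target) + cross_entropy(end_logits, end_target)


# Assumed encoding: one-hot markers for a 5-token tweet whose phrase spans tokens 1..3.
start_tokens = [0, 1, 0, 0, 0]
end_tokens = [0, 0, 0, 1, 0]

# Made-up model outputs that favour the correct start and end positions.
start_logits = [0.1, 2.0, 0.3, 0.2, 0.1]
end_logits = [0.1, 0.2, 0.3, 2.0, 0.1]

# Fourth argument taken from end_tokens, as the issue proposes.
fixed = loss_fn(start_logits, end_logits, argmax(start_tokens), argmax(end_tokens))
# Fourth argument taken from start_tokens, as the issue reports for engine.py.
as_reported = loss_fn(start_logits, end_logits, argmax(start_tokens), argmax(start_tokens))

print(round(as_reported - fixed, 6))  # 1.8
```

Under these assumptions the two calls agree only when the phrase is a single token, because only then do the start and end indices coincide. If the repository encodes its targets differently, the two calls may be equivalent, as the issue states.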

Releases (v1.0.0)

v1.0.0 (May 17, 2021)

Trained RoBERTa base weights for twitter-emotions.
Source code (tar.gz)
Source code (zip)
emotion_torch.pth (475.54 MB)
pytorch_model.bin (477.98 MB)

Owner

Shahul ES

Data Scientist | Kaggle Grandmaster (Rank 20) | Open source @mljar
