A number of methods in order to perform Natural Language Processing on live data derived from Twitter

Last update: Nov 24, 2021

Related tags

Overview

Twitter_NLP

Link to Project: https://twitoff-amadou.herokuapp.com/

==Description==

This project integrates a number of methods in order to perform Natural Language Processing (NLP) on live data derived from Twitter. The goal of this project is to demonstrate how NLP can be used at a basic level to classify hypertext by which Twitter user is most likely to 'tweet' (or post) it. For this project, Twitter API access had been granted, and implemented with the Tweepy wrapper for python.

To start, the web app it built using the Flask platform and is deployed on Heroku. For the functionality of the project, data is extracted from Twitter using its API and the Tweepy library and is fed into SQLAlchemy tables. These tables which hold a variety of information we're concerned with, such as the usernames and past tweeting data, are integrated with our PostgreSQL database. The Spacy library is then responsible for vectorizing our tweets into components our models can operate on. Finally, a random forest classifier is tasked with receiving and training on these vectors.

The interface of the app is quite intuitive. There are two text boxes, one labeled "User to add" and the other, "Tweet text to predict". The user is expected to type a name into the 'add' box, such that Tweepy can add the respective twitter user(s) and their tweeting data to our PostgreSQL database. Our random forest will then train live on the inputted values. Once this has been accomplished with at least two Twitter users in the database, one can add text into the 'predict' box, select the two users they wish to compare and let our model produce a result.

A number of methods in order to perform Natural Language Processing on live data derived from Twitter

Related tags

Overview

Twitter_NLP

==Description==

Owner

A simple Streamlit App to classify swahili news into different categories.

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

What are the best Systems? New Perspectives on NLP Benchmarking

Sequence model architectures from scratch in PyTorch

source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Source code for CsiNet and CRNet using Fully Connected Layer-Shared feedback architecture.

AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems

Watson Natural Language Understanding and Knowledge Studio

Python powered crossword generator with database with 20k+ polish words

Honor's thesis project analyzing whether the GPT-2 model can more effectively generate free-verse or structured poetry.

Wake: Context-Sensitive Automatic Keyword Extraction Using Word2vec

This is the Alpha of Nutte language, she is not complete yet / Essa é a Alpha da Nutte language, não está completa ainda

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

UniSpeech - Large Scale Self-Supervised Learning for Speech

Source code of the "Graph-Bert: Only Attention is Needed for Learning Graph Representations" paper

NLP and Text Generation Experiments in TensorFlow 2.x / 1.x

Host your own GPT-3 Discord bot

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

This is a project of data parallel that running on NLP tasks.