Text Classification in Turkish Texts with Bert

Last update: Dec 31, 2022

Overview

You can watch the details of the project on my youtube channel

Project Interface

Project Second Interface

Goal= Correctly guessing the classification of texts and audios

BERT_Text_Classification

It is a text classification task implementation transformers (by HuggingFace) with BERT. It contains several parts:

--Data pre-processing

--BERT tokenization and input formating

--Train with BERT

--Evaluation

--Save and load saved model

Text-classification-transformers

Text classification tasks are most easily encountered in the area of natural language processing and can be used in various ways.

However, the given data needs to be preprocessed and the model's data pipeline must be created according to the preprocessing.

The purpose of this Repository is to allow text classification to be easily performed with Transformers (BERT)-like models if text classification data has been preprocessed into a specific structure.

Implemented based on Huggingfcae transformers for quick and convenient implementation.

Text Classification in Turkish Texts with Bert

Related tags

Overview

You can watch the details of the project on my youtube channel

Project Interface

Project Second Interface

BERT_Text_Classification

Text-classification-transformers

📝 read_dataset

Unique Categories

☄️ Available models

🏴‍☠️ Model Performance

Predictions Vs Actuals

🃏 predictor

97.22 📈

Owner

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost

HAIS_2GNN: 3D Visual Grounding with Graph and Attention

Gpt2-WebAPI - The objective of this API is to provide the 3 best possible responses to sentences that the user would input via http GET request as a parameter

Collection of scripts to pinpoint obfuscated code

Wikipedia-Utils: Preprocessing Wikipedia Texts for NLP

An open source framework for seq2seq models in PyTorch.

BMInf (Big Model Inference) is a low-resource inference package for large-scale pretrained language models (PLMs).

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Text Analysis & Topic Extraction on Android App user reviews

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Sentello is python script that simulates the anti-evasion and anti-analysis techniques used by malware.

NVDA, the free and open source Screen Reader for Microsoft Windows

DVC-NLP-Simple-usecase

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Code and checkpoints for training the transformer-based Table QA models introduced in the paper TAPAS: Weakly Supervised Table Parsing via Pre-training.

Implementation for paper BLEU: a Method for Automatic Evaluation of Machine Translation