NLP Algorithms

Description

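This repository covers five mainstream NLP tasks: text classification, sequence labeling, relation extraction, text matching, and text similarity matching. It includes 22 related models and algorithms.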

Framework Structure

File Structure

all_models
├── Base_line
│   ├── __init__.py
│   ├── base_data_process.py
│   ├── base_evaluation.py
│   └── single_tokenizer.py
│
├── Texts_Classification
│   ├── 机器学习_文本分类
│   ├── fasttext_文本分类
│   ├── textcnn_文本分类
│   ├── lstm_文本分类
│   ├── han_文本分类
│   ├── bert_文本分类
│   └── 数据准备
│
├── Sequence_Labeling
│   ├── crf_suite
│   ├── lstm_crf
│   ├── bert_lstm_crf
│   ├── bert_mrc
│   └── 数据准备
│
├── Relation_Extraction
│   ├── CasRel
│   ├── multihead_joint_extraction
│   ├── R-bert_relation_recognition
│   ├── attention_lstm_relation_recognition
│   ├── attention_lstm_relation_recognition_for_single_sentence
│   ├── tagging_scheme_joint_extraction
│   ├── entity_extraction_bert_lstm_crf
│   └── 数据准备
│
├── Text_Matching
│   ├── DSSM
│   ├── ARC-II
│   ├── ESIM
│   ├── bert
│   └── 数据准备
│
├── Text_Similarity_Matching
│   ├── tfidf
│   ├── BM25
│   ├── pysparnn
│   └── commodity_title.txt
│
├── 记录
├── .gitignore
└── README.md
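
To illustrate the kind of task the Text_Similarity_Matching directory targets, here is a minimal sketch of TF-IDF title matching using only the Python standard library. It is not the repository's implementation: the function names, the whitespace tokenizer, the smoothed IDF formula, and the sample titles are all assumptions made for illustration. The format of commodity_title.txt is not documented here, so the sketch works on an in-memory list instead.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: simple lowercase whitespace tokenization.
    # Chinese titles would need a word segmenter instead.
    return text.lower().split()


def build_idf(docs_tokens):
    # Document frequency: number of documents containing each term.
    n = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))
    # Smoothed IDF (an assumed variant) so every known term gets a positive weight.
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(tokens, idf):
    # Sparse vector as a dict; terms missing from the corpus are ignored.
    tf = Counter(tokens)
    return {t: (c / len(tokens)) * idf[t] for t, c in tf.items() if t in idf}


def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, titles):
    # Returns the best-matching title and its cosine score.
    docs = [tokenize(t) for t in titles]
    idf = build_idf(docs)
    vectors = [tfidf_vector(d, idf) for d in docs]
    q = tfidf_vector(tokenize(query), idf)
    scores = [cosine(q, v) for v in vectors]
    best = max(range(len(titles)), key=scores.__getitem__)
    return titles[best], scores[best]


if __name__ == "__main__":
    # Hypothetical sample titles, not taken from commodity_title.txt.
    sample_titles = [
        "red cotton t-shirt",
        "blue denim jeans",
        "wireless bluetooth headphones",
    ]
    print(most_similar("cotton shirt red", sample_titles))
```

A query that shares no terms with any title scores 0.0 against every entry, so callers may want to apply a minimum score threshold before trusting the returned match.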

Basic NLP tasks

Owner

zuxinqi
