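[About NLP] Things You Didn't Know (【关于 NLP】那些你不知道的事)

Authors: 杨夕, 芙蕖, 李玲, 陈海顺, twilight, LeoLRH, JimmyDU, 艾春辉, 张永泰, 金金金

Introduction

This project collects study notes and materials for natural language processing (NLP) interview preparation, compiled by the authors from their own interviews and experience. It currently contains interview questions gathered from across the areas of NLP.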

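Table of Contents

1. [About Basic Algorithms] Things You Didn't Know

2. [About Machine Learning Algorithms] Things You Didn't Know

3. [About Deep Learning Algorithms] Things You Didn't Know

4. [About NLP Learning Algorithms] Things You Didn't Know

4.1 [About Information Extraction] Things You Didn't Know

4.1.1 [About Named Entity Recognition] Things You Didn't Know

4.1.2 [About Relation Extraction] Things You Didn't Know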

4.1.3 [About Event Extraction] Things You Didn't Know

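4.2 [About NLP Pre-training Algorithms] Things You Didn't Know

4.3 [About Text Classification] Things You Didn't Know

4.4 [About Text Matching] Things You Didn't Know

4.5 [About Question Answering Systems] Things You Didn't Know

4.5.1 [About FAQ Retrieval-based Question Answering Systems] Things You Didn't Know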

4.5.2 [About Question Answering System Tools] Things You Didn't Know

[About Faiss] Things You Didn't Know

4.6 [About Dialogue Systems] Things You Didn't Know

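4.7 [About Knowledge Graphs] Things You Didn't Know

4.7.1 [About Knowledge Graphs] Things You Didn't Know

4.7.2 [About KBQA] Things You Didn't Know

4.7.3 [About Neo4j] Things You Didn't Know

4.8 [About Text Summarization] Things You Didn't Know

4.9 [About Knowledge Representation Learning] Things You Didn't Know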

5. [About NLP Tricks] Things You Didn't Know

5.1 [About Few-shot Problems] Things You Didn't Know

5.2 [About Dirty Data] Things You Didn't Know

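[About Handling "Dirty Data"] Things You Didn't Know
- 1. Motivation
  - 1.1 What is "dirty data"?
  - 1.2 What consequences does "dirty data" have?
- 2. Handling "dirty data"
  - 2.1 How should "dirty data" be handled?
  - 2.2 Confident learning methods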

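5.3 [About the "Alchemy Furnace" (Model Training and Tuning)] Things You Didn't Know

[About Setting batch_size] Things You Didn't Know
- 1. When training a model, how should batch_size and the learning rate be set?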

6. [About Python] Things You Didn't Know

7. [About TensorFlow] Things You Didn't Know

[About TensorFlow Loss Functions] Things You Didn't Know
