Chinese question generator, built on the Delta Electronics reading comprehension dataset (DRCD)

Last update: Oct 22, 2021

Overview

Transformer QG on DRCD

The model input follows the answer-highlighting format described below.

We integrate the context C and the answer A into a new C' in the following form:
C' = [c1, c2, ..., [HL], a1, ..., a|A|, [HL], ..., c|C|]

Proposed by Ying-Hong Chan & Yao-Chung Fan (2019), A Recurrent BERT-based Model for Question Generation.
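The highlight format can be sketched in a few lines of Python. This is only an illustration, not code from this repository: the function name highlight_answer and its arguments are hypothetical, and it assumes the answer is given as a character offset into the context (as in SQuAD-style annotations).

```python
HL_TOKEN = "[HL]"  # highlight marker used in the C' format


def highlight_answer(context: str, answer: str, answer_start: int) -> str:
    """Wrap the answer span inside the context with [HL] markers.

    Hypothetical helper: `answer_start` is assumed to be the character
    offset of `answer` inside `context`.
    """
    answer_end = answer_start + len(answer)
    # Guard against offsets that do not actually point at the answer text.
    if answer_start < 0 or context[answer_start:answer_end] != answer:
        raise ValueError("answer does not occur in context at answer_start")
    return (
        context[:answer_start]
        + HL_TOKEN + answer + HL_TOKEN
        + context[answer_end:]
    )
```

Calling highlight_answer(context, answer, 0) on a context that begins with the answer yields a string that starts with [HL], followed by the answer, a second [HL], and then the rest of the context.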

We also have a separate English QG project: Transformer-QG-on-SQuAD

Features

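A complete pipeline, from fine-tuning to model scoring
Supports many advanced language models
Built-in Flask, so the model can quickly be served as an API server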

DRCD dataset

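The Delta Reading Comprehension Dataset (DRCD) is a general-domain Traditional Chinese machine reading comprehension dataset. It contains 10,014 paragraphs drawn from 2,108 Wikipedia articles, with more than 30,000 questions annotated on those paragraphs.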

Available models

BART (based on uer/bart-base-chinese-cluecorpussmall)

Use in Transformers

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Load the fine-tuned question-generation model and its tokenizer
tokenizer = AutoTokenizer.from_pretrained("p208p2002/bart-drcd-qg-hl")
model = AutoModelForSeq2SeqLM.from_pretrained("p208p2002/bart-drcd-qg-hl")
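Once the tokenizer and model are loaded, generating a question might look roughly like the sketch below. It is not this repository's own inference code: the helper name generate_question, the beam-search settings, and the whitespace cleanup are all assumptions made for illustration. It also assumes that the tokenizer already knows the [HL] marker and that the input has been highlighted beforehand.

```python
def generate_question(tokenizer, model, highlighted_context,
                      max_length=64, num_beams=4):
    """Generate one question from a context whose answer is wrapped in [HL].

    Hypothetical helper; `max_length` and `num_beams` are illustrative
    defaults, not values taken from this project.
    """
    inputs = tokenizer(highlighted_context, return_tensors="pt", truncation=True)
    # Pass only the tensors the seq2seq model is known to accept; some
    # tokenizers also return token_type_ids, which generate() may reject.
    output_ids = model.generate(
        input_ids=inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        max_length=max_length,
        num_beams=num_beams,
    )
    text = tokenizer.decode(output_ids[0], skip_special_tokens=True)
    # Assumption: the decoded Chinese tokens are space-separated, so the
    # spaces are dropped. This would also remove spaces between Latin words.
    return text.replace(" ", "")
```

The function takes the tokenizer and model as arguments, so it can be called as generate_question(tokenizer, model, text) with the objects loaded above.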

Experiments

Model	BLEU-1	BLEU-2	BLEU-3	BLEU-4	METEOR	ROUGE-L
BART-HLSQG	34.25	27.70	22.43	18.13	23.58	36.88

Environment requirements

The whole project was developed on Ubuntu.

If you don't have PyTorch 1.6+, please install or update it first:

https://pytorch.org/get-started/locally/

Install packages: pip install -r requirements.txt
Set up the scorer: python setup_scorer.py
Download the dataset: python init_dataset.py

Training

Seq2Seq LM

usage: train_seq2seq_lm.py [-h]
                           [--base_model {bert-base-chinese,uer/bart-base-chinese-cluecorpussmall,p208p2002/bart-drcd-qg-hl}]
                           [-d {drcd}] [--batch_size BATCH_SIZE]
                           [--epoch EPOCH] [--lr LR] [--dev DEV] [--server]
                           [--run_test] [-fc FROM_CHECKPOINT]

optional arguments:
  -h, --help            show this help message and exit
  --base_model {bert-base-chinese,uer/bart-base-chinese-cluecorpussmall,p208p2002/bart-drcd-qg-hl}
  -d {drcd}, --dataset {drcd}
  --batch_size BATCH_SIZE
  --epoch EPOCH
  --lr LR
  --dev DEV
  --server
  --run_test
  -fc FROM_CHECKPOINT, --from_checkpoint FROM_CHECKPOINT

Run as API server

From the pre-trained model (recommended)

python train_seq2seq_lm.py --server --base_model p208p2002/bart-drcd-qg-hl

From your own checkpoint

python train_xxx_lm.py --server --base_model YOUR_BASE_MODEL --from_checkpoint FROM_CHECKPOINT

Request example

curl --location --request POST 'http://127.0.0.1:5000/' \
--header 'Content-Type: application/x-www-form-urlencoded' \
--data-urlencode 'context=[HL]伊隆·里夫·馬斯克[HL]是一名企業家和商業大亨'

{"predict": "哪一個人是一名企業家和商業大亨?"}


Owner

Philip
