Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Last update: Oct 14, 2022

Overview

About this repository

This repo contains an Pytorch implementation for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks. The code framework is based on TextBox.

Environment

python >= 3.8.11
torch >= 1.6.0

Run install.sh to install other requirements.

Dataset

The processed dataset can be downloaded from Google Drive. Once finished, unzip the datafiles (train.src, train.tgt, ...) to ./data.

An overview of dataset: train: 287113 cases, dev: 13368 cases, test: 11490 cases

Paramters

# overall settings
data_path: 'data/'
checkpoint_dir: 'saved/'
generated_text_dir: 'generated/'
# dataset settings
max_vocab_size: 50000
src_len: 400
tgt_len: 100

# model settngs
decoding_strategy: 'beam_search'
beam_size: 4
is_attention: True
is_pgen: True
is_coverage: True
cov_loss_lambda: 1.0

Log file is located in ./log, more details can be found in yamls.

Note: Distributed Data Parallel (DDP) is not supported yet.

Train & Evaluation

From scratch run `fire.py`.

if __name__ == '__main__':
    config = Config(config_dict={'test_only': False,
                                 'load_experiment': None})
    train(config)

If you want to resume from a checkpoint, just set the 'load_experiment': './saved/$model_name$.pth'. Similarly, when 'test_only' is set to True, 'load_experiment' is required.

Results

The best model is trained on a TITAN Xp GPU (8GB usage).

Training loss

Ablation study

Model	Rouge-1	Rouge-2	Rouge-L
Seq2Seq	22.17	7.20	20.97
Seq2Seq+attn	29.35	12.58	27.38
Seq2Seq+attn+pgen	36.04	15.87	32.92
Seq2Seq+attn+pgen+coverage	39.52	17.85	36.40

Note: The architecture of the Seq2Seq model is based on lstm, I hope I can replace it with transformer in the future.

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run `fire.py`.

Results

Training loss

Ablation study

Owner

wxDai

This repository contains tutorials for the py4DSTEM Python package

Training a Resilient Q-Network against Observational Interference, Causal Inference Q-Networks

GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications

The PyTorch implementation of DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.

Painting app using Python machine learning and vision technology.

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Python code to generate art with Generative Adversarial Network

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

A benchmark dataset for emulating atmospheric radiative transfer in weather and climate models with machine learning (NeurIPS 2021 Datasets and Benchmarks Track)

Implementation of paper: "Image Super-Resolution Using Dense Skip Connections" in PyTorch

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

deep learning model that learns to code with drawing in the Processing language

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

Zalo AI challenge 2021 task hum to song

Secure Distributed Training at Scale

Finding Donors for CharityML

Class-Attentive Diffusion Network for Semi-Supervised Classification [AAAI'21] (official implementation)

PyTorch implementations of the paper: "Learning Independent Instance Maps for Crowd Localization"

Jingju baseline - A baseline model of our project of Beijing opera script generation

✅ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run fire.py.

Results

Training loss

Ablation study

Owner

wxDai

This repository contains tutorials for the py4DSTEM Python package

Training a Resilient Q-Network against Observational Interference, Causal Inference Q-Networks

GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications

The PyTorch implementation of DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.

Painting app using Python machine learning and vision technology.

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Python code to generate art with Generative Adversarial Network

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

A benchmark dataset for emulating atmospheric radiative transfer in weather and climate models with machine learning (NeurIPS 2021 Datasets and Benchmarks Track)

Implementation of paper: "Image Super-Resolution Using Dense Skip Connections" in PyTorch

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

deep learning model that learns to code with drawing in the Processing language

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

Zalo AI challenge 2021 task hum to song

Secure Distributed Training at Scale

Finding Donors for CharityML

Class-Attentive Diffusion Network for Semi-Supervised Classification [AAAI'21] (official implementation)

PyTorch implementations of the paper: "Learning Independent Instance Maps for Crowd Localization"

Jingju baseline - A baseline model of our project of Beijing opera script generation

✅ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.

From scratch run `fire.py`.