Code for the modular summarization work published at ACL 2021 by Krishna et al.

Last update: Nov 24, 2022

Overview

This repository contains the code for running the modular summarization pipelines described in the publication:
Krishna K, Khosla K, Bigham J, Lipton ZC. "Generating SOAP Notes from Doctor-Patient Conversations." ACL 2021.

Instructions

Although we cannot release the models trained on the confidential medical data, we have released models trained on the publicly available AMI dataset.
To reproduce the results on the AMI dataset, follow the steps listed below. For convenience, we have also created a Google Colab notebook that runs these steps on Google's servers (free of cost as of June 2021) and produces the summaries and their ROUGE scores.

Step 1: Set up the environment by using pip to install the packages listed in requirements.txt.

Step 2: Download the ami_models folder from this link and put it at the root of the repository.

Step 3: Run the following three commands to prepare the data, run the summary generation pipelines, and show the resulting ROUGE scores.

# command1: downloads and preprocesses the AMI dataset
./prepare_data.sh

# command2: runs the summarization pipelines on the data and computes ROUGE scores
# (before running this command, you need to download the models as shown above)
./predict_ami.sh

# command3: prints the results
python show_results.py
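
The three steps above can also be driven from one small Python script. This is only a sketch and not part of the repository: the helper names (`missing_prerequisites`, `pipeline_commands`, `run_pipeline`) are hypothetical, and it assumes the scripts are executable and are run from the repository root. It checks that the files and folders named in the steps are present, then runs the three commands in order and stops at the first failure.

```python
import subprocess
import sys
from pathlib import Path

# Files and folders named in Steps 1-3; checked relative to the repository root.
REQUIRED_PATHS = [
    "requirements.txt",
    "ami_models",
    "prepare_data.sh",
    "predict_ami.sh",
    "show_results.py",
]


def missing_prerequisites(repo_root):
    """Return the required paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [name for name in REQUIRED_PATHS if not (root / name).exists()]


def pipeline_commands(python_executable=sys.executable):
    """Return the three commands from Step 3, in the order they must run."""
    return [
        ["./prepare_data.sh"],
        ["./predict_ami.sh"],
        # Assumption: the current interpreter stands in for plain `python`.
        [python_executable, "show_results.py"],
    ]


def run_pipeline(repo_root=".", runner=subprocess.run):
    """Run the three commands in order, stopping at the first failure."""
    missing = missing_prerequisites(repo_root)
    if missing:
        raise FileNotFoundError("Missing before running: " + ", ".join(missing))
    for command in pipeline_commands():
        # check=True raises CalledProcessError on a non-zero exit code,
        # so a later step never runs on top of a failed one.
        runner(command, cwd=repo_root, check=True)
```

Calling `run_pipeline()` from the repository root, after Steps 1 and 2, would then be equivalent to typing the three commands by hand. The `runner` parameter exists only so the sequence can be exercised without actually launching the scripts.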

Owner

Approximately Correct Machine Intelligence (ACMI) Lab
