Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Last update: Jan 04, 2023

Related tags

Text Data & NLP PABEE

Overview

Patience-based Early Exit

NEWS: We now have a better and tidier implementation integrated into Hugging Face transformers!

Citation

If you use this code in your research, please cite our paper:

@inproceedings{zhou2020bert,
 author = {Zhou, Wangchunshu and Xu, Canwen and Ge, Tao and McAuley, Julian and Xu, Ke and Wei, Furu},
 booktitle = {Advances in Neural Information Processing Systems},
 pages = {18330--18341},
 publisher = {Curran Associates, Inc.},
 title = {BERT Loses Patience: Fast and Robust Inference with Early Exit},
 url = {https://proceedings.neurips.cc/paper/2020/file/d4dd111a4fd973394238aca5c05bebe3-Paper.pdf},
 volume = {33},
 year = {2020}
}

Requirement

Our code is built on huggingface/transformers, so you must clone and install huggingface/transformers before using it.

Training

You can fine-tune a pretrained language model and train the internal classifiers by configuring and running finetune_bert.sh and finetune_albert.sh.
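
To illustrate what training the internal classifiers involves, here is a minimal sketch of how the per-layer losses might be combined into one training objective. It assumes a layer-weighted average in which deeper classifiers count more, which is our reading of the paper rather than something the scripts above document. The function name `weighted_internal_loss` and its input are hypothetical and are not part of this repository.

```python
from typing import Sequence


def weighted_internal_loss(layer_losses: Sequence[float]) -> float:
    """Combine per-layer classifier losses into a single scalar.

    Assumption (not confirmed by this README): the loss of the classifier
    attached to layer j is weighted by j, and the sum is normalised by the
    total weight 1 + 2 + ... + n.
    """
    if not layer_losses:
        raise ValueError("layer_losses must contain at least one loss")
    # Weight each loss by its 1-based layer index.
    weighted_sum = sum(j * loss for j, loss in enumerate(layer_losses, start=1))
    # Normalise so that equal losses at every layer return that same value.
    total_weight = sum(range(1, len(layer_losses) + 1))
    return weighted_sum / total_weight


# Hypothetical losses from three internal classifiers, shallow to deep.
combined = weighted_internal_loss([0.9, 0.6, 0.3])
```

With this weighting, an error made by a deep classifier contributes more to the objective than the same error made by a shallow one.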

Inference

You can run inference with different patience settings by configuring and running patience_infer_albert.sh and patience_infer_bert.sh.
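
The effect of the patience setting can be sketched in a few lines of Python. This is an illustration of the exit rule only, not the code used by the scripts: it assumes the predictions of the internal classifiers are already available as a list, and that inference stops once a prediction has matched the previous layer's prediction `patience` times in a row. The function name `patience_early_exit` is hypothetical.

```python
from typing import Sequence, Tuple


def patience_early_exit(layer_predictions: Sequence[int], patience: int) -> Tuple[int, int]:
    """Return (prediction, layers_used) under a patience-based exit rule.

    Assumption: the counter increases each time a layer's prediction equals
    the previous layer's prediction and resets to zero otherwise.
    """
    if patience < 1:
        raise ValueError("patience must be at least 1")
    if not layer_predictions:
        raise ValueError("layer_predictions must not be empty")

    streak = 0
    previous = None
    for depth, prediction in enumerate(layer_predictions, start=1):
        if previous is not None and prediction == previous:
            streak += 1
        else:
            streak = 0  # prediction changed, so start counting again
        previous = prediction
        if streak == patience:
            return prediction, depth  # exit early at this layer

    # Patience was never reached: fall back to the final classifier.
    return layer_predictions[-1], len(layer_predictions)


# Hypothetical class predictions from six internal classifiers.
label, layers_used = patience_early_exit([1, 0, 0, 0, 1, 1], patience=2)
```

A smaller patience exits earlier and saves more computation, while a larger patience waits for more agreement between layers before committing to a prediction.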

Bug Report and Contribution

If you'd like to contribute by adding more tasks (only GLUE is available at the moment), please submit a pull request and contact me. If you find a problem or bug, please open an issue. Thanks!

Owner

Kevin Canwen Xu
