Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

Last update: Nov 12, 2022

Overview

AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

arXiv link: upcoming

To be published in Findings of NAACL 2022

Authors: Chin-Lun Fu*, Zih-Ching Chen*, Yun-Ru Lee, Hung-yi Lee

Overview

In this study, AdapterBias, a surprisingly simple yet effective adapter architecture, is proposed. AdapterBias adds a token-dependent shift to the hidden output of transformer layers to adapt to downstream tasks with only a vector and a linear layer.

Dataset

We use GLUE Benchmark as our dataset. You can download all datasets from the website.

Training

cd src
python exp.py \
    --adapter True \
    --GLUE_path <ur_GLUE_path> \
    --output_path <output_path> \
    --model <model name> \
    --task <the task u want to run> \
    --epoch 100 \
    --lr 0.0001 \
    --max_len 512 \
    --batch_size 32 \

-s or --seed specifies the random seed
-g or --GLUE_path specifies the path of your GLUE dataset.
-o or --output_path specifies the path of saved model and saved predicted file.
-m or --model specifies the pre-trained language model (PLM) you used in training.
- Some examples: bert-base, bert-large, roberta-base, roberta-large
-t or --task specifies the downstream task.
- Some examples: cola, mnli, qnli, qqp, mrpc, rte, sst, sts
-a or --adapter specifies whether you adding our AdapterBias in PLM
--share_alpha specifies whether you share the same alpha in AdapterBias in all transformer layers

Inference

After you run the training, you can automatically get the prediction file in <output_path>/result/. Also, the saved model is in <output_path>/model/.

Running all nine tasks of GLUE benchmark, you can sumbit the prediction files to the website.

Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

Related tags

Overview

AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

Overview

Dataset

Training

Inference

Owner

Allen

Production First and Production Ready End-to-End Keyword Spotting Toolkit

apple's universal binaries BUT MUCH WORSE (PRACTICAL SHITPOST) (NOT PRODUCTION READY)

Code for the paper "Flexible Generation of Natural Language Deductions"

BiNE: Bipartite Network Embedding

A complete NLP guideline for enthusiasts

A single model that parses Universal Dependencies across 75 languages.

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

TFIDF-based QA system for AIO2 competition

PG-19 Language Modelling Benchmark

Wind Speed Prediction using LSTMs in PyTorch

Opal-lang - A WIP programming language based on Python

This repository contains (not all) code from my project on Named Entity Recognition in philosophical text

The aim of this task is to predict someone's English proficiency based on a text input.

String Gen + Word Checker

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

👄 The most accurate natural language detection library for Python, suitable for long and short text alike

TweebankNLP - Pre-trained Tweet NLP Pipeline (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Models + Tweebank-NER

Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization

This repository contains helper functions which can help you generate additional data points depending on your NLP task.

结巴中文分词