This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly

Franka Emika Panda manipulator kinematics&dynamics simulation

Human Pose Detection on EdgeTPU

Source code of the paper Meta-learning with an Adaptive Task Scheduler.

HybVIO visual-inertial odometry and SLAM system

2D&3D human pose estimation

Static-test - A playground to play with ideas related to testing the comparability of the code

Changing the Mind of Transformers for Topically-Controllable Language Generation

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

Monitor your ML jobs on mobile devices📱, especially for Google Colab / Kaggle

A novel Engagement Detection with Multi-Task Training (ED-MTT) system

Expand human face editing via Global Direction of StyleCLIP, especially to maintain similarity during editing.

Heterogeneous Deep Graph Infomax

⚓ Eurybia monitor model drift over time and securize model deployment with data validation

Vignette is a face tracking software for characters using osu!framework.

TensorFlow implementation of ENet

Faster Convex Lipschitz Regression

Data manipulation and transformation for audio signal processing, powered by PyTorch

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction