Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Last update: Dec 25, 2022

Related tags

Overview

ConSERT

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Requirements

torch==1.6.0
cudatoolkit==10.0.103
cudnn==7.6.5
sentence-transformers==0.3.9
transformers==3.4.0
tensorboardX==2.1
pandas==1.1.5
sentencepiece==0.1.85
matplotlib==3.4.1
apex==0.1.0

Get Started

Download pre-trained language model (e.g. bert-base-uncased) from HuggingFace's Library
Download STS datasets to ./data folder using SentEval toolkit

Run the following script to run the unsupervised experiment:

python3 main.py --no_pair --seed 1 --use_apex_amp --apex_amp_opt_level O1 --batch_size 96 --max_seq_length 64 --evaluation_steps 200 --add_cl --cl_loss_only --cl_rate 0.15 --temperature 0.1 --learning_rate 0.0000005 --train_data stssick --num_epochs 10 --da_final_1 feature_cutoff --da_final_2 shuffle --cutoff_rate_final_1 0.2 --model_name_or_path [PRETRAINED_BERT_FOLDER] --model_save_path ./output/unsup-base-feature_cutoff-shuffle --force_del --no_dropout --patience 10

where [PRETRAINED_BERT_FOLDER] should be replaced to the folder that contains downloaded pre-trained language model

Citation

@article{yan2021consert,
  title={ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer},
  author={Yan, Yuanmeng and Li, Rumei and Wang, Sirui and Zhang, Fuzheng and Wu, Wei and Xu, Weiran},
  journal={arXiv preprint arXiv:2105.11741},
  year={2021}
}

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Related tags

Overview

ConSERT

Requirements

Get Started

Citation

Owner

Yan Yuanmeng

A state-of-the-art semi-supervised method for image recognition

Official PyTorch repo for JoJoGAN: One Shot Face Stylization

Cryptocurrency Prediction with Artificial Intelligence (Deep Learning via LSTM Neural Networks)

fastgradio is a python library to quickly build and share gradio interfaces of your trained fastai models.

Hypersearch weight debugging and losses tutorial

Explaining Deep Neural Networks - A comparison of different CAM methods based on an insect data set

Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes, ICCV 2017

DTCN SMP Challenge - Sequential prediction learning framework and algorithm

A Kernel fuzzer focusing on race bugs

Event-forecasting - Event Forecasting Algorithms With Python

NeurIPS 2021 paper 'Representation Learning on Spatial Networks' code

We simulate traveling back in time with a modern camera to rephotograph famous historical subjects.

Few-shot Relation Extraction via Bayesian Meta-learning on Relation Graphs

Official repository of the paper Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Extension to fastai for volumetric medical data

HuSpaCy: industrial-strength Hungarian natural language processing

UT-Sarulab MOS prediction system using SSL models

Deep learning with TensorFlow and earth observation data.

"Graph Neural Controlled Differential Equations for Traffic Forecasting", AAAI 2022

ISNAS-DIP: Image Specific Neural Architecture Search for Deep Image Prior [CVPR 2022]