Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

Last update: Apr 14, 2022

Related tags

Text Data & NLP AppleLM

Overview

AppleLM

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles (TASLP 2022)

Setup

This implementation is based on Transformers.

Preparation

Download GLUE datasets

The datasets can be downloaded automatically. Please refer to https://github.com/nyu-mll/GLUE-baselines

git clone https://github.com/nyu-mll/GLUE-baselines.git
python download_glue_data.py --data_dir glue_data --tasks all

It is recommended to put the folder glue_data to data/. The architecture looks like:

AppleLM
└───data
│   └───glue_data
│       │   CoLA/
│       │   MRPC/
│       │   ...

Visual Features

Pre-extracted visual features can be downloaded from Google Drive borrowed from the repo Multi30K.

The features are used in image embedding layer for indexing. Extract train-resnet50-avgpool.npy and put it in the data/ folder.

Training & Evaluate

export GLUE_DIR=data/glue_data/
export CUDA_VISIBLE_DEVICES="0"
export TASK_NAME=CoLA
python ./examples/run_glue_visual-tfidf_att.py \
    --model_type bert \
    --model_name_or_path bert-large-uncased-whole-word-masking \
    --task_name $TASK_NAME \
    --do_eval \
    --do_lower_case \
    --data_dir $GLUE_DIR/$TASK_NAME \
    --max_seq_length 128 \
    --per_gpu_eval_batch_size=32   \
    --per_gpu_train_batch_size=16   \
    --learning_rate 1e-5 \
    --eval_all_checkpoints \
    --save_steps 500 \
    --max_steps 5336 \
    --warmup_steps 320 \
    --image_dir data/train.lc.norm.tok.en \
    --image_embedding_file data/train-resnet50-avgpool.npy \
    --num_img 3 \
    --tfidf 5 \
    --image_merge att-gate \
    --stopwords_dir data/stopwords-en.txt \
    --output_dir experiments/CoLA_bert_wwm

Reference

Please kindly cite this paper in your publications if it helps your research:

@ARTICLE{zhang2022which,
  author={Zhang, Zhuosheng and Yu, Haojie and Zhao, Hai and Utiyama, Masao},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, 
  title={Which Apple Keeps Which Doctor Away? Colorful Word Representations With Visual Oracles}, 
  year={2022},
  volume={30},
  number={},
  pages={49-59},
  doi={10.1109/TASLP.2021.3130972}
}

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

Related tags

Overview

AppleLM

Setup

Preparation

Training & Evaluate

Reference

Owner

Zhuosheng Zhang

Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.

Interactive Jupyter Notebook Environment for using the GPT-3 Instruct API

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.

Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.

Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

Under the hood working of transformers, fine-tuning GPT-3 models, DeBERTa, vision models, and the start of Metaverse, using a variety of NLP platforms: Hugging Face, OpenAI API, Trax, and AllenNLP

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)

Speech Recognition Database Management with python

Natural language computational chemistry command line interface.

2021海华AI挑战赛·中文阅读理解·技术组·第三名

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

TPlinker for NER 中文/英文命名实体识别

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

sangha, pronounced "suhng-guh", is a social networking, booking platform where students and teachers can share their practice.

"Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021

A program that uses real statistics to choose the best times to bet on BloxFlip's crash gamemode