The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

Last update: Dec 17, 2022

Overview

This repository contains the implementation of Parameter Differentiation based Multilingual Neural Machine Translation.

Requirements:

apex
fairseq
scikit-learn
pytorch

Process data following https://github.com/pytorch/fairseq/tree/main/examples/translation#multilingual-translation.

Training:

data_path=   # path to the binarized data (used as $data_path below)
lang_pairs=  # comma separated language pairs

fairseq-train $data_path \
    --task parameter_differentiation_task --lang-pairs $lang_pairs --encoder-langtok tgt \
    --criterion label_smoothed_cross_entropy --label-smoothing 0.1 \
    --optimizer adam --lr 0.0015 --adam-betas '(0.9,0.98)' \
    --lr-scheduler inverse_sqrt --warmup-updates 4000 --warmup-init-lr 1e-07 \
    --arch parameter_differentiation_base_model \
    --max-tokens 8192 \
    --user-dir $PWD
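
The `lang_pairs` variable follows the fairseq convention of comma separated `src-tgt` pairs. As a minimal sketch (the helper name `split_pairs` and the example pairs are hypothetical and not part of this repository), the list can be split into source and target codes, which is convenient when decoding each pair in turn:

```shell
# Hypothetical helper: print one "src tgt" line per language pair.
# Assumes each pair has the form src-tgt and that codes contain no comma.
split_pairs() {
    local pairs="$1"
    local IFS=','
    for pair in $pairs; do
        # ${pair%%-*} keeps the text before the first dash (source),
        # ${pair#*-} keeps the text after the first dash (target).
        printf '%s %s\n' "${pair%%-*}" "${pair#*-}"
    done
}

# Example with made-up pairs:
#   split_pairs "en-de,en-fr" | while read -r src tgt; do
#       echo "source=$src target=$tgt"
#   done
```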

Decoding:

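source_lang=  # source language of the pair to decode
target_lang=  # target language of the pair to decode
model_path=   # path to the trained model checkpoint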
fairseq-generate $data_path --path $model_path \
    --task parameter_differentiation_task --lang-pairs $lang_pairs --encoder-langtok tgt \
    --beam 4 --lenpen 0.6 --remove-bpe sentencepiece \
    --source-lang $source_lang --target-lang $target_lang > result.$source_lang-$target_lang.txt
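
The result file mixes source, reference, and hypothesis lines. A minimal sketch for pulling out only the hypotheses, assuming the usual `fairseq-generate` layout in which hypothesis lines look like `H-<id><TAB><score><TAB><text>` (the function name `extract_hyps` is hypothetical):

```shell
# Hypothetical helper: print hypotheses from a fairseq-generate output file,
# restored to the original sentence order.
extract_hyps() {
    grep '^H-' "$1" \
        | sed 's/^H-//' \
        | sort -n -k1,1 \
        | cut -f3-
    # grep keeps hypothesis lines, sed strips the "H-" prefix so the id is
    # numeric, sort orders by sentence id, cut drops the id and score columns.
}

# Usage:
#   extract_hyps result.$source_lang-$target_lang.txt > hyp.$source_lang-$target_lang.txt
```

Sorting by id matters because `fairseq-generate` batches sentences by length, so hypotheses are generally not written in input order.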


Owner

Qian Wang
