Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Last update: May 18, 2022

Related tags

Deep Learning multiDDS

Overview

Balancing Training for Multilingual Neural Machine Translation

Implementation of the paper

Balancing Training for Multilingual Neural Machine Translation

Xinyi Wang, Yulia Tsvetkov, Graham Neubig

Data:

The preprocessed and binarized data for fairseq can be downloaded here

To process data from scrach, see the script

util_scripts/prepare_multilingual_data.sh

Training Scripts:

The training scripts for many-to-one translation of the related language group (Related M2O) is under the directory job_scripts/related_ted8_m2o/.

Our methods:

MultiDDS-S:

job_scripts/related_ted8_m2o/multidds_s.sh

MultiDDS:

job_scripts/related_ted8_m2o/multidds.sh

Baselines:

Proportional:

job_scripts/related_ted8_m2o/proportional.sh

Temperature:

job_scripts/related_ted8_m2o/temperature.sh

The scripts for Related O2M is under the directory job_scripts/related_ted8_o2m/

The scripts for Diverse M2O is under the directory job_scripts/diverse_ted8_m2o/

The scripts for Diverse O2M is under the directory job_scripts/diverse_ted8_o2m/

Inference Scripts:

Each of the experiment script directory contains a trans.sh file to translate the test set. To translate the test set for the Related M2O MultiDDS-S

job_scripts/related_ted8_m2o/trans.sh checkpoints/related_ted8_m2o/multidds_s/

To translate other experiment, simply replace the argument with the experiment checkpoint directory.

Citation

Please cite as:

@inproceedings{wang2020multiDDS,
  title = {Balancing Training for Multilingual Neural Machine Translation},
  author = {Xinyi Wang, Yulia Tsvetkov, Graham Neubig},
  booktitle = {ACL},
  year = {2020},
}

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Related tags

Overview

Balancing Training for Multilingual Neural Machine Translation

Data:

Training Scripts:

Inference Scripts:

Citation

Owner

Xinyi Wang

A Deep Learning based project for creating line art portraits.

Contrastively Disentangled Sequential Variational Audoencoder

PyTorch implementation of "Dataset Knowledge Transfer for Class-Incremental Learning Without Memory" (WACV2022)

Learning to Adapt Structured Output Space for Semantic Segmentation, CVPR 2018 (spotlight)

Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation

Deep Probabilistic Programming Course @ DIKU

Conversational text Analysis using various NLP techniques

This git repo contains the implementation of my ML project on Heart Disease Prediction

Official implementation of the Neurips 2021 paper Searching Parameterized AP Loss for Object Detection.

Code related to the manuscript "Averting A Crisis In Simulation-Based Inference"

An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.

Deep learning image registration library for PyTorch

Learn about Spice.ai with in-depth samples

Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

An index of algorithms for learning causality with data

A PyTorch Implementation of ViT (Vision Transformer)

Open source person re-identification library in python

Code and description for my BSc Project, September 2021

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch

OverFeat is a Convolutional Network-based image classifier and feature extractor.