JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Last update: Oct 26, 2022

Related tags

Deep Learning JASS

Overview

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

This the repository for this paper.

Find extensions of this work and new pre-trained models here: code, paper

Requirements

Install OpenNMT-py (1.0) and subword-nmt.

pip install OpenNMT-py
pip install subword-nmt

Pre-trained JASS models

We release JASS models on 2 language pairs: ja+en, ja+ru. For Japanese seq2seq pretraining, we use our proposed JASS methods while MASS is utilized for English and Russian.

Model	Vocabulary	BPE codes
JASS-jaen	ja-en	ja-en.bpe.codes
JASS-jaru	ja-ru	ja-ru.bpe.codes

Usage

Run the bpe precrocessing for the dataset to be finetuned. After setting up the downloaded vocabulary for src and tgt sentences during the preprocessing phase by preprocess.py of OpenNMT, use train_from argument of train.py in OpenNMT to implement the finetuning for the pretrained model.

Others

We will update the current Japanese--English pre-trained model and release pretrained models on Japanese--Chinese and Japanese--Korean. We released new models here: code

Reference

[1] Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi, JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

@inproceedings{mao-etal-2020-jass,
    title = "{JASS}: {J}apanese-specific Sequence to Sequence Pre-training for Neural Machine Translation",
    author = "Mao, Zhuoyuan  and
      Cromieres, Fabien  and
      Dabre, Raj  and
      Song, Haiyue  and
      Kurohashi, Sadao",
    booktitle = "Proceedings of The 12th Language Resources and Evaluation Conference",
    month = may,
    year = "2020",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://www.aclweb.org/anthology/2020.lrec-1.454",
    pages = "3683--3691",
    language = "English",
    ISBN = "979-10-95546-34-4",
}

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Related tags

Overview

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Requirements

Pre-trained JASS models

Usage

Others

Reference

Owner

Zhuoyuan Mao

Code for the paper "Benchmarking and Analyzing Point Cloud Classification under Corruptions"

BuildingNet: Learning to Label 3D Buildings

This repository contains project created during the Data Challenge module at London School of Hygiene & Tropical Medicine

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

It is the assignment for COMP 576 in Rice University

Exploring the Dual-task Correlation for Pose Guided Person Image Generation

LF-YOLO (Lighter and Faster YOLO) is used to detect defect of X-ray weld image.

Codebase for Amodal Segmentation through Out-of-Task andOut-of-Distribution Generalization with a Bayesian Model

Fit Fast, Explain Fast

Fast, flexible and fun neural networks.

Pretty Tensor - Fluent Neural Networks in TensorFlow

Official PyTorch Implementation of Convolutional Hough Matching Networks, CVPR 2021 (oral)

YOLTv4 builds upon YOLT and SIMRDWN, and updates these frameworks to use the most performant version of YOLO, YOLOv4

Buffon’s needle: one of the oldest problems in geometric probability

Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the DNSMOS of first stage is 3.42 and second stage is 3.47.

Using pretrained language models for biomedical knowledge graph completion.

ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge (ManiSkill Challenge), a large-scale learning-from-demonstrations benchmark for object manipulation.

Official Repository for Machine Learning class - Physics Without Frontiers 2021

Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (Local-Lip)

《Truly shift-invariant convolutional neural networks》(2021)