Machine translation models released by the Gourmet project

Last update: Dec 08, 2021

Related tags

Overview

Gourmet Models

Overview

The Gourmet project has released several machine translation models to translate low-resource languages. This repository contains information about the models, as well as sample code showing how the models can be used. The models themselves are available as dockers, and can be downloaded from a separate site (see below).

Some of the models are described in our public deliverables. See the integration report for details of model training, and the evaluation report for details of evaluation.

Using the Models

The model download links are here. Once downloaded, the model can be deployed using:

docker load <

to launch the translation server, use:

docker run -p 4000:4000 -i --rm

This exposes the model on port 4000. Change the second number above if you want to use a different port.

To test the model, use the sample client like this:

$ ./gourmet-client.py 
2021-11-15 14:54:10 DEBUG: __main__:  Connecting to translation server at localhost4000
2021-11-15 14:54:10 DEBUG: urllib3.connectionpool:  Starting new HTTP connection (1): localhost:4000
2021-11-15 14:54:14 DEBUG: urllib3.connectionpool:  http://localhost:4000 "POST /translation HTTP/1.1" 200 88
Translation: An gudanar da taron sauyin yanayi a Glasgow
Time: 2242
Error: None

This is using the English to Hausa model. You can use the -n and -p arguments to change the host and port. The source text is hard-coded but easily changed.

Some of the models support additional arguments which can be passed to the docker at launch time, using the -e argument to docker (for environemnent) variables. See the models page for more details.

Computational Requirements

All models are configured to use the CPU for translation. A GPU is not required, and the models will not use it.

Licence

CC-BY

Acknowledgements

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825299.

Machine translation models released by the Gourmet project

Related tags

Overview

Gourmet Models

Overview

Using the Models

Computational Requirements

Licence

Acknowledgements

Owner

Edinburgh NLP

This is Assignment1 code for the Web Data Processing System.

Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2021).

Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"

Python library for processing Chinese text

Pretty-doc - Composable text objects with python

Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks

Random Directed Acyclic Graph Generator

Awesome Treasure of Transformers Models Collection

Search msDS-AllowedToActOnBehalfOfOtherIdentity

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

Sample data associated with the Aurora-BP study

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.

ChainKnowledgeGraph, 产业链知识图谱包括A股上市公司、行业和产品共3类实体

A programming language with logic of Python, and syntax of all languages.

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Sentiment Classification using WSD, Maximum Entropy & Naive Bayes Classifiers

A python framework to transform natural language questions to queries in a database query language.

Fake news detector filters - Smart filter project allow to classify the quality of information and web pages

Implementation of the Hybrid Perception Block and Dual-Pruned Self-Attention block from the ITTR paper for Image to Image Translation using Transformers

The Classical Language Toolkit