A high-level yet extensible library for fast language model tuning via automatic prompt search

Last update: Dec 07, 2022

Overview

ruPrompts

ruPrompts is a high-level yet extensible library for fast language model tuning via automatic prompt search. It features integration with the HuggingFace Hub, a configuration system powered by Hydra, and a command-line interface.

A prompt is a text instruction for a language model, such as:

Translate English to French:
cat =>

For some tasks the prompt is obvious, but for others it is not. With ruPrompts you define only the prompt format, like {text}, and train the prompt automatically for any task for which you have a training dataset.
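To make the idea of a prompt format concrete, here is a minimal sketch that fills a {text} placeholder using plain Python string formatting. It is an illustration only and does not use the ruPrompts API; the names MANUAL_PROMPT and render_prompt are hypothetical. The manual prompt is the translation example above, written by hand, which is the part ruPrompts aims to learn from data instead.

```python
# Illustration only: plain str.format, not the ruPrompts API.
# MANUAL_PROMPT and render_prompt are hypothetical names.

# A hand-written prompt format with a {text} placeholder for the task input.
MANUAL_PROMPT = "Translate English to French:\n{text} =>"


def render_prompt(prompt_format: str, **fields: str) -> str:
    """Fill the named placeholders of a prompt format with task inputs."""
    return prompt_format.format(**fields)


rendered = render_prompt(MANUAL_PROMPT, text="cat")
print(rendered)
```

With a trained prompt, the hand-written instruction around {text} would be replaced by whatever the automatic search finds for the task.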

You can currently use ruPrompts for text-to-text tasks, such as summarization, detoxification, style transfer, etc., and for styled text generation, as a special case of text-to-text.

Features

Modular structure for convenient extensibility
Integration with HF Transformers, with support for all models that have an LM head
Integration with HF Hub for sharing and loading pretrained prompts
CLI and configuration system powered by Hydra
Pretrained prompts for ruGPT-3

Installation

ruPrompts can be installed with pip:

pip install ruprompts[hydra]
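After installing, one way to check that the package is visible to the current interpreter is to look up its import name, ruprompts, without importing it. This is a small sketch using only the standard library; the helper name is_installed is hypothetical.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found by the import system."""
    return importlib.util.find_spec(module_name) is not None


# `ruprompts` is the import name used in the usage examples of this README.
print("ruprompts installed:", is_installed("ruprompts"))
```

In shells that treat square brackets specially, such as zsh, the package spec in the pip command may need quoting: pip install "ruprompts[hydra]".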

See Installation for other installation options.

Usage

Loading a pretrained prompt for styled text generation:

>>> import ruprompts
>>> from transformers import pipeline

>>> ppln_joke = pipeline("text-generation-with-prompt", prompt="konodyuk/prompt_rugpt3large_joke")
>>> ppln_joke("Говорит кружка ложке")
[{"generated_text": 'Говорит кружка ложке: "Не бойся, не утонешь!".'}]

For text2text tasks:

>>> ppln_detox = pipeline("text2text-generation-with-prompt", prompt="konodyuk/prompt_rugpt3large_detox_russe")
>>> ppln_detox("Опять эти тупые дятлы все испортили, чтоб их черти взяли")
[{"generated_text": 'Опять эти люди все испортили'}]
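Both pipelines above return a list of dictionaries with a generated_text key. The sketch below shows one way to pull out just the strings. It works on a hard-coded sample copied from the output above, so it runs without downloading a model; the helper name generated_texts is hypothetical and is not part of ruPrompts or Transformers.

```python
from typing import Dict, List


def generated_texts(outputs: List[Dict[str, str]]) -> List[str]:
    """Collect the `generated_text` field from pipeline-style outputs."""
    return [item["generated_text"] for item in outputs]


# Sample copied from the detoxification example above (no model is run here).
sample_outputs = [{"generated_text": "Опять эти люди все испортили"}]
texts = generated_texts(sample_outputs)
print(texts[0])
```

In real use, sample_outputs would be replaced by the value returned from a call such as ppln_detox(...).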

Proceed to Quick Start for a more detailed introduction or start using ruPrompts right now with our Colab Tutorials.

License

ruPrompts is Apache 2.0 licensed. See the LICENSE file for details.

Owner

Sber AI
