RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with image-text pairs.

Last update: Sep 20, 2022

Overview

RuCLIPtiny

Zero-shot image classification model for Russian language

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with image-text pairs. The model is based on ConvNeXt-tiny (image encoder) and DistilRuBert-tiny (text encoder), and builds on extensive research in zero-shot transfer, computer vision, natural language processing, and multimodal learning.

Result evaluation

Our model achieved 46.62% top-1 and 73.18% top-5 zero-shot accuracy on CIFAR-100.
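For readers unfamiliar with the metric, top-k accuracy counts a prediction as correct when the true class is among the k highest-scoring classes. The sketch below is a generic illustration on made-up scores, not the evaluation script used for the numbers above; the function name `top_k_accuracy` and the toy data are assumptions introduced here.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for row, label in zip(scores, labels):
        # Indices of the k classes with the highest scores for this sample.
        top_k = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in top_k
    return hits / len(labels)


# Toy scores for 3 samples over 4 classes (made-up numbers, not model output).
scores = [
    [0.1, 0.6, 0.2, 0.1],  # true class 1: counted as a hit for k >= 1
    [0.4, 0.3, 0.2, 0.1],  # true class 1: counted as a hit for k >= 2
    [0.1, 0.2, 0.3, 0.4],  # true class 0: counted as a hit only for k = 4
]
labels = [1, 1, 0]

top1 = top_k_accuracy(scores, labels, k=1)
top2 = top_k_accuracy(scores, labels, k=2)
print(f"top-1: {top1:.2%}, top-2: {top2:.2%}")
```

On CIFAR-100 the same computation would run over 100 class scores per image with k=1 and k=5.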

Examples

Evaluate & Simple usage

Finetuning

ONNX conversion and speed testing

Model weights

Usage

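Install the rucliptiny module and its requirements first. In a notebook environment such as Google Colab, the following commands download the model weights and install the package: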

!gdown -O ru-clip-tiny.pkl "https://drive.google.com/uc?id=1-3g3J90pZmHo9jbBzsEmr7ei5zm3VXOL"
!pip install git+https://github.com/cene555/ru-clip-tiny.git

Example in 3 steps

Download CLIP image from repo

!wget -c -O CLIP.png "https://github.com/openai/CLIP/blob/main/CLIP.png?raw=true"

1. Import libraries

from rucliptiny.predictor import Predictor
from rucliptiny import RuCLIPtiny
import torch

torch.manual_seed(1)  # fix the random seed for reproducibility
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

2. Load model

model = RuCLIPtiny()
# map_location lets the checkpoint load on CPU-only machines as well
model.load_state_dict(torch.load('ru-clip-tiny.pkl', map_location=device))
model = model.to(device).eval()

3. Use predictor to get probabilities

predictor = Predictor()

# Class labels in Russian: 'diagram', 'dog', 'cat'
classes = ['диаграмма', 'собака', 'кошка']
text_probs = predictor(model=model, images_path=["CLIP.png"],
                       classes=classes, get_probs=True,
                       max_len=77, device=device)
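To read the result, each class label can be paired with its probability and sorted. The following is a minimal sketch that assumes `text_probs` holds one row of per-class probabilities per image and that a row can be converted to a plain list of floats (for a tensor, typically via `.tolist()`); that return shape is an assumption, not something this README states. The helper `rank_classes` and the placeholder numbers are hypothetical.

```python
def rank_classes(probs_row, classes):
    """Pair each class label with its probability, highest probability first."""
    return sorted(zip(classes, probs_row), key=lambda pair: pair[1], reverse=True)


classes = ['диаграмма', 'собака', 'кошка']
# Placeholder probabilities for one image (illustrative only, not real model output).
example_row = [0.90, 0.06, 0.04]

ranking = rank_classes(example_row, classes)
best_label, best_prob = ranking[0]
print(best_label, best_prob)
```

With real output, `example_row` would be replaced by the row of `text_probs` that corresponds to the image of interest.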

Cosine similarity Visualization Example
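As background for this kind of visualization, CLIP-style models compare images and texts through the cosine similarity of their embeddings, and a softmax over the scaled similarities gives per-text probabilities for each image. The sketch below shows that computation on toy vectors in plain Python. The embeddings, the helper names, and the scale value of 100 are assumptions chosen for illustration; the scale RuCLIP tiny actually uses is not stated here.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cosine_similarity_matrix(image_embs, text_embs):
    """Rows index images, columns index texts."""
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(img, txt)) for txt in texts] for img in images]


def softmax(row, scale=1.0):
    """Turn one row of similarities into probabilities; a larger scale sharpens them."""
    shifted = [scale * x for x in row]
    peak = max(shifted)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in shifted]
    total = sum(exps)
    return [e / total for e in exps]


# Toy 3-dimensional embeddings (made up for illustration).
image_embs = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0]]
text_embs = [[2.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0]]

sims = cosine_similarity_matrix(image_embs, text_embs)
probs = [softmax(row, scale=100.0) for row in sims]

for row in sims:
    print([round(value, 3) for value in row])
```

Plotting `sims` as a heatmap, with images on one axis and texts on the other, gives the usual similarity visualization.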

Speed Testing

NVIDIA Tesla K80 (Google Colab session)

Model (PyTorch)	batch size	encode_image	encode_text	total
RuCLIPtiny	2	0.011	0.004	0.015
RuCLIPtiny	8	0.011	0.004	0.015
RuCLIPtiny	16	0.012	0.005	0.017
RuCLIPtiny	32	0.014	0.005	0.019
RuCLIPtiny	64	0.013	0.006	0.019
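The table can be turned into a per-item cost and a throughput figure. The sketch below assumes that `total` is the time to process one whole batch, which the table does not state explicitly, and it makes no assumption about the time unit. The variable names are introduced here for illustration.

```python
# (batch size, total time) pairs copied from the table above.
measurements = [(2, 0.015), (8, 0.015), (16, 0.017), (32, 0.019), (64, 0.019)]

rows = []
for batch, total in measurements:
    per_item = total / batch    # time spent per image-text pair
    throughput = batch / total  # pairs processed per unit of time
    rows.append((batch, per_item, throughput))

for batch, per_item, throughput in rows:
    print(f"batch={batch:>2}  per item={per_item:.6f}  throughput={throughput:.0f}")
```

Under that assumption, the cost per pair falls from 0.0075 at batch size 2 to about 0.0003 at batch size 64, roughly 25 times lower, because the total time barely grows with batch size.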

We would like to express our gratitude to Sber AI for the grants that funded this research, which was carried out as part of the Artificial Intelligence International Junior Contest (AIIJC).

Owner

Shahmatov Arseniy
