A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.

Last update: Dec 25, 2022

Overview

Basic-UI-for-GPT-J-6B-with-low-vram

A repository to run GPT-J-6B on low vram systems by using both ram, vram and pinned memory.

There seem to be some issues with the weights in the drive link. There seems to be some performance loss, most likely because of poor 16 bit conversion.

How to run :

Use - pip install git+https://github.com/finetuneanon/[email protected]
Use the link - https://drive.google.com/file/d/1tboTvohQifN6f1JiSV8hnciyNKvj9pvm/view?usp=sharing to dowload the model that has been saved as described here - https://github.com/arrmansa/saving-and-loading-large-models-pytorch

Timing (2000 token context)

1

system -

16 gb ddr4 ram . 1070 8gb gpu.
23 blocks on ram (ram_blocks = 23) out of which 18 are on shared/pinned memory (max_shared_ram_blocks = 18).

timing -

single run of the model(inputs) takes 6.5 seconds.
35 seconds to generate 25 tokens at 2000 context. (1.4 seconds/token)

2

system -

16 gb ddr4 ram . 1060 6gb gpu.
26 blocks on ram (ram_blocks = 26) out of which 18 are on shared/pinned memory (max_shared_ram_blocks = 18).

timing -

40 seconds to generate 25 tokens at 2000 context. (1.6 seconds/token)

A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.

Related tags

Overview

Basic-UI-for-GPT-J-6B-with-low-vram

There seem to be some issues with the weights in the drive link. There seems to be some performance loss, most likely because of poor 16 bit conversion.

How to run :

Timing (2000 token context)

1

system -

timing -

2

system -

timing -

Owner

scikit-learn wrappers for Python fastText.

Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

EasyTransfer is designed to make the development of transfer learning in NLP applications easier.

BiQE: Code and dataset for the BiQE paper

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model

The (extremely) naive sentiment classification function based on NBSVM trained on wisesight_sentiment

LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

A Flask Sentiment Analysis API, with visual implementation

CATs: Semantic Correspondence with Transformers

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

Repository for Project Insight: NLP as a Service

NewsMTSC: (Multi-)Target-dependent Sentiment Classification in News Articles

spaCy plugin for Transformers , Udify, ELmo, etc.

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Smart discord chatbot integrated with Dialogflow to manage different classrooms and assist in teaching!

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

Official PyTorch implementation of SegFormer

Easy, fast, effective, and automatic g-code compression!