Code Generation using a large neural network called GPT-J

Last update: Dec 31, 2022

Overview

CodeGenX

CodeGenX is a Code Generation system powered by Artificial Intelligence! It is delivered to you in the form of a Visual Studio Code Extension and is Free and Open-source!

Installation

You can find installation instructions and additional information about CodeGenX in the documentation here.

About CodeGenX

1. Languages Supported

CodeGenX currently only supports Python. We are planning to add additional languages in future releases.

2. Modules Trained On

CodeGenX was trained on Python code which covers many of its common uses. Some libraries which CodeGenX is specifically trained on are:

Tensorflow
Pytorch
Scikit-Learn
Pandas
NumPy
OpenCV
Django
Flask
PyGame

3. How CodeGenX Works

At the core of CodeGenX lies a large neural network called GPT-J. GPT-J is a 6 billion parameter transformer model which was trained on hundreds of gigabytes of text from the internet. We fine-tuned this model on a dataset of open-source python code. This fine-tuned model can now be used to generate code when given an input with the right instructions.

Contributors ✨

This project would not have been possible without the help of these wonderful people:

_{Arya Manjaramkar}	_{Matthias Wijnsma}	_{Thomas Houtrique}	_{Dominic Rampas}	_{Bilel Medimegh}	_{Josh Hills}	_Alex
_Tiimo

Acknowledgements

Many thanks to the support of the Google TPU Research Cloud for providing the precious compute needed for this project.

Code Generation using a large neural network called GPT-J

Related tags

Overview

CodeGenX

Installation

About CodeGenX

1. Languages Supported

2. Modules Trained On

3. How CodeGenX Works

Contributors ✨

Acknowledgements

Owner

DeepGenX

Text classification on IMDB dataset using Keras and Bi-LSTM network

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

pytorch implementation of Attention is all you need

Continuously update some NLP practice based on different tasks.

Machine learning models from Singapore's NLP research community

Ecommerce product title recognition package

Unsupervised Language Model Pre-training for French

Edge-Augmented Graph Transformer

Gpt2-WebAPI - The objective of this API is to provide the 3 best possible responses to sentences that the user would input via http GET request as a parameter

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

neural network based speaker embedder

jel - Japanese Entity Linker - is Bi-encoder based entity linker for japanese.

COVID-19 Related NLP Papers

使用Mask LM预训练任务来预训练Bert模型。训练垂直领域语料的模型表征，提升下游任务的表现。

Full Spectrum Bioinformatics - a free online text designed to introduce key topics in Bioinformatics using the Python

Text Classification Using LSTM

Partially offline multi-language translator built upon Huggingface transformers.

Blender addon - Scrub timeline from viewport with a shortcut

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

Model for recasing and repunctuating ASR transcripts