Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

Last update: Nov 26, 2022

GN-Transformer AST

Data Preparation

Preprocess the dataset yourself

The code we used to preprocess the Java and Python datasets is in ./preprocess. See the README.md in /Java and /Python, respectively, for how to preprocess each corpus.

The original corpora we used are available here:

Java corpus: https://github.com/xing-hu/TL-CodeSum

Python corpus: https://github.com/EdinburghNLP/code-docstring-corpus

Directly use our preprocessed dataset

You can directly download our preprocessed dataset:

Java: https://drive.google.com/file/d/1hVJaA2JA377Iz3bstHLIGaffUh_ogVnG/view?usp=sharing

Python: https://drive.google.com/file/d/1lQhczrERskISdBcWeS6VWLwCMpBAh-YF/view?usp=sharing

Alternatively, run data_prepare.sh in ./data to prepare the dataset.
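The share links above each embed a Google Drive file ID. As a minimal sketch, the ID can be extracted and turned into a direct-download URL. This assumes the commonly used uc?export=download URL form; Google Drive may show an extra confirmation page for large files, which this sketch does not handle. The helper names drive_file_id and direct_download_url are hypothetical and not part of this repository.

```python
import re

# Share links for the preprocessed datasets, copied from this README.
DATASET_LINKS = {
    "java": "https://drive.google.com/file/d/1hVJaA2JA377Iz3bstHLIGaffUh_ogVnG/view?usp=sharing",
    "python": "https://drive.google.com/file/d/1lQhczrERskISdBcWeS6VWLwCMpBAh-YF/view?usp=sharing",
}

# A file ID sits between "/file/d/" and the next "/" in a share link.
_FILE_ID = re.compile(r"/file/d/([A-Za-z0-9_-]+)")


def drive_file_id(share_link: str) -> str:
    """Return the file ID embedded in a Google Drive share link."""
    match = _FILE_ID.search(share_link)
    if match is None:
        raise ValueError(f"no Google Drive file ID found in {share_link!r}")
    return match.group(1)


def direct_download_url(share_link: str) -> str:
    """Build a direct-download URL (assumed URL form, see the note above)."""
    return "https://drive.google.com/uc?export=download&id=" + drive_file_id(share_link)
```

For example, direct_download_url(DATASET_LINKS["java"]) yields a URL that can be passed to a download tool of your choice.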

Training

Enter the script folder for the target language and run gntransformer.sh; training and testing will then start.

#GPU: GPU device IDs (comma-separated for multiple GPUs)

#NAME: name of the model

Java:

cd ./scripts/java

bash gntransformer.sh #GPU #NAME

Python:

cd ./scripts/python

bash gntransformer.sh #GPU #NAME

Examples:

bash gntransformer.sh 0 some_name # one gpu

bash gntransformer.sh 0,1 some_name # two gpus

...
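The invocations above can also be driven from Python, for instance when queuing several runs. The following is a minimal sketch, assuming it is run from the repository root and that gntransformer.sh takes exactly the two positional arguments shown above. The functions parse_gpu_ids, build_training_command, and run_training are hypothetical helpers, not part of this repository.

```python
import subprocess
from pathlib import Path


def parse_gpu_ids(gpu_arg: str) -> list[int]:
    """Split a comma-separated device string such as "0,1" into integers."""
    ids = [int(part) for part in gpu_arg.split(",") if part.strip()]
    if not ids:
        raise ValueError("at least one GPU device ID is required")
    return ids


def build_training_command(gpu_ids: list[int], name: str) -> list[str]:
    """Build the `bash gntransformer.sh #GPU #NAME` call from this README."""
    return ["bash", "gntransformer.sh", ",".join(str(i) for i in gpu_ids), name]


def run_training(language: str, gpu_arg: str, name: str) -> None:
    """Run the training script from ./scripts/<language> (java or python)."""
    if language not in ("java", "python"):
        raise ValueError(f"unsupported language: {language!r}")
    script_dir = Path("scripts") / language
    command = build_training_command(parse_gpu_ids(gpu_arg), name)
    # check=True raises CalledProcessError if the script exits with an error.
    subprocess.run(command, cwd=script_dir, check=True)
```

Calling run_training("java", "0,1", "some_name") mirrors the two-GPU example above. Parsing the device string first rejects malformed input before the script is launched.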

Trained models

You can download our trained models here:

Java: https://drive.google.com/file/d/1vnIuGLBNGU_AHDwL7yZIkoaByWiLKYxb/view?usp=sharing

Python: https://drive.google.com/file/d/1tk3Wc4YpSo_oLKCi6h3Kitvsux3vWFUO/view?usp=sharing

Alternatively, run download_models.sh in ./models to download the trained models.

Owner

Cheng Jun-Yan
