Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Last update: Oct 18, 2022

Related tags

Deep Learning GCS_KI

Overview

Graph Convolution Simulator (GCS)

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Requirements:

PyTorch and DGL should be installed based on your system. For other libraries, you can install them using the following command:

$ pip install -r requirements.txt

Run Knowledge Integration Interpretation (KI) by GCS on example data:

$ bash run_example.sh

Interpretation results are saved in ./example/example_data/gcs.edgelist.

If the knowledge graph is small, users can visualize it by ./example/example_data/results.pdf. Here is the results for the example data:

Run Knowledge Intergration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Store them as PyTorch tensor (.pt) format. Make sure they have the same number of rows, and the indexes of entities are the same. The default files are emb_roberta.pt and emb_kadapter.pt.

Step 2: Prepare the knowledge graph:

Three files are needed to load the knowledge graph:

a) qid2idx.json: The index dictionary. The key is entity Q-label, and value is the index of entity in entity embedding
b) qid2label.json : The label dictionary. The key is entity Q-label, and the value is the entity label text. Note that this dictionary is only for visualization, you can set it as {Q-label: Q-label} if you don't have the text.
c) kg.edgelist: The knowledge triple to construct knowledge graph. Each row is for one triple as: entity1_idx \t entity2_idx \t {}.

Step 3: Run GCS for KI interpretation:

After two preparation steps, you can run GCS by:

$ python src/example.py  --emb_vlm emb_roberta.pt  -emb_klm emb_kadapter.pt  --data_dir ./example_data  --lr 1e-3  --loss mi_loss

As for the hyperparameters, users may check them in ./example/src/example.py. Note that for large knowledge graphs, we recommend to use mutual information loss (mi_loss), and please do not visualize the results for large knowledge graphs.

Step 4: Analyze GCS interpretation results:

The interpretation results are saved in ./example/example_data/gcs.edgelist. Each row is for one triple as: entity1_idx \t entity2_idx \t {'a': xxxx}. Here, the value of 'a' is the attention coefficient value on the triple/entity (entity1, r, entity2). Users may use them to analyze the factual knowledge learned during knowledge integration.

Reproduce the results in the paper

Please enter ./all_exp folder for more details

Cite

If you use the code, please cite the paper:

@article{hou2022understanding,
  title={Understanding Knowledge Integration in Language Models with Graph Convolutions},
  author={Hou, Yifan and Fu, Guoji and Sachan, Mrinmaya},
  journal={arXiv preprint arXiv:2202.00964},
  year={2022}
}

Contact

Feel free to open an issue or send me ([email protected]) an email if you have any questions!

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Related tags

Overview

Graph Convolution Simulator (GCS)

Requirements:

Run Knowledge Integration Interpretation (KI) by GCS on example data:

Run Knowledge Intergration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Step 2: Prepare the knowledge graph:

Step 3: Run GCS for KI interpretation:

Step 4: Analyze GCS interpretation results:

Reproduce the results in the paper

Cite

Contact

Owner

yifan

A repository with exploration into using transformers to predict DNA ↔ transcription factor binding

StarGAN v2-Tensorflow - Simple Tensorflow implementation of StarGAN v2

Learning and Building Convolutional Neural Networks using PyTorch

This repository provides a basic implementation of our GCPR 2021 paper "Learning Conditional Invariance through Cycle Consistency"

This project uses Template Matching technique for object detecting by detection of template image over base image.

Source code for Acorn, the precision farming rover by Twisted Fields

Code for paper: "Spinning Language Models for Propaganda-As-A-Service"

Pipeline for employing a Lightweight deep learning models for LOW-power systems

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

Face Recognition plus identification simply and fast | Python

Using Python to Play Cyberpunk 2077

AI drive app that can help user become beautiful.

Project page for the paper Semi-Supervised Raw-to-Raw Mapping 2021.

cl;asification problem using classification models in supervised learning

A simple implementation of Kalman filter in single object tracking

Heterogeneous Temporal Graph Neural Network

Model Zoo for MindSpore

Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021.

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation