[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Last update: Jan 06, 2023

REval

Introduction
Overview
Requirements
Installation
Probing
Usage
Citation
License

🎓 Introduction

REval is a simple framework for probing sentence-level representations of Relation Extraction models.
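
To make the idea of probing concrete: a probe is a small classifier trained on frozen sentence representations to predict a linguistic property, and its accuracy indicates how much of that property the representation encodes. The sketch below is not REval's implementation. It uses a nearest-centroid classifier as a stand-in probe, and the vectors, labels, and function names are invented for illustration.

```python
from collections import defaultdict


def fit_centroids(vectors, labels):
    """Average the frozen representations of each class into one centroid."""
    sums, counts = {}, defaultdict(int)
    for vec, label in zip(vectors, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}


def predict(centroids, vec):
    """Return the label whose centroid is closest in squared Euclidean distance."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: sq_dist(centroids[label], vec))


def probe_accuracy(centroids, vectors, labels):
    """Percentage of examples whose property the probe predicts correctly."""
    correct = sum(predict(centroids, v) == y for v, y in zip(vectors, labels))
    return 100.0 * correct / len(labels)


# Toy "sentence representations" and a made-up binary property (hypothetical data).
train_vecs = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
train_labels = ["short", "short", "long", "long"]
test_vecs = [[0.7, 0.3], [0.3, 0.7]]
test_labels = ["short", "long"]

centroids = fit_centroids(train_vecs, train_labels)
print(probe_accuracy(centroids, test_vecs, test_labels))
```

The representations are never updated while the probe is fitted; only the probe's own parameters (here, the centroids) are learned.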

✅ Requirements

REval is tested with:

Python 3.7

🚀 Installation

With pip

<TBD>

From source

git clone https://github.com/DFKI-NLP/REval
cd REval
pip install -r requirements.txt

🔬 Probing

Supported Datasets

SemEval 2010 Task 8 (CoreNLP annotated version) [LINK]
TACRED (obtained via LDC) [LINK]

Probing Tasks

Task	SemEval 2010	TACRED
ArgTypeHead	✔️	✔️
ArgTypeTail	✔️	✔️
Length	✔️	✔️
EntityDistance	✔️	✔️
ArgumentOrder		✔️
EntityExistsBetweenHeadTail	✔️	✔️
PosTagHeadLeft	✔️	✔️
PosTagHeadRight	✔️	✔️
PosTagTailLeft	✔️	✔️
PosTagTailRight	✔️	✔️
TreeDepth	✔️	✔️
SDPTreeDepth	✔️	✔️
ArgumentHeadGrammaticalRole	✔️	✔️
ArgumentTailGrammaticalRole	✔️	✔️

🔧 Usage

Step 1: Create the probing task datasets from the original datasets.

SemEval 2010 Task 8

python reval.py generate-all-from-semeval \
    --train-file <SEMEVAL DIR>/train.json \
    --validation-file <SEMEVAL DIR>/dev.json \
    --test-file <SEMEVAL DIR>/test.json \
    --output-dir ./data/semeval/

TACRED

python reval.py generate-all-from-tacred \
    --train-file <TACRED DIR>/train.json \
    --validation-file <TACRED DIR>/dev.json \
    --test-file <TACRED DIR>/test.json \
    --output-dir ./data/tacred/
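
Both commands expect train.json, dev.json, and test.json in the dataset directory. Before running them, it may help to confirm that these files are present. The helper below is a small sketch for that purpose; the function name and the placeholder path are hypothetical and not part of REval.

```python
from pathlib import Path

# File names taken from the commands above; adjust them if your copy differs.
EXPECTED_SPLITS = ("train.json", "dev.json", "test.json")


def missing_split_files(dataset_dir):
    """Return the expected split files that are absent from dataset_dir."""
    root = Path(dataset_dir)
    return [name for name in EXPECTED_SPLITS if not (root / name).is_file()]


# "path/to/semeval" is a placeholder for your <SEMEVAL DIR> or <TACRED DIR>.
missing = missing_split_files("path/to/semeval")
if missing:
    print("Missing files:", ", ".join(missing))
else:
    print("All split files found.")
```

An empty list means all three splits were found and the generate command can be run against that directory.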

Step 2: Run the probing tasks on a model.

For example, download a Relation Extraction model trained with RelEx, such as the CNN trained on SemEval.

mkdir -p models/cnn_semeval
wget --content-disposition https://cloud.dfki.de/owncloud/index.php/s/F3gf9xkeb2foTFe/download -P models/cnn_semeval

python probing_task_evaluation.py \
    --model-dir ./models/cnn_semeval/ \
    --data-dir ./data/semeval/ \
    --dataset semeval2010 \
    --cuda-device 0 \
    --batch-size 64 \
    --cache-representations

After the run completes, the results are stored in probing_task_results.json in the model directory (the path passed as --model-dir).

{
    "ArgTypeHead": {
        "acc": 75.82,
        "devacc": 78.96,
        "ndev": 670,
        "ntest": 2283
    },
    "ArgTypeTail": {
        "acc": 75.4,
        "devacc": 78.79,
        "ndev": 627,
        "ntest": 2130
    },
    [...]
}
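
The results file can be inspected with a few lines of Python. The sketch below assumes that "acc" is the test accuracy and "devacc" the validation accuracy, which the accompanying "ntest" and "ndev" counts suggest but the README does not state. The function names are hypothetical, and the sample input simply repeats the two entries shown above.

```python
import json
from pathlib import Path


def load_results(model_dir):
    """Read probing_task_results.json from the given model directory."""
    path = Path(model_dir) / "probing_task_results.json"
    with path.open(encoding="utf-8") as handle:
        return json.load(handle)


def summarize_results(results):
    """Return (task, acc, devacc) rows sorted by acc, highest first."""
    rows = [(task, scores["acc"], scores["devacc"]) for task, scores in results.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


# The two entries from the example output above, used as sample input.
sample = {
    "ArgTypeHead": {"acc": 75.82, "devacc": 78.96, "ndev": 670, "ntest": 2283},
    "ArgTypeTail": {"acc": 75.4, "devacc": 78.79, "ndev": 627, "ntest": 2130},
}

for task, acc, devacc in summarize_results(sample):
    print(f"{task:<12} acc={acc:.2f} devacc={devacc:.2f}")
```

For a real run, replace the sample with load_results("./models/cnn_semeval/"), using the same directory that was passed as --model-dir.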

📚 Citation

If you use REval, please consider citing the following paper:

@inproceedings{alt-etal-2020-probing,
    title={Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction},
    author={Christoph Alt and Aleksandra Gabryszak and Leonhard Hennig},
    year={2020},
    booktitle={Proceedings of ACL},
    url={https://arxiv.org/abs/2004.08134}
}

📘 License

REval is released under the terms of the MIT License.
