Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

Benchmarks for Model-Based Optimization

Action Segmentation Evaluation

Official PyTorch Implementation of GAN-Supervised Dense Visual Alignment

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Sequential Model-based Algorithm Configuration

Code for HodgeNet: Learning Spectral Geometry on Triangle Meshes, in SIGGRAPH 2021.

NovelD: A Simple yet Effective Exploration Criterion

Use graph-based analysis to re-classify stocks and to improve Markowitz portfolio optimization

Walk with fastai

A PyTorch implementation of "SelfGNN: Self-supervised Graph Neural Networks without explicit negative sampling"

Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch

Practical and Real-world applications of ML based on the homework of Hung-yi Lee Machine Learning Course 2021

This repository includes the code of the sequence-to-sequence model for discontinuous constituent parsing described in paper Discontinuous Grammar as a Foreign Language.

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

This is the code of "Multi-view Contrastive Graph Clustering" in NeurlPS 2021.

Code for the paper "Combining Textual Features for the Detection of Hateful and Offensive Language"

A 3D Dense mapping backend library of SLAM based on taichi-Lang designed for the aerial swarm.

An introduction to satellite image analysis using Python + OpenCV and JavaScript + Google Earth Engine

Pytorch Implementation for (STANet+ and STANet)

Learning cell communication from spatial graphs of cells