Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Last update: Jan 02, 2023

Overview

E(n)-Equivariant Transformer (wip)

Implementation of E(n)-Equivariant Transformer, which extends the ideas from Welling's E(n)-Equivariant Graph Neural Network with attention.

Install

$ pip install En-transformer

Usage

import torch
from en_transformer import EnTransformer

model = EnTransformer(
    dim = 512,
    depth = 4,
    dim_head = 64,
    heads = 8,
    edge_dim = 4,
    fourier_features = 2
)

feats = torch.randn(1, 16, 512)
coors = torch.randn(1, 16, 3)
edges = torch.randn(1, 16, 16, 4)

feats, coors = model(feats, coors, edges)  # (1, 16, 512), (1, 16, 3)

Todo

masking
neighborhoods by radius

Citations

@misc{satorras2021en,
    title 	= {E(n) Equivariant Graph Neural Networks}, 
    author 	= {Victor Garcia Satorras and Emiel Hoogeboom and Max Welling},
    year 	= {2021},
    eprint 	= {2102.09844},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

Checkpoint sequential segments should equal number of layers instead of 1?

https://github.com/lucidrains/En-transformer/blob/a37e635d93a322cafdaaf829397c601350b23e5b/en_transformer/en_transformer.py#L527

Looking at the source code here: https://pytorch.org/docs/stable/_modules/torch/utils/checkpoint.html#checkpoint_sequential

opened by aced125 2
On rotary embeddings

Hi @lucidrains, thank you for your amazing work; big fan! I had a quick question on the usage of this repository.

Based on my understanding, rotary embeddings are a drop-in replacement for the original sinusoidal or learnt PEs in Transformers for sequential data, as in NLP or other temporal applications. If my application is not on sequential data, is there a reason why I should still use rotary embeddings?

E.g. for molecular datasets such as QM9 (from the En-GNNs paper), would it make sense to have rotary embeddings?

opened by chaitjo 1
Is this line required?

https://github.com/lucidrains/En-transformer/blob/7247e258fab953b2a8b5a73b8dfdfb72910711f8/en_transformer/en_transformer.py#L159

Is this line required? Does line 157, two lines above, make this line redundant?

opened by aced125 1
Performance drop with checkpointing update

I see a drop in performance (higher loss) when I update checkpointing from checkpoint_sequential(self.layers, 1, inp) to checkpoint_sequential(self.layers, len(self.layers), inp). Is this expected?

opened by heiidii 0
varying number of nodes

@lucidrains Thank you for your efficient implementation. I was wondering how to use this implementation for the dataset when the number of nodes in each graph is not the same? For example, the datasets of small molecules.

opened by mohaiminul2810 1
Edge model/rep

Hi,

Thank you for providing this version of the EnGNN model. This is not really an issue just a query. The original model as implemented here (https://github.com/vgsatorras/egnn) has 3 main steps per layer: edge_feat = self.edge_model(h[row], h[col], radial, edge_attr) coord = self.coord_model(coord, edge_index, coord_diff, edge_feat) h, agg = self.node_model(h, edge_index, edge_feat, node_attr) I am interested in the edge_feat and was wondering what would be an equivalent edge representation in your implementation. Line 335 in EnTransformer.py: qk = self.edge_mlp(qk) seems like the best candidate. Thanks, Pooja

opened by heiidii 1
efficient implementation

Hi, I wonder if relative distances and coordinates can be handled more efficiently using memory efficient attention as in " Self-attention Does Not Need O(n^2) Memory". It is straightforward for the scalar part.

opened by amrhamedp 2

Releases(1.0.2)

1.0.2(Jan 4, 2023)

null
Source code(tar.gz)
Source code(zip)
1.0.1(Dec 30, 2022)

null
Source code(tar.gz)
Source code(zip)
1.0.0(Dec 30, 2022)

null
Source code(tar.gz)
Source code(zip)
0.6.0(Nov 24, 2022)

null
Source code(tar.gz)
Source code(zip)
0.5.4(Mar 4, 2022)

Source code(tar.gz)
Source code(zip)
0.5.3(Mar 4, 2022)

Source code(tar.gz)
Source code(zip)
0.5.2(Mar 4, 2022)

Source code(tar.gz)
Source code(zip)
0.5.1(Nov 27, 2021)

Source code(tar.gz)
Source code(zip)
0.5.0(Aug 27, 2021)

Source code(tar.gz)
Source code(zip)
0.4.0(Aug 25, 2021)

Source code(tar.gz)
Source code(zip)
0.3.9(Aug 25, 2021)

Source code(tar.gz)
Source code(zip)
0.3.8(Jun 10, 2021)

Source code(tar.gz)
Source code(zip)
0.3.7(Jun 10, 2021)

Source code(tar.gz)
Source code(zip)
0.3.6(Jun 8, 2021)

Source code(tar.gz)
Source code(zip)
0.3.5(Jun 6, 2021)

Source code(tar.gz)
Source code(zip)
0.3.4(Jun 5, 2021)

Source code(tar.gz)
Source code(zip)
0.3.3(Jun 5, 2021)

Source code(tar.gz)
Source code(zip)
0.3.2(Jun 5, 2021)

Source code(tar.gz)
Source code(zip)
0.3.1(Jun 4, 2021)

Source code(tar.gz)
Source code(zip)
0.3.0(Jun 4, 2021)

Source code(tar.gz)
Source code(zip)
0.2.12(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.2.11(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.2.10(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.2.8(May 17, 2021)

Source code(tar.gz)
Source code(zip)
0.2.7(May 17, 2021)

Source code(tar.gz)
Source code(zip)
0.2.6(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.5(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.4(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.3(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.2(May 15, 2021)

Source code(tar.gz)
Source code(zip)

Owner

Phil Wang

Working with Attention. It's all we need.

GitHub Repository

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction

DeepMLS: Deep Implicit Moving Least-Squares Functions for 3D Reconstruction This repository contains the implementation of the paper: Deep Implicit Mo

103 Dec 22, 2022

Dist2Dec: A Simplicial Neural Network for Homology Localization

6 Jun 12, 2022

Video Frame Interpolation without Temporal Priors (a general method for blurry video interpolation)

Video Frame Interpolation without Temporal Priors (NeurIPS2020) [Paper] [video] How to run Prerequisites NVIDIA GPU + CUDA 9.0 + CuDNN 7.6.5 Pytorch 1

31 Sep 04, 2022

This repository for project that can Automate Number Plate Recognition (ANPR) in Morocco Licensed Vehicles. 💻 + 🚙 + 🇲🇦 = 🤖 🕵🏻‍♂️

MoroccoAI Data Challenge (Edition #001) This Reposotory is result of our work in the comepetiton organized by MoroccoAI in the context of the first Mo

14 Oct 31, 2022

Code for the upcoming CVPR 2021 paper

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel J. Brostow and Michael

496 Dec 30, 2022

Minimalistic PyTorch training loop

Backbone for PyTorch training loop Will try to keep it minimalistic. pip install back from back import Bone Features Progress bar Checkpoints saving/l

4 Jan 16, 2020

Heart Arrhythmia Classification

This program takes and input of an ECG in European Data Format (EDF) and outputs the classification for heartbeats into normal vs different types of arrhythmia . It uses a deep learning model for cla

4 Nov 02, 2022

A library for preparing, training, and evaluating scalable deep learning hybrid recommender systems using PyTorch.

collie Collie is a library for preparing, training, and evaluating implicit deep learning hybrid recommender systems, named after the Border Collie do

96 Dec 29, 2022

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Pruning Self-attentions into Convolutional Layers in Single Path This is the official repository for our paper: Pruning Self-attentions into Convoluti

77 Dec 26, 2022

A colab notebook for training Stylegan2-ada on colab, transfer learning onto your own dataset.

Stylegan2-Ada-Google-Colab-Starter-Notebook A no thrills colab notebook for training Stylegan2-ada on colab. transfer learning onto your own dataset h

66 Dec 16, 2022

SANet: A Slice-Aware Network for Pulmonary Nodule Detection

SANet: A Slice-Aware Network for Pulmonary Nodule Detection This paper (SANet) has been accepted and early accessed in IEEE TPAMI 2021. This code and

39 Dec 17, 2022

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

ComPhy This repository holds the code for the paper. ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos, (Under review) PDF Pro

29 Dec 29, 2022

Bayesian Image Reconstruction using Deep Generative Models

Bayesian Image Reconstruction using Deep Generative Models R. Marinescu, D. Moyer, P. Golland For technical inquiries, please create a Github issue. F

51 Nov 23, 2022

Model Agnostic Interpretability for Multiple Instance Learning

MIL Model Agnostic Interpretability This repo contains the code for "Model Agnostic Interpretability for Multiple Instance Learning". Overview Executa

10 Dec 17, 2022

WPPNets: Unsupervised CNN Training with Wasserstein Patch Priors for Image Superresolution

WPPNets: Unsupervised CNN Training with Wasserstein Patch Priors for Image Superresolution This code belongs to the paper [1] available at https://arx

5 Jun 02, 2022

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

This is an official pytorch implementation of ActionCLIP: A New Paradigm for Video Action Recognition [arXiv] Overview Content Prerequisites Data Prep

268 Jan 09, 2023

AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis.

AITom Introduction AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis. AITom is originated from the tomominer l

93 Jan 02, 2023

Deep Probabilistic Programming Course @ DIKU

52 May 14, 2022

Stroke-predictions-ml-model - Machine learning model to predict individuals chances of having a stroke

stroke-predictions-ml-model machine learning model to predict individuals chance

1 Jan 03, 2022

This is a vision-based 3d model manipulation and control UI

Manipulation of 3D Models Using Hand Gesture This program allows user to manipulation 3D models (.obj format) with their hands. The project support bo

43 Oct 23, 2022

Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Related tags

Overview

E(n)-Equivariant Transformer (wip)

Install

Usage

Todo

Citations

Comments

Checkpoint sequential segments should equal number of layers instead of 1?

On rotary embeddings

Is this line required?

Performance drop with checkpointing update

varying number of nodes

Edge model/rep

efficient implementation

Releases(1.0.2)

1.0.2(Jan 4, 2023)

1.0.1(Dec 30, 2022)

1.0.0(Dec 30, 2022)

0.6.0(Nov 24, 2022)

0.5.4(Mar 4, 2022)

0.5.3(Mar 4, 2022)

0.5.2(Mar 4, 2022)

0.5.1(Nov 27, 2021)

0.5.0(Aug 27, 2021)

0.4.0(Aug 25, 2021)

0.3.9(Aug 25, 2021)

0.3.8(Jun 10, 2021)

0.3.7(Jun 10, 2021)

0.3.6(Jun 8, 2021)

0.3.5(Jun 6, 2021)

0.3.4(Jun 5, 2021)

0.3.3(Jun 5, 2021)

0.3.2(Jun 5, 2021)

0.3.1(Jun 4, 2021)

0.3.0(Jun 4, 2021)

0.2.12(May 27, 2021)

0.2.11(May 27, 2021)

0.2.10(May 27, 2021)

0.2.8(May 17, 2021)

0.2.7(May 17, 2021)

0.2.6(May 16, 2021)

0.2.5(May 16, 2021)

0.2.4(May 16, 2021)

0.2.3(May 16, 2021)

0.2.2(May 15, 2021)