Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch

Last update: Dec 23, 2022

Overview

Neural Distance Embeddings for Biological Sequences

Official implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch. NeuroSEED is a novel framework to embed biological sequences in geometric vector spaces. Preprint will we published soon.

Overview

The repository is organised in four main folders one for each of the tasks analysed. Each of these contain scripts and models used for the task as well as instructions on how to run them and the tuned hyperparameters found.

edit_distance for the edit distance approximation task
closest_string for the closest string retrieval task
hierarchical_clustering for the hierarchical clustering task, further divided in relaxed and unsupervised for the two approaches explored
multiple_alignment for the multiple sequence alignment task, further divided in guide_tree and steiner_string
util contains a series of utility routines shared between all the tasks
tests contains a wide range of tests for the various components of the repository

Installation

Create a virtual (or conda) environment and install the dependencies:

python3 -m venv neuroseed
source neuroseed/bin/activate
pip install -r requirements.txt

Then install the mst and unionfind packages used for the hierarchical clustering:

cd hierarchical_clustering/relaxed/mst; python setup.py build_ext --inplace; cd ../../..
cd hierarchical_clustering/relaxed/unionfind; python setup.py build_ext --inplace; cd ../../..

License

MIT

Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch

Related tags

Overview

Neural Distance Embeddings for Biological Sequences

Overview

Installation

License

Owner

Gabriele Corso

Source code for our paper "Improving Empathetic Response Generation by Recognizing Emotion Cause in Conversations"

A library for researching neural networks compression and acceleration methods.

Global Filter Networks for Image Classification

Chainer implementation of recent GAN variants

The Multi-Mission Maximum Likelihood framework (3ML)

Neural models of common sense. 🤖

Old Photo Restoration (Official PyTorch Implementation)

PyTorch code for our ECCV 2018 paper "Image Super-Resolution Using Very Deep Residual Channel Attention Networks"

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

SLAMP: Stochastic Latent Appearance and Motion Prediction

MAME is a multi-purpose emulation framework.

Uses Open AI Gym environment to create autonomous cryptocurrency bot to trade cryptocurrencies.

CVPR 2020 oral paper: Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax.

Face Depixelizer based on "PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models" repository.

An implementation of shampoo

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Implementation of the ivis algorithm as described in the paper Structure-preserving visualisation of high dimensional single-cell datasets.

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)

A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms.