A fast hierarchical dimensionality reduction algorithm.

Last update: Dec 12, 2022

Related tags

Overview

h-NNE: Hierarchical Nearest Neighbor Embedding

A fast hierarchical dimensionality reduction algorithm.

h-NNE is a general purpose dimensionality reduction algorithm such as t-SNE and UMAP. It stands out for its speed, simplicity and the fact that it provides a hierarchy of clusterings as part of its projection process. The algorithm is inspired by the FINCH clustering algorithm. For more information on the structure of the algorithm, please look at our corresponding paper in ArXiv:

M. Saquib Sarfraz*, Marios Koulakis*, Constantin Seibold, Rainer Stiefelhagen. Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction. CVPR 2022.

More details are available in the project documentation.

Installation

The project is available in PyPI. To install run:

pip install hnne

How to use h-NNE

The HNNE class implements the common methods of the sklearn interface.

Simple projection example

import numpy as np
from hnne import HNNE

data = np.random.random(size=(1000, 256))

hnne = HNNE(dim=2)
projection = hnne.fit_transform(data)

Projecting on new points

hnne = HNNE()
projection = hnne.fit_transform(data)

new_data_projection = hnne.transform(new_data)

Demos

The following demo notebooks are available:

Citation

If you make use of this project in your work, it would be appreciated if you cite the hnne paper:

@article{hnne,
  title={Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction},
  author={M. Saquib Sarfraz, Marios Koulakis, Constantin Seibold, Rainer Stiefelhagen},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2022}
}

If you make use of the clustering properties of the algorithm please also cite:

 @inproceedings{finch,
   author    = {M. Saquib Sarfraz and Vivek Sharma and Rainer Stiefelhagen},
   title     = {Efficient Parameter-free Clustering Using First Neighbor Relations},
   booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
   pages = {8934--8943},
   year  = {2019}
}

A fast hierarchical dimensionality reduction algorithm.

Related tags

Overview

h-NNE: Hierarchical Nearest Neighbor Embedding

Installation

How to use h-NNE

Simple projection example

Projecting on new points

Demos

Citation

Owner

Marios Koulakis

The guide to tackle with the Text Summarization

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Korean extractive summarization. 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드

DensePhrases provides answers to your natural language questions from the entire Wikipedia in real-time

An open source library for deep learning end-to-end dialog systems and chatbots.

An open source framework for seq2seq models in PyTorch.

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Code and checkpoints for training the transformer-based Table QA models introduced in the paper TAPAS: Weakly Supervised Table Parsing via Pre-training.

Open-World Entity Segmentation

Nystromformer: A Nystrom-based Algorithm for Approximating Self-Attention

Levenshtein and Hamming distance computation

Auto_code_complete is a auto word-completetion program which allows you to customize it on your needs

CCF BDCI 2020 房产行业聊天问答匹配赛道 A榜47/2985

Example code for "Real-World Natural Language Processing"

Wind Speed Prediction using LSTMs in PyTorch

Translate U is capable of translating the text present in an image from one language to the other.

The following links explain a bit the idea of semantic search and how search mechanisms work by doing retrieve and rerank

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Natural language computational chemistry command line interface.

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.