MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

Last update: Dec 31, 2022

Overview

Introduction

MASS allows you to search a time series for a subquery resulting in an array of distances. These array of distances enable you to identify similar or dissimilar subsequences compared to your query. At its core, MASS computes Euclidean distances under z-normalization in an efficient manner and is domain agnostic in nature. It is the fundamental algorithm that the matrix profile algorithm is built on top of.

mass-ts is a python 2 and 3 compatible library.

Free software: Apache Software License 2.0

Features

Original Author's Algorithms

MASS - the first implementation of MASS
MASS2 - the second implementation of MASS that is significantly faster. Typically this is the one you will use.
MASS3 - a piecewise version of MASS2 that can be tuned to your hardware. Generally this is used to search very large time series.
MASS_weighted - TODO

Library Specific Algorithms

MASS2_batch - a batch version of MASS2 that reduces overall memory usage, provides parallelization and enables you to find top K number of matches within the time series. The goal of using this implementation is for very large time series similarity search.
top_k_motifs - find the top K number of similar subsequences to your given query. It returns the starting index of the subsequence.
top_k_discords - find the top K number of dissimilar subsequences to your given query. It returns the starting index of the subsequence.
MASS2_gpu - a GPU implementation of MASS2 leveraging the Python library CuPy.

Installation

pip install mass-ts

GPU Support

Please follow the installation guide for CuPy. It covers what drivers and environmental dependencies are required. Once you are finished there, you can install GPU support for the algorithms.

pip install mass-ts[gpu]

Example Usage

A dedicated repository for practical examples can be found at the mass-ts-examples repository.

import numpy as np
import mass_ts as mts

ts = np.loadtxt('ts.txt')
query = np.loadtxt('query.txt')

# mass
distances = mts.mass(ts, query)

# mass2
distances = mts.mass2(ts, query)

# mass3
distances = mts.mass3(ts, query, 256)

# mass2_gpu
distances = mts.mass2_gpu(ts, query)

# mass2_batch
# start a multi-threaded batch job with all cpu cores and give me the top 5 matches.
# note that batch_size partitions your time series into a subsequence similarity search.
# even for large time series in single threaded mode, this is much more memory efficient than
# MASS2 on its own.
batch_size = 10000
top_matches = 5
n_jobs = -1
indices, distances = mts.mass2_batch(ts, query, batch_size, 
    top_matches=top_matches, n_jobs=n_jobs)

# find minimum distance
min_idx = np.argmin(distances)

# find top 4 motif starting indices
k = 4
exclusion_zone = 25
top_motifs = mts.top_k_motifs(distances, k, exclusion_zone)

# find top 4 discord starting indices
k = 4
exclusion_zone = 25
top_discords = mts.top_k_discords(distances, k, exclusion_zone)

Citations

Abdullah Mueen, Yan Zhu, Michael Yeh, Kaveh Kamgar, Krishnamurthy Viswanathan, Chetan Kumar Gupta and Eamonn Keogh (2015), The Fastest Similarity Search Algorithm for Time Series Subsequences under Euclidean Distance, URL: http://www.cs.unm.edu/~mueen/FastestSimilaritySearch.html

MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

Related tags

Overview

Introduction

Features

Installation

GPU Support

Example Usage

Citations

Owner

Matrix Profile Foundation

Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.

Bayes-Newton—A Gaussian process library in JAX, with a unifying view of approximate Bayesian inference as variants of Newton's algorithm.

TensorFlow 101: Introduction to Deep Learning for Python Within TensorFlow

CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.

Modelisation on galaxy evolution using PEGASE-HR

A generator of point clouds dataset for PyPipes.

TensorFlow (Python API) implementation of Neural Style

implement of SwiftNet:Real-time Video Object Segmentation

On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))

AAAI 2022: Stationary diffusion state neural estimation

Mengzi Pretrained Models

PyTorch implementation of DeepDream algorithm

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Code for Recurrent Mask Refinement for Few-Shot Medical Image Segmentation (ICCV 2021).

This is the code for HOI Transformer

TinyML Cookbook, published by Packt

Author Disambiguation using Knowledge Graph Embeddings with Literals

AI创造营：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition"

MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

Related tags

Overview

Introduction

Features

Installation

GPU Support

Example Usage

Citations

Owner

Matrix Profile Foundation

Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.

Bayes-Newton—A Gaussian process library in JAX, with a unifying view of approximate Bayesian inference as variants of Newton's algorithm.

TensorFlow 101: Introduction to Deep Learning for Python Within TensorFlow

CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.

Modelisation on galaxy evolution using PEGASE-HR

A generator of point clouds dataset for PyPipes.

TensorFlow (Python API) implementation of Neural Style

implement of SwiftNet:Real-time Video Object Segmentation

On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))

AAAI 2022: Stationary diffusion state neural estimation

Mengzi Pretrained Models

PyTorch implementation of DeepDream algorithm

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Code for Recurrent Mask Refinement for Few-Shot Medical Image Segmentation (ICCV 2021).

This is the code for HOI Transformer

TinyML Cookbook, published by Packt

Author Disambiguation using Knowledge Graph Embeddings with Literals

AI创造营 ：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition"

AI创造营：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人