Python module for machine learning time series:

Last update: Dec 29, 2022

Overview

seglearn

Seglearn is a python package for machine learning time series or sequences. It provides an integrated pipeline for segmentation, feature extraction, feature processing, and final estimator. Seglearn provides a flexible approach to multivariate time series and related contextual (meta) data for classification, regression, and forecasting problems. Support and examples are provided for learning time series with classical machine learning and deep learning models. It is compatible with scikit-learn.

Documentation

Installation documentation, API documentation, and examples can be found on the documentation.

Dependencies

seglearn is tested to work under Python 3.5. The dependency requirements are based on the last scikit-learn release:

scipy(>=0.17.0)
numpy(>=1.11.0)
scikit-learn(>=0.21.3)

Additionally, to run the examples, you need:

matplotlib(>=2.0.0)
keras (>=2.1.4) for the neural network examples
pandas

In order to run the test cases, you need:

pytest

The neural network examples were tested on keras using the tensorflow-gpu backend, which is recommended.

Installation

seglearn-learn is currently available on the PyPi's repository and you can install it via pip:

pip install -U seglearn

or if you use python3:

pip3 install -U seglearn

If you prefer, you can clone it and run the setup.py file. Use the following commands to get a copy from GitHub and install all dependencies:

git clone https://github.com/dmbee/seglearn.git
cd seglearn
pip install .

Or install using pip and GitHub:

pip install -U git+https://github.com/dmbee/seglearn.git

Testing

After installation, you can use pytest to run the test suite from seglearn's root directory:

pytest

Change Log

Version history can be viewed in the Change Log.

Development

The development of this scikit-learn-contrib is in line with the one of the scikit-learn community. Therefore, you can refer to their Development Guide.

Please submit new pull requests on the dev branch with unit tests and an example to demonstrate any new functionality / api changes.

Citing seglearn

If you use seglearn in a scientific publication, we would appreciate citations to the following paper:

@article{arXiv:1803.08118,
author  = {David Burns, Cari Whyne},
title   = {Seglearn: A Python Package for Learning Sequences and Time Series},
journal = {arXiv},
year    = {2018},
url     = {https://arxiv.org/abs/1803.08118}
}

If you use the seglearn test data in a scientific publication, we would appreciate citations to the following paper:

@article{arXiv:1802.01489,
author  = {David Burns, Nathan Leung, Michael Hardisty, Cari Whyne, Patrick Henry, Stewart McLachlin},
title   = {Shoulder Physiotherapy Exercise Recognition: Machine Learning the Inertial Signals from a Smartwatch},
journal = {arXiv},
year    = {2018},
url     = {https://arxiv.org/abs/1802.01489}
}

Python module for machine learning time series:

Related tags

Overview

seglearn

Documentation

Dependencies

Installation

Testing

Change Log

Development

Citing seglearn

Owner

David Burns

A Python package for time series classification

A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

A toolkit for geo ML data processing and model evaluation (fork of solaris)

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data

OptaPy is an AI constraint solver for Python to optimize planning and scheduling problems.

A visual dataflow programming language for sklearn

Bottleneck a collection of fast, NaN-aware NumPy array functions written in C.

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows.

BentoML is a flexible, high-performance framework for serving, managing, and deploying machine learning models.

Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores

This is the code repository for Interpretable Machine Learning with Python, published by Packt.

slim-python is a package to learn customized scoring systems for decision-making problems.

Simulation of early COVID-19 using SIR model and variants (SEIR ...).

Timeseries analysis for neuroscience data

PyHarmonize: Adding harmony lines to recorded melodies in Python

A Lucid Framework for Transparent and Interpretable Machine Learning Models.

A simple python program that draws a tree for incrementing values using the Collatz Conjecture.

Tools for mathematical optimization region

A Python toolkit for rule-based/unsupervised anomaly detection in time series

Can a machine learning project be implemented to estimate the salaries of baseball players whose salary information and career statistics for 1986 are shared?