Sitemap
This is not a Monad tutorial

Writings, reviews and interviews about programming languages, operating systems, network protocols, artificial intelligence and machine learning

Follow publication

Stumpy: unleashing the power of the matrix profile for time series analysis

14 min readNov 2, 2020

--

Source: Stumpy documentation

What is STUMPY? What are the goals of the project?

What kind of time series analysis can be done with Stumpy? In what fields do you think it will help the most?

What are the benefits of computing the matrix profile in the context of analyzing a time series? What are the advantages over other methods?

What is the general criteria when choosing a window size? Is there some indicator to look up when analysing a time series?

What is semantic segmentation in the context of time series? What were the problems in the past with this method and how do you solve them?

How does the sampling rate affect the analysis of a time series? How often are important patterns hidden because of a bad sampling method?

Source: Stumpy documentation

STUMPED is the distributed version of STUMP and it is implemented using Dask. Why have you chosen Dask over other solutions to implement STUMPED?

Was GPU compatibility challenging to integrate in the project?

Considering you have chosen Numba for optimizing and parallelizing computation, have you thought about using Julia in the future, which has built-in features for this tasks?

How do you justify the comparison between the benchmark using 256-CPUs (STUMPED.256) against the one using 16 GPUs (GPU-STUMP.DGX2), especially economically speaking?

In the paper presenting the STOMP algorithms, an implementation in a seismologic dataset is shown, working with a really huge amount of data and analysing it within days. How near are we from real-time anomaly detection systems that analyse datasets as large at that scale?

How do you think STUMPY will evolve? Do you have in mind new features to implement in the near future?

Are there any books you recommend reading on the topic?

Where can readers find you and where can they learn more about STUMPY?

--

--

This is not a Monad tutorial
This is not a Monad tutorial

Published in This is not a Monad tutorial

Writings, reviews and interviews about programming languages, operating systems, network protocols, artificial intelligence and machine learning

Federico Carrone
Federico Carrone

Written by Federico Carrone

A happy member of The Erlang, Rust/ML and Lisp Evangelism Strikeforce. Network Protocol’s RFC fanatic. Big Data and Machine Learning

No responses yet