Replies: 8 comments
-
This is the algorithm I refer to: |
Beta Was this translation helpful? Give feedback.
-
Yes,
No, we do not have an mSTAMP implementation at this time. Do you have a particular use case in mind? Would you be interested in contributing a PR? |
Beta Was this translation helpful? Give feedback.
-
I will take a look into the code and see if I can manage that
I have a CSV file with CAN data (information and sensors from a car) measured for a 1.5 hour drive. The data is sampled with 200Hz and contains 10 time-series. For each time-series, there are about one million data points. |
Beta Was this translation helpful? Give feedback.
-
Perhaps you are fully aware of this already so feel free to ignore this comment but I want to reiterate (in case others are reading this and if the paper wasn't clear on the following point) that the multi-dimensional matrix profile is not the same as computing individual matrix profiles for each dimension of your time series and stacking them together.
Thank you for the context! For something of this size (10 time series x 1 million data points), it should take about a day (or less) to compute the multi-dimensional matrix profile on a 2-core machine. So, depending on the hardware that you have available to you, in the time between our comment exchanges, you could've computed the matrix profile. This is not meant to come across as a snarky comment but I often find myself overthinking things before trying things out and so it is a reminder that it may be beneficial to be pragmatic. Of course, I'm guessing that you will have more data to analyze after this? I'm curious as to why you necessarily need/want the anytime algorithm? This sort of implies that you don't want to necessarily compute the full multi-dimensional matrix profile and you only need an approximate matrix profile? If you do want the full matrix profile file, then the current STUMPY |
Beta Was this translation helpful? Give feedback.
-
I want to use the anytime algorithm because I expect that it will be much faster than the ordered algorithm. Why is the STUMPY mstump (ordered alogorithm) faster than an anytime algorithm ? |
Beta Was this translation helpful? Give feedback.
-
Good question! To make sure that we are on the same page, let's set aside STUMPY When computing the full multi-dimensional matrix profile, Conversely, the Now, the main benefit with an anytime In our development of STUMPY, we had to decide whether it was more important for users to compute the full multi-dimensional matrix profile (in which case, we should provide the fastest algorithm that is Have you already tried Let me know if that helps or if you have any further questions. |
Beta Was this translation helpful? Give feedback.
-
Many thanks for a large amount of new and helpful information. For my task of grouping the motifs hierarchically into classes, I think read the MPdist paper and will follow your ideas from your comment #269 (comment). |
Beta Was this translation helpful? Give feedback.
-
@sim-san I'm closing this for now but feel free to re-open (or start a new issue) if you have more questions |
Beta Was this translation helpful? Give feedback.
-
If I understand this correctly, the mstump function uses the algorithm mSTOMP from [1].
Is it planned to implement the algorithm mSTAMP with the anytime property as well ?
[1] https://www.cs.ucr.edu/~eamonn/Motif_Discovery_ICDM.pdf
Beta Was this translation helpful? Give feedback.
All reactions