-
Hi, I was looking through the tutorial for mstump and either my understanding is incorrect or I believe there may be an error. Specifically, under Multidimensional Matrix Profiles, it is mentioned that,
Why are we choosing the two lowest values here? Is this just arbitrarily selecting the two most probable motifs? From the code below this, it seems like we are selecting along index 1 (row 2), meaning the second dimension right? And also, why must these two lowest values be a tie? An additional question: When choosing k for when we're interested in discords instead, is it as simple as setting |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 7 replies
-
@lukail Thank you for your question and welcome to the STUMPY community!
So, this goes back to the concept of a "global motif" (or global minima) in that the global minima must be a pair. In the case of a 1D time series, the two subsequences that are identified at the global minima MUST be the one-nearest neighbor of each other (i.e., for a global minima, if subsequence A is the nearest neighbor to subsequence B then subsequence B must also be the nearest neighbor to subsequence A). This is why the two lowest values must be a tie. For a global minima, no other subsequences should be closer in distance (otherwise it wouldn't be called a global motif). Thus, by choosing the two lowest values, we are essentially choosing the index location of the two subsequences represented by the global minima. Hopefully that makes sense but please feel free to ask for any clarification. You now have me thinking whether this "tying pair" also holds for more than one dimension?! Hmmmm. I will have to think about that and I may need to edit that part out.
Are you referring to this code The
And then
And then the additional indexing after sorting
So, in the plot proceeding this code, we plot the profiles for Let me know if that helps.
Well, in addition to setting
This section of the tutorial provides some guidance but let me know if that is helpful! |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
@lukail Thank you for your question and welcome to the STUMPY community!
So, this goes back to the concept of a "global motif" (or global minima) in that the global minima must be a pair. In the case of a 1D time series, the two subsequences that are identified at the global minima MUST be the one-nearest neighbor of each other (i.e., for a global minima, if subsequence A is the nearest neighbor to subsequence B then subsequence B must also be the nearest neighbor to subsequence A). This is why the two lowest values must be a tie. For a global minima, no other subsequences should be clos…