Abstract
The discovery of repeated structures in time series, known as motifs, is an important data mining task. Various techniques exist to mine motifs in either a database of time series or within one or two individual time series, either for a user-defined motif length or a range of lengths. However, mining frequent motifs of variable length in large time series databases remains an unsolved task that is computationally expensive. We propose FRM-Miner, an efficient algorithm for discovering more informative patterns in time series data, i.e., motifs of different
length that are non-overlapping, occur frequently and where the euclidean distance between the motif and its various occurrences is minimal. Unlike current state-of-the-art approaches, FRM-Miner can efficiently find variable motif lengths in large time series databases. Through extensive experimentation, we show desirable properties of FRM-Miner, including robustness to noise, which facilitates the discovery of motifs that remain undetected using state-of-the-art methods. Additionally, our method is highly scalable, taking only 2.95 hours to discover informative sets of motifs on all 128 time series data sets of the UCR Time Series Archive, where related state-of-the-art algorithms such as Ostinato require several days.
length that are non-overlapping, occur frequently and where the euclidean distance between the motif and its various occurrences is minimal. Unlike current state-of-the-art approaches, FRM-Miner can efficiently find variable motif lengths in large time series databases. Through extensive experimentation, we show desirable properties of FRM-Miner, including robustness to noise, which facilitates the discovery of motifs that remain undetected using state-of-the-art methods. Additionally, our method is highly scalable, taking only 2.95 hours to discover informative sets of motifs on all 128 time series data sets of the UCR Time Series Archive, where related state-of-the-art algorithms such as Ostinato require several days.
Original language | English |
---|---|
Title of host publication | Efficiently Mining Frequent Representative Motifs in Large Collections of Time Series |
Pages | 66-75 |
Number of pages | 10 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 IEEE International Conference on Big Data (BigData) - Sorrento, Italy Duration: 15 Dec 2023 → 18 Dec 2023 |
Conference
Conference | 2023 IEEE International Conference on Big Data (BigData) |
---|---|
Country/Territory | Italy |
City | Sorrento |
Period | 15/12/23 → 18/12/23 |