Efficiently Mining Frequent Representative Motifs in Large Collections of Time Series

Stijn Rotman, Boris Čule, Len Feremans

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

164 Downloads (Pure)

Abstract

The discovery of repeated structures in time series, known as motifs, is an important data mining task. Various techniques exist to mine motifs in either a database of time series or within one or two individual time series, either for a user-defined motif length or a range of lengths. However, mining frequent motifs of variable length in large time series databases remains an unsolved task that is computationally expensive. We propose FRM-Miner, an efficient algorithm for discovering more informative patterns in time series data, i.e., motifs of different
length that are non-overlapping, occur frequently and where the euclidean distance between the motif and its various occurrences is minimal. Unlike current state-of-the-art approaches, FRM-Miner can efficiently find variable motif lengths in large time series databases. Through extensive experimentation, we show desirable properties of FRM-Miner, including robustness to noise, which facilitates the discovery of motifs that remain undetected using state-of-the-art methods. Additionally, our method is highly scalable, taking only 2.95 hours to discover informative sets of motifs on all 128 time series data sets of the UCR Time Series Archive, where related state-of-the-art algorithms such as Ostinato require several days.
Original languageEnglish
Title of host publicationEfficiently Mining Frequent Representative Motifs in Large Collections of Time Series
Pages66-75
Number of pages10
DOIs
Publication statusPublished - 2023
Event2023 IEEE International Conference on Big Data (BigData) - Sorrento, Italy
Duration: 15 Dec 202318 Dec 2023

Conference

Conference2023 IEEE International Conference on Big Data (BigData)
Country/TerritoryItaly
CitySorrento
Period15/12/2318/12/23

Fingerprint

Dive into the research topics of 'Efficiently Mining Frequent Representative Motifs in Large Collections of Time Series'. Together they form a unique fingerprint.

Cite this