OPP-Miner: Order-preserving sequential pattern mining
- URL: http://arxiv.org/abs/2202.03140v2
- Date: Wed, 9 Feb 2022 04:01:05 GMT
- Title: OPP-Miner: Order-preserving sequential pattern mining
- Authors: Youxi Wu, Qian Hu, Yan Li, Lei Guo, Xingquan Zhu, Xindong Wu
- Abstract summary: This paper proposes an Order-Preserving sequential Pattern (OPP) mining method, which represents patterns based on the order relationships of the time series data.
Experiments validate that OPP-Miner is not only efficient and scalable but can also discover similar sub-sequences in time series.
- Score: 26.997138010841347
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A time series is a collection of measurements in chronological order.
Discovering patterns from time series is useful in many domains, such as stock
analysis, disease detection, and weather forecast. To discover patterns,
existing methods often convert time series data into another form, such as
nominal/symbolic format, to reduce dimensionality, which inevitably deviates
the data values. Moreover, existing methods mainly neglect the order
relationships between time series values. To tackle these issues, inspired by
order-preserving matching, this paper proposes an Order-Preserving sequential
Pattern (OPP) mining method, which represents patterns based on the order
relationships of the time series data. An inherent advantage of such
representation is that the trend of a time series can be represented by the
relative order of the values underneath the time series data. To obtain
frequent trends in time series, we propose the OPP-Miner algorithm to mine
patterns with the same trend (sub-sequences with the same relative order).
OPP-Miner employs the filtration and verification strategies to calculate the
support and uses pattern fusion strategy to generate candidate patterns. To
compress the result set, we also study finding the maximal OPPs. Experiments
validate that OPP-Miner is not only efficient and scalable but can also
discover similar sub-sequences in time series. In addition, case studies show
that our algorithms have high utility in analyzing the COVID-19 epidemic by
identifying critical trends and improve the clustering performance.
Related papers
- TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling [67.02157180089573]
Time series pre-training has recently garnered wide attention for its potential to reduce labeling expenses and benefit various downstream tasks.
This paper proposes TimeSiam as a simple but effective self-supervised pre-training framework for Time series based on Siamese networks.
arXiv Detail & Related papers (2024-02-04T13:10:51Z) - Graph Spatiotemporal Process for Multivariate Time Series Anomaly
Detection with Missing Values [67.76168547245237]
We introduce a novel framework called GST-Pro, which utilizes a graphtemporal process and anomaly scorer to detect anomalies.
Our experimental results show that the GST-Pro method can effectively detect anomalies in time series data and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2024-01-11T10:10:16Z) - Time-to-Pattern: Information-Theoretic Unsupervised Learning for
Scalable Time Series Summarization [7.294418916091012]
We introduce an approach to time series summarization called Time-to-Pattern (T2P)
T2P aims to find a set of diverse patterns that together encode the most salient information, following the notion of minimum description length.
Our synthetic and real-world experiments reveal that T2P discovers informative patterns, even in noisy and complex settings.
arXiv Detail & Related papers (2023-08-26T01:15:32Z) - A Pattern Discovery Approach to Multivariate Time Series Forecasting [27.130141538089152]
State-of-the-art deep learning methods fail to construct models for full time series because model complexity grows exponentially with time series length.
We propose a novel pattern discovery method that can automatically capture diverse and complex time series patterns.
We also propose a learnable correlation matrix, that enables the model to capture distinct correlations among multiple time series.
arXiv Detail & Related papers (2022-12-20T14:54:04Z) - TaSPM: Targeted Sequential Pattern Mining [53.234101208024335]
We propose a generic framework namely TaSPM, based on the fast CM-SPAM algorithm.
We also propose several pruning strategies to reduce meaningless operations in mining processes.
Experiments show that the novel targeted mining algorithm TaSPM can achieve faster running time and less memory consumption.
arXiv Detail & Related papers (2022-02-26T17:49:47Z) - Optimal Latent Space Forecasting for Large Collections of Short Time
Series Using Temporal Matrix Factorization [0.0]
It is a common practice to evaluate multiple methods and choose one of these methods or an ensemble for producing the best forecasts.
We propose a framework for forecasting short high-dimensional time series data by combining low-rank temporal matrix factorization and optimal model selection on latent time series.
arXiv Detail & Related papers (2021-12-15T11:39:21Z) - Cluster-and-Conquer: A Framework For Time-Series Forecasting [94.63501563413725]
We propose a three-stage framework for forecasting high-dimensional time-series data.
Our framework is highly general, allowing for any time-series forecasting and clustering method to be used in each step.
When instantiated with simple linear autoregressive models, we are able to achieve state-of-the-art results on several benchmark datasets.
arXiv Detail & Related papers (2021-10-26T20:41:19Z) - Novel Features for Time Series Analysis: A Complex Networks Approach [62.997667081978825]
Time series data are ubiquitous in several domains as climate, economics and health care.
Recent conceptual approach relies on time series mapping to complex networks.
Network analysis can be used to characterize different types of time series.
arXiv Detail & Related papers (2021-10-11T13:46:28Z) - Extracting Seasonal Gradual Patterns from Temporal Sequence Data Using
Periodic Patterns Mining [0.0]
Seasonal gradual patterns capture co-variation of complex attributes in the form of " when X increases/decreases, Y increases/decreases"
No method has been proposed to extract gradual patterns that regularly appear at identical time intervals in many sequences of temporal data.
We propose an approach for their extraction based on mining periodic frequent patterns common to multiple sequences.
arXiv Detail & Related papers (2020-10-20T14:03:37Z) - Predicting Temporal Sets with Deep Neural Networks [50.53727580527024]
We propose an integrated solution based on the deep neural networks for temporal sets prediction.
A unique perspective is to learn element relationship by constructing set-level co-occurrence graph.
We design an attention-based module to adaptively learn the temporal dependency of elements and sets.
arXiv Detail & Related papers (2020-06-20T03:29:02Z) - Data Curves Clustering Using Common Patterns Detection [0.0]
Analyzing and clustering time series, or in general any kind of curves, could be critical for several human activities.
New Curves Clustering Using Common Patterns (3CP) methodology is introduced.
arXiv Detail & Related papers (2020-01-05T18:36:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.