Incremental Mining of Frequent Serial Episodes Considering Multiple
Occurrence
- URL: http://arxiv.org/abs/2201.11650v1
- Date: Thu, 27 Jan 2022 17:10:16 GMT
- Title: Incremental Mining of Frequent Serial Episodes Considering Multiple
Occurrence
- Authors: Thomas Guyet, Wenbin Zhang and Albert Bifet
- Abstract summary: One of the fundamental research directions is to mine sequential patterns over data streams.
The pattern over a window of itemsets stream and their multiple occurrences provides additional capability to recognize the essential characteristics of the patterns.
We propose a corresponding efficient sequential miner with novel strategies to prune search space efficiently.
- Score: 11.387440344044315
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: The need to analyze information from streams arises in a variety of
applications. One of the fundamental research directions is to mine sequential
patterns over data streams. Current studies mine series of items based on the
existence of the pattern in transactions but pay no attention to the series of
itemsets and their multiple occurrences. The pattern over a window of itemsets
stream and their multiple occurrences, however, provides additional capability
to recognize the essential characteristics of the patterns and the
inter-relationships among them that are unidentifiable by the existing items
and existence based studies. In this paper, we study such a new sequential
pattern mining problem and propose a corresponding efficient sequential miner
with novel strategies to prune search space efficiently. Experiments on both
real and synthetic data show the utility of our approach.
Related papers
- RPS: A Generic Reservoir Patterns Sampler [1.09784964592609]
We introduce an approach that harnesses a weighted reservoir to facilitate direct pattern sampling from streaming batch data.
We present a generic algorithm capable of addressing temporal biases and handling various pattern types, including sequential, weighted, and unweighted itemsets.
arXiv Detail & Related papers (2024-10-31T16:25:21Z) - SEMINAR: Search Enhanced Multi-modal Interest Network and Approximate Retrieval for Lifelong Sequential Recommendation [16.370075234443245]
We propose a unified lifelong multi-modal sequence model called SEMINAR-Search Enhanced Multi-Modal Interest Network and Approximate Retrieval.
Specifically, a network called Pretraining Search Unit learns the lifelong sequences of multi-modal query-item pairs in a pretraining-finetuning manner.
To accelerate the online retrieval speed of multi-modal embedding, we propose a multi-modal codebook-based product quantization strategy.
arXiv Detail & Related papers (2024-07-15T13:33:30Z) - State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era [59.279784235147254]
This survey provides an in-depth summary of the latest approaches that are based on recurrent models for sequential data processing.
The emerging picture suggests that there is room for thinking of novel routes, constituted by learning algorithms which depart from the standard Backpropagation Through Time.
arXiv Detail & Related papers (2024-06-13T12:51:22Z) - Causal Feature Selection via Transfer Entropy [59.999594949050596]
Causal discovery aims to identify causal relationships between features with observational data.
We introduce a new causal feature selection approach that relies on the forward and backward feature selection procedures.
We provide theoretical guarantees on the regression and classification errors for both the exact and the finite-sample cases.
arXiv Detail & Related papers (2023-10-17T08:04:45Z) - Mutual Exclusivity Training and Primitive Augmentation to Induce
Compositionality [84.94877848357896]
Recent datasets expose the lack of the systematic generalization ability in standard sequence-to-sequence models.
We analyze this behavior of seq2seq models and identify two contributing factors: a lack of mutual exclusivity bias and the tendency to memorize whole examples.
We show substantial empirical improvements using standard sequence-to-sequence models on two widely-used compositionality datasets.
arXiv Detail & Related papers (2022-11-28T17:36:41Z) - Diversifying Design of Nucleic Acid Aptamers Using Unsupervised Machine
Learning [54.247560894146105]
Inverse design of short single-stranded RNA and DNA sequences (aptamers) is the task of finding sequences that satisfy a set of desired criteria.
We propose to use an unsupervised machine learning model known as the Potts model to discover new, useful sequences with controllable sequence diversity.
arXiv Detail & Related papers (2022-08-10T13:30:58Z) - TaSPM: Targeted Sequential Pattern Mining [53.234101208024335]
We propose a generic framework namely TaSPM, based on the fast CM-SPAM algorithm.
We also propose several pruning strategies to reduce meaningless operations in mining processes.
Experiments show that the novel targeted mining algorithm TaSPM can achieve faster running time and less memory consumption.
arXiv Detail & Related papers (2022-02-26T17:49:47Z) - OPP-Miner: Order-preserving sequential pattern mining [26.997138010841347]
This paper proposes an Order-Preserving sequential Pattern (OPP) mining method, which represents patterns based on the order relationships of the time series data.
Experiments validate that OPP-Miner is not only efficient and scalable but can also discover similar sub-sequences in time series.
arXiv Detail & Related papers (2022-01-09T11:06:26Z) - Novel Features for Time Series Analysis: A Complex Networks Approach [62.997667081978825]
Time series data are ubiquitous in several domains as climate, economics and health care.
Recent conceptual approach relies on time series mapping to complex networks.
Network analysis can be used to characterize different types of time series.
arXiv Detail & Related papers (2021-10-11T13:46:28Z) - Extracting Seasonal Gradual Patterns from Temporal Sequence Data Using
Periodic Patterns Mining [0.0]
Seasonal gradual patterns capture co-variation of complex attributes in the form of " when X increases/decreases, Y increases/decreases"
No method has been proposed to extract gradual patterns that regularly appear at identical time intervals in many sequences of temporal data.
We propose an approach for their extraction based on mining periodic frequent patterns common to multiple sequences.
arXiv Detail & Related papers (2020-10-20T14:03:37Z) - Multi-source Data Mining for e-Learning [3.8673630752805432]
Pattern mining involves extracting interesting frequent patterns from data.
With the increase in the amount of data, multi-source and heterogeneous data has become an emerging challenge in this domain.
This challenge is the main focus of our work where we propose to mine multi-source data in order to extract interesting frequent patterns.
arXiv Detail & Related papers (2020-09-17T15:39:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.