ASTRIDE: Adaptive Symbolization for Time Series Databases
- URL: http://arxiv.org/abs/2302.04097v1
- Date: Wed, 8 Feb 2023 14:46:24 GMT
- Title: ASTRIDE: Adaptive Symbolization for Time Series Databases
- Authors: Sylvain W. Combettes, Charles Truong, and Laurent Oudre
- Abstract summary: We introduce ASTRIDE, a novel symbolic representation of time series, along with its accelerated variant FASTRIDE (Fast ASTRIDE)
Unlike most symbolization procedures, ASTRIDE is adaptive during both the segmentation step by performing change-point detection and the quantization step by using quantiles.
We demonstrate the performance of the ASTRIDE and FASTRIDE representations compared to SAX, 1d-SAX, SFA, and ABBA on reconstruction and, when applicable, on classification tasks.
- Score: 6.8820425565516095
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce ASTRIDE (Adaptive Symbolization for Time seRIes DatabasEs), a
novel symbolic representation of time series, along with its accelerated
variant FASTRIDE (Fast ASTRIDE). Unlike most symbolization procedures, ASTRIDE
is adaptive during both the segmentation step by performing change-point
detection and the quantization step by using quantiles. Instead of proceeding
signal by signal, ASTRIDE builds a dictionary of symbols that is common to all
signals in a data set. We also introduce D-GED (Dynamic General Edit Distance),
a novel similarity measure on symbolic representations based on the general
edit distance. We demonstrate the performance of the ASTRIDE and FASTRIDE
representations compared to SAX (Symbolic Aggregate approXimation), 1d-SAX, SFA
(Symbolic Fourier Approximation), and ABBA (Adaptive Brownian Bridge-based
Aggregation) on reconstruction and, when applicable, on classification tasks.
These algorithms are evaluated on 86 univariate equal-size data sets from the
UCR Time Series Classification Archive. An open source GitHub repository called
astride is made available to reproduce all the experiments in Python.
Related papers
- Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers [55.475142494272724]
Time series prediction is crucial for understanding and forecasting complex dynamics in various domains.
We introduce GridTST, a model that combines the benefits of two approaches using innovative multi-directional attentions.
The model consistently delivers state-of-the-art performance across various real-world datasets.
arXiv Detail & Related papers (2024-05-22T16:41:21Z) - Quick Adaptive Ternary Segmentation: An Efficient Decoding Procedure For
Hidden Markov Models [70.26374282390401]
Decoding the original signal (i.e., hidden chain) from the noisy observations is one of the main goals in nearly all HMM based data analyses.
We present Quick Adaptive Ternary (QATS), a divide-and-conquer procedure which decodes the hidden sequence in polylogarithmic computational complexity.
arXiv Detail & Related papers (2023-05-29T19:37:48Z) - HyperTime: Implicit Neural Representation for Time Series [131.57172578210256]
Implicit neural representations (INRs) have recently emerged as a powerful tool that provides an accurate and resolution-independent encoding of data.
In this paper, we analyze the representation of time series using INRs, comparing different activation functions in terms of reconstruction accuracy and training convergence speed.
We propose a hypernetwork architecture that leverages INRs to learn a compressed latent representation of an entire time series dataset.
arXiv Detail & Related papers (2022-08-11T14:05:51Z) - Towards Symbolic Time Series Representation Improved by Kernel Density
Estimators [0.0]
This paper builds up on the popular mapping technique Symbolic Aggregate approXimation algorithm (SAX)
It is extensively utilized in sequence classification, pattern mining, anomaly detection, time series indexing and other data mining tasks.
We describe here our approach, including evaluation on commonly employed tasks such as time series reconstruction error and Euclidean distance lower bounding with promising improvements over SAX.
arXiv Detail & Related papers (2022-05-25T13:16:18Z) - An efficient aggregation method for the symbolic representation of
temporal data [0.0]
We present a new variant of the adaptive Brownian bridge-based aggregation (ABBA) method, called fABBA.
This variant utilizes a new aggregation approach tailored to the piecewise representation of time series.
In contrast to the original method, the new approach does not require the number of time series symbols to be specified in advance.
arXiv Detail & Related papers (2022-01-14T22:51:24Z) - Imputing Missing Observations with Time Sliced Synthetic Minority
Oversampling Technique [0.3973560285628012]
We present a simple yet novel time series imputation technique with the goal of constructing an irregular time series that is uniform across every sample in a data set.
We fix a grid defined by the midpoints of non-overlapping bins (dubbed "slices") of observation times and ensure that each sample has values for all of the features at that given time.
This allows one to both impute fully missing observations to allow uniform time series classification across the entire data and, in special cases, to impute individually missing features.
arXiv Detail & Related papers (2022-01-14T19:23:24Z) - MrSQM: Fast Time Series Classification with Symbolic Representations [11.853438514668207]
MrSQM uses multiple symbolic representations and efficient sequence mining to extract important time series features.
We study four feature selection approaches on symbolic sequences, ranging from fully supervised, to unsupervised and hybrids.
Our experiments on 112 datasets of the UEA/UCR benchmark demonstrate that MrSQM can quickly extract useful features.
arXiv Detail & Related papers (2021-09-02T15:54:46Z) - Mitigating Generation Shifts for Generalized Zero-Shot Learning [52.98182124310114]
Generalized Zero-Shot Learning (GZSL) is the task of leveraging semantic information (e.g., attributes) to recognize the seen and unseen samples, where unseen classes are not observable during training.
We propose a novel Generation Shifts Mitigating Flow framework for learning unseen data synthesis efficiently and effectively.
Experimental results demonstrate that GSMFlow achieves state-of-the-art recognition performance in both conventional and generalized zero-shot settings.
arXiv Detail & Related papers (2021-07-07T11:43:59Z) - High-performance symbolic-numerics via multiple dispatch [52.77024349608834]
Symbolics.jl is an extendable symbolic system which uses dynamic multiple dispatch to change behavior depending on the domain needs.
We show that by formalizing a generic API on actions independent of implementation, we can retroactively add optimized data structures to our system.
We demonstrate the ability to swap between classical term-rewriting simplifiers and e-graph-based term-rewriting simplifiers.
arXiv Detail & Related papers (2021-05-09T14:22:43Z) - Spatial-spectral Hyperspectral Image Classification via Multiple Random
Anchor Graphs Ensemble Learning [88.60285937702304]
This paper proposes a novel spatial-spectral HSI classification method via multiple random anchor graphs ensemble learning (RAGE)
Firstly, the local binary pattern is adopted to extract the more descriptive features on each selected band, which preserves local structures and subtle changes of a region.
Secondly, the adaptive neighbors assignment is introduced in the construction of anchor graph, to reduce the computational complexity.
arXiv Detail & Related papers (2021-03-25T09:31:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.