Multiscale Dubuc: A New Similarity Measure for Time Series
- URL: http://arxiv.org/abs/2411.10418v1
- Date: Fri, 15 Nov 2024 18:38:18 GMT
- Title: Multiscale Dubuc: A New Similarity Measure for Time Series
- Authors: Mahsa Khazaei, Azim Ahmadzadeh, Krishna Rukmini Puthucode,
- Abstract summary: We introduce the Multiscale Dubuc Distance measure and prove that it is a metric.
We use 95 datasets from the UCR Time Series Classification Archive to compare MDD's performance with EuD, LCSS, and DTW.
Our experiments show that MDD's overall success, without any case-specific customization, is comparable to DTW with optimized window sizes per dataset.
- Score: 1.024113475677323
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Quantifying similarities between time series in a meaningful way remains a challenge in time series analysis, despite many advances in the field. Most real-world solutions still rely on a few popular measures, such as Euclidean Distance (EuD), Longest Common Subsequence (LCSS), and Dynamic Time Warping (DTW). The strengths and weaknesses of these measures have been studied extensively, and incremental improvements have been proposed. In this study, however, we present a different similarity measure that fuses the notion of Dubuc's variation from fractal analysis with the Intersection-over-Union (IoU) measure which is widely used in object recognition (also known as the Jaccard Index). In this proof-of-concept paper, we introduce the Multiscale Dubuc Distance (MDD) measure and prove that it is a metric, possessing desirable properties such as the triangle inequality. We use 95 datasets from the UCR Time Series Classification Archive to compare MDD's performance with EuD, LCSS, and DTW. Our experiments show that MDD's overall success, without any case-specific customization, is comparable to DTW with optimized window sizes per dataset. We also highlight several datasets where MDD's performance improves significantly when its single parameter is customized. This customization serves as a powerful tool for gauging MDD's sensitivity to noise. Lastly, we show that MDD's running time is linear in the length of the time series, which is crucial for real-world applications involving very large datasets.
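The abstract compares MDD against DTW with a window size tuned per dataset. As a rough illustration of what that window controls, here is a minimal sketch of band-constrained DTW (a Sakoe-Chiba band). It is a generic textbook formulation, not the paper's code: the function name `dtw_distance`, the squared-difference local cost, and the final square root are assumptions chosen for this example.

```python
import math


def dtw_distance(a, b, window=None):
    """Band-constrained DTW between two numeric sequences (illustrative sketch).

    `window` is the maximum allowed index offset |i - j| between aligned
    points; None means unconstrained. The local cost (squared difference)
    and the final square root are assumptions, not taken from the paper.
    """
    n, m = len(a), len(b)
    # The band must be at least |n - m| wide, or no warping path exists.
    w = max(n, m) if window is None else max(window, abs(n - m))
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        # Only cells inside the band around the diagonal are filled in.
        for j in range(max(1, i - w), min(m, i + w) + 1):
            d = (a[i - 1] - b[j - 1]) ** 2
            cost[i][j] = d + min(cost[i - 1][j],      # repeat b[j-1]
                                 cost[i][j - 1],      # repeat a[i-1]
                                 cost[i - 1][j - 1])  # advance both
    return math.sqrt(cost[n][m])


x = [0, 1, 0, 0]
y = [0, 0, 1, 0]  # same bump, shifted by one step
d_rigid = dtw_distance(x, y, window=0)    # no warping: Euclidean distance
d_banded = dtw_distance(x, y, window=1)   # one step of warping allowed
d_free = dtw_distance(x, y)               # unconstrained
```

With `window=0` on equal-length inputs the alignment is forced onto the diagonal and the result reduces to the Euclidean distance, while a wider band lets the shifted bump align. This is why the window is usually tuned per dataset.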
Related papers
- DTW+S: Shape-based Comparison of Time-series with Ordered Local Trend [4.6380010540165655]
We develop a measure that looks for similar trends occurring around similar times and is easily interpretable.
We propose a novel measure, DTW+S, which creates an interpretable "closeness-preserving" matrix representation of the time-series.
We show that DTW+S is the only measure able to produce good clustering compared to the baselines.
arXiv Detail & Related papers (2023-09-07T09:18:12Z) - Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement [51.55157852647306]
Time series forecasting has been a widely explored task of great importance in many applications.
It is common that real-world time series data are recorded in a short time period, which results in a big gap between the deep model and the limited and noisy time series.
We propose to address the time series forecasting problem with generative modeling and propose a bidirectional variational auto-encoder equipped with diffusion, denoise, and disentanglement.
arXiv Detail & Related papers (2023-01-08T12:20:46Z) - Matrix Profile XXVII: A Novel Distance Measure for Comparing Long Time Series [18.205595410817327]
We introduce PRCIS, which stands for Pattern Representation Comparison in Series.
PRCIS is a distance measure for long time series, which exploits recent progress in our ability to summarize time series with dictionaries.
arXiv Detail & Related papers (2022-12-09T23:02:23Z) - TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis [80.56913334060404]
Time series analysis is of immense importance in applications, such as weather forecasting, anomaly detection, and action recognition.
Previous methods attempt to accomplish this directly from the 1D time series.
We ravel out the complex temporal variations into the multiple intraperiod- and interperiod-variations.
arXiv Detail & Related papers (2022-10-05T12:19:51Z) - Triformer: Triangular, Variable-Specific Attentions for Long Sequence Multivariate Time Series Forecasting--Full Version [50.43914511877446]
We propose a triangular, variable-specific attention to ensure high efficiency and accuracy.
We show that Triformer outperforms state-of-the-art methods w.r.t. both accuracy and efficiency.
arXiv Detail & Related papers (2022-04-28T20:41:49Z) - DTWSSE: Data Augmentation with a Siamese Encoder for Time Series [8.019203034348083]
We propose a DTW-based synthetic minority oversampling technique using a Siamese encoder, named DTWSSE.
In order to reasonably measure the distance of the time series, DTW, which has been verified to be an effective method, is employed as the distance metric.
The encoder is a Neural Network for mapping the time series data from the DTW hidden space to the Euclidean deep feature space, and the decoder is used to map the deep feature space back to the DTW hidden space.
arXiv Detail & Related papers (2021-08-23T01:46:24Z) - Elastic Similarity Measures for Multivariate Time Series Classification [4.5669999076671655]
Elastic similarity measures are a class of similarity measures specifically designed to work with time series data.
Elastic similarity measures are widely used in machine learning tasks such as classification, clustering and outlier detection.
arXiv Detail & Related papers (2021-02-20T02:24:33Z) - Exploring Data Augmentation for Multi-Modality 3D Object Detection [82.9988604088494]
It is counter-intuitive that multi-modality methods based on point cloud and images perform only marginally better or sometimes worse than approaches that solely use point cloud.
We propose a pipeline, named transformation flow, to bridge the gap between single and multi-modality data augmentation with transformation reversing and replaying.
Our method also wins the best PKL award in the 3rd nuScenes detection challenge.
arXiv Detail & Related papers (2020-12-23T15:23:16Z) - A Case-Study on the Impact of Dynamic Time Warping in Time Series Regression [2.639737913330821]
We show that Dynamic Time Warping (DTW) is effective in improving accuracy on a regression task when only a single wavelength is considered.
When combined with k-Nearest Neighbour, DTW has the added advantage that it can reveal similarities and differences between samples at the level of the time-series.
However, in the problem we consider here, data is available across a spectrum of wavelengths.
arXiv Detail & Related papers (2020-10-11T15:21:21Z) - Benchmarking Multivariate Time Series Classification Algorithms [69.12151492736524]
Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real-valued attributes.
Over recent years, a new set of TSC algorithms has been developed that significantly improves on the previous state of the art.
We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag of words approaches.
arXiv Detail & Related papers (2020-07-26T15:56:40Z) - Aligning Time Series on Incomparable Spaces [83.8261699057419]
We propose Gromov dynamic time warping (GDTW), a distance between time series on potentially incomparable spaces.
We demonstrate its effectiveness at aligning, combining and comparing time series living on incomparable spaces.
arXiv Detail & Related papers (2020-06-22T22:19:28Z)