Quantile-based fuzzy clustering of multivariate time series in the
frequency domain
- URL: http://arxiv.org/abs/2109.03728v1
- Date: Wed, 8 Sep 2021 15:38:33 GMT
- Title: Quantile-based fuzzy clustering of multivariate time series in the
frequency domain
- Authors: \'Angel L\'opez-Oriona, Jos\'e A. Vilar, Pierpaolo-D'Urso
- Abstract summary: fuzzy C-means and fuzzy C-medoids algorithms are proposed.
The performance of the proposed approach is evaluated in a broad simulation study.
Two specific applications involving air quality and financial databases illustrate the usefulness of our approach.
- Score: 2.610470075814367
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A novel procedure to perform fuzzy clustering of multivariate time series
generated from different dependence models is proposed. Different amounts of
dissimilarity between the generating models or changes on the dynamic
behaviours over time are some arguments justifying a fuzzy approach, where each
series is associated to all the clusters with specific membership levels. Our
procedure considers quantile-based cross-spectral features and consists of
three stages: (i) each element is characterized by a vector of proper estimates
of the quantile cross-spectral densities, (ii) principal component analysis is
carried out to capture the main differences reducing the effects of the noise,
and (iii) the squared Euclidean distance between the first retained principal
components is used to perform clustering through the standard fuzzy C-means and
fuzzy C-medoids algorithms. The performance of the proposed approach is
evaluated in a broad simulation study where several types of generating
processes are considered, including linear, nonlinear and dynamic conditional
correlation models. Assessment is done in two different ways: by directly
measuring the quality of the resulting fuzzy partition and by taking into
account the ability of the technique to determine the overlapping nature of
series located equidistant from well-defined clusters. The procedure is
compared with the few alternatives suggested in the literature, substantially
outperforming all of them whatever the underlying process and the evaluation
scheme. Two specific applications involving air quality and financial databases
illustrate the usefulness of our approach.
Related papers
- Interpetable Target-Feature Aggregation for Multi-Task Learning based on Bias-Variance Analysis [53.38518232934096]
Multi-task learning (MTL) is a powerful machine learning paradigm designed to leverage shared knowledge across tasks to improve generalization and performance.
We propose an MTL approach at the intersection between task clustering and feature transformation based on a two-phase iterative aggregation of targets and features.
In both phases, a key aspect is to preserve the interpretability of the reduced targets and features through the aggregation with the mean, which is motivated by applications to Earth science.
arXiv Detail & Related papers (2024-06-12T08:30:16Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - Fuzzy clustering of ordinal time series based on two novel distances
with economic applications [0.12891210250935145]
Two novel distances between ordinal time series are introduced and used to construct fuzzy clustering procedures.
The resulting clustering algorithms are computationally efficient and able to group series generated from similar processes.
Two specific applications involving economic time series illustrate the usefulness of the proposed approaches.
arXiv Detail & Related papers (2023-04-24T16:39:22Z) - Multi-View Clustering via Semi-non-negative Tensor Factorization [120.87318230985653]
We develop a novel multi-view clustering based on semi-non-negative tensor factorization (Semi-NTF)
Our model directly considers the between-view relationship and exploits the between-view complementary information.
In addition, we provide an optimization algorithm for the proposed method and prove mathematically that the algorithm always converges to the stationary KKT point.
arXiv Detail & Related papers (2023-03-29T14:54:19Z) - High-dimensional variable clustering based on maxima of a weakly dependent random process [1.1999555634662633]
We propose a new class of models for variable clustering called Asymptotic Independent block (AI-block) models.
This class of models is identifiable, meaning that there exists a maximal element with a partial order between partitions, allowing for statistical inference.
We also present an algorithm depending on a tuning parameter that recovers the clusters of variables without specifying the number of clusters empha priori.
arXiv Detail & Related papers (2023-02-02T08:24:26Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - Consistency and Diversity induced Human Motion Segmentation [231.36289425663702]
We propose a novel Consistency and Diversity induced human Motion (CDMS) algorithm.
Our model factorizes the source and target data into distinct multi-layer feature spaces.
A multi-mutual learning strategy is carried out to reduce the domain gap between the source and target data.
arXiv Detail & Related papers (2022-02-10T06:23:56Z) - Cluster-and-Conquer: A Framework For Time-Series Forecasting [94.63501563413725]
We propose a three-stage framework for forecasting high-dimensional time-series data.
Our framework is highly general, allowing for any time-series forecasting and clustering method to be used in each step.
When instantiated with simple linear autoregressive models, we are able to achieve state-of-the-art results on several benchmark datasets.
arXiv Detail & Related papers (2021-10-26T20:41:19Z) - Quantile-based fuzzy C-means clustering of multivariate time series:
Robust techniques [2.3226893628361682]
Robustness to the presence of outliers is achieved by using the so-called metric, noise and trimmed approaches.
Results from a broad simulation study indicate that the algorithms are substantially effective in coping with the presence of outlying series.
arXiv Detail & Related papers (2021-09-22T20:26:12Z) - Conjugate Mixture Models for Clustering Multimodal Data [24.640116037967985]
The problem of multimodal clustering arises whenever the data are gathered with several physically different sensors.
We show that multimodal clustering can be addressed within a novel framework, namely conjugate mixture models.
arXiv Detail & Related papers (2020-12-09T10:13:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.