Quantile-based fuzzy C-means clustering of multivariate time series:
Robust techniques
- URL: http://arxiv.org/abs/2109.11027v1
- Date: Wed, 22 Sep 2021 20:26:12 GMT
- Title: Quantile-based fuzzy C-means clustering of multivariate time series:
Robust techniques
- Authors: \'Angel L\'opez-Oriona, Pierpaolo D'Urso, Jos\'e Antonio Vilar and
Borja Lafuente-Rego
- Abstract summary: Robustness to the presence of outliers is achieved by using the so-called metric, noise and trimmed approaches.
Results from a broad simulation study indicate that the algorithms are substantially effective in coping with the presence of outlying series.
- Score: 2.3226893628361682
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Three robust methods for clustering multivariate time series from the point
of view of generating processes are proposed. The procedures are robust
versions of a fuzzy C-means model based on: (i) estimates of the quantile
cross-spectral density and (ii) the classical principal component analysis.
Robustness to the presence of outliers is achieved by using the so-called
metric, noise and trimmed approaches. The metric approach incorporates in the
objective function a distance measure aimed at neutralizing the effect of the
outliers, the noise approach builds an artificial cluster expected to contain
the outlying series and the trimmed approach eliminates the most atypical
series in the dataset. All the proposed techniques inherit the nice properties
of the quantile cross-spectral density, as being able to uncover general types
of dependence. Results from a broad simulation study including multivariate
linear, nonlinear and GARCH processes indicate that the algorithms are
substantially effective in coping with the presence of outlying series (i.e.,
series exhibiting a dependence structure different from that of the majority),
clearly poutperforming alternative procedures. The usefulness of the suggested
methods is highlighted by means of two specific applications regarding
financial and environmental series.
Related papers
- Fuzzy clustering of circular time series based on a new dependence
measure with applications to wind data [2.845817138242963]
Time series clustering is an essential machine learning task with applications in many disciplines.
A distance between circular series is introduced and used to construct a clustering procedure.
A fuzzy approach is adopted, which enables the procedure to locate each series into several clusters with different membership degrees.
arXiv Detail & Related papers (2024-01-26T12:21:57Z) - Detection of Anomalies in Multivariate Time Series Using Ensemble
Techniques [3.2422067155309806]
We propose an ensemble technique that combines multiple base models toward the final decision.
A semi-supervised approach using a Logistic Regressor to combine the base models' outputs is also proposed.
The performance improvement in terms of anomaly detection accuracy reaches 2% for the unsupervised and at least 10% for the semi-supervised models.
arXiv Detail & Related papers (2023-08-06T17:51:22Z) - Rethinking k-means from manifold learning perspective [122.38667613245151]
We present a new clustering algorithm which directly detects clusters of data without mean estimation.
Specifically, we construct distance matrix between data points by Butterworth filter.
To well exploit the complementary information embedded in different views, we leverage the tensor Schatten p-norm regularization.
arXiv Detail & Related papers (2023-05-12T03:01:41Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - Fuzzy clustering of ordinal time series based on two novel distances
with economic applications [0.12891210250935145]
Two novel distances between ordinal time series are introduced and used to construct fuzzy clustering procedures.
The resulting clustering algorithms are computationally efficient and able to group series generated from similar processes.
Two specific applications involving economic time series illustrate the usefulness of the proposed approaches.
arXiv Detail & Related papers (2023-04-24T16:39:22Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - A One-shot Framework for Distributed Clustered Learning in Heterogeneous
Environments [54.172993875654015]
The paper proposes a family of communication efficient methods for distributed learning in heterogeneous environments.
One-shot approach, based on local computations at the users and a clustering based aggregation step at the server is shown to provide strong learning guarantees.
For strongly convex problems it is shown that, as long as the number of data points per user is above a threshold, the proposed approach achieves order-optimal mean-squared error rates in terms of the sample size.
arXiv Detail & Related papers (2022-09-22T09:04:10Z) - Gradient Based Clustering [72.15857783681658]
We propose a general approach for distance based clustering, using the gradient of the cost function that measures clustering quality.
The approach is an iterative two step procedure (alternating between cluster assignment and cluster center updates) and is applicable to a wide range of functions.
arXiv Detail & Related papers (2022-02-01T19:31:15Z) - Quantile-based fuzzy clustering of multivariate time series in the
frequency domain [2.610470075814367]
fuzzy C-means and fuzzy C-medoids algorithms are proposed.
The performance of the proposed approach is evaluated in a broad simulation study.
Two specific applications involving air quality and financial databases illustrate the usefulness of our approach.
arXiv Detail & Related papers (2021-09-08T15:38:33Z) - Stable and consistent density-based clustering via multiparameter
persistence [77.34726150561087]
We consider the degree-Rips construction from topological data analysis.
We analyze its stability to perturbations of the input data using the correspondence-interleaving distance.
We integrate these methods into a pipeline for density-based clustering, which we call Persistable.
arXiv Detail & Related papers (2020-05-18T19:45:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.