Granger Causality Based Hierarchical Time Series Clustering for State
Estimation
- URL: http://arxiv.org/abs/2104.04206v1
- Date: Fri, 9 Apr 2021 06:14:54 GMT
- Title: Granger Causality Based Hierarchical Time Series Clustering for State
Estimation
- Authors: Sin Yong Tan, Homagni Saha, Margarite Jacoby, Gregor P. Henze, Soumik
Sarkar
- Abstract summary: Clustering is useful when working with a large volume of unlabeled data.
We propose a hierarchical time series clustering technique based on symbolic dynamic filtering and Granger causality.
A new distance metric based on Granger causality is proposed and used for the time series clustering, as well as validated on empirical data sets.
- Score: 8.384689499720515
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Clustering is an unsupervised learning technique that is useful when working
with a large volume of unlabeled data. Complex dynamical systems in real life
often entail data streaming from a large number of sources. Although it is
desirable to use all source variables to form accurate state estimates, it is
often impractical due to large computational power requirements, and
sufficiently robust algorithms to handle these cases are not common. We propose
a hierarchical time series clustering technique based on symbolic dynamic
filtering and Granger causality, which serves as a dimensionality reduction and
noise-rejection tool. Our process forms a hierarchy of variables in the
multivariate time series with clustering of relevant variables at each level,
thus separating out noise and less relevant variables. A new distance metric
based on Granger causality is proposed and used for the time series clustering,
as well as validated on empirical data sets. Experimental results from
occupancy detection and building temperature estimation tasks show fidelity to
the empirical data sets while maintaining state-prediction accuracy with
substantially reduced data dimensionality.
Related papers
- PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection [51.20479454379662]
We propose a.
Federated Anomaly Detection framework named PeFAD with the increasing privacy concerns.
We conduct extensive evaluations on four real datasets, where PeFAD outperforms existing state-of-the-art baselines by up to 28.74%.
arXiv Detail & Related papers (2024-06-04T13:51:08Z) - Data Aggregation for Hierarchical Clustering [0.3626013617212666]
BETULA is a numerically stable version of the well known BIRCH data aggregation algorithm.
It can be used to make HAC viable on systems with constrained resources with only small losses on clustering quality.
arXiv Detail & Related papers (2023-09-05T19:39:43Z) - Large-scale Fully-Unsupervised Re-Identification [78.47108158030213]
We propose two strategies to learn from large-scale unlabeled data.
The first strategy performs a local neighborhood sampling to reduce the dataset size in each without violating neighborhood relationships.
A second strategy leverages a novel Re-Ranking technique, which has a lower time upper bound complexity and reduces the memory complexity from O(n2) to O(kn) with k n.
arXiv Detail & Related papers (2023-07-26T16:19:19Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - Doubly Inhomogeneous Reinforcement Learning [4.334006170547247]
We propose an original algorithm to determine the best data chunks" that display similar dynamics over time and across individuals for policy learning.
Our method is general, and works with a wide range of clustering and change point detection algorithms.
arXiv Detail & Related papers (2022-11-08T03:41:14Z) - Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly
Types [60.45942774425782]
We introduce anomaly clustering, whose goal is to group data into coherent clusters of anomaly types.
This is different from anomaly detection, whose goal is to divide anomalies from normal data.
We present a simple yet effective clustering framework using a patch-based pretrained deep embeddings and off-the-shelf clustering methods.
arXiv Detail & Related papers (2021-12-21T23:11:33Z) - Cluster-and-Conquer: A Framework For Time-Series Forecasting [94.63501563413725]
We propose a three-stage framework for forecasting high-dimensional time-series data.
Our framework is highly general, allowing for any time-series forecasting and clustering method to be used in each step.
When instantiated with simple linear autoregressive models, we are able to achieve state-of-the-art results on several benchmark datasets.
arXiv Detail & Related papers (2021-10-26T20:41:19Z) - TELESTO: A Graph Neural Network Model for Anomaly Classification in
Cloud Services [77.454688257702]
Machine learning (ML) and artificial intelligence (AI) are applied on IT system operation and maintenance.
One direction aims at the recognition of re-occurring anomaly types to enable remediation automation.
We propose a method that is invariant to dimensionality changes of given data.
arXiv Detail & Related papers (2021-02-25T14:24:49Z) - Minority Oversampling for Imbalanced Time Series Classification [7.695093197007146]
This paper proposes a structure preserving Oversampling method to combat the High-dimensional Imbalanced Time-series classification.
Experimental results on several publicly available time-series datasets demonstrate the superiority of OHIT against the state-of-the-art oversampling algorithms.
arXiv Detail & Related papers (2020-04-14T09:20:12Z) - Autoencoder-based time series clustering with energy applications [0.0]
Time series clustering is a challenging task due to the specific nature of the data.
In this paper we investigate the combination of a convolutional autoencoder and a k-medoids algorithm to perfom time series clustering.
arXiv Detail & Related papers (2020-02-10T10:04:29Z) - Data Curves Clustering Using Common Patterns Detection [0.0]
Analyzing and clustering time series, or in general any kind of curves, could be critical for several human activities.
New Curves Clustering Using Common Patterns (3CP) methodology is introduced.
arXiv Detail & Related papers (2020-01-05T18:36:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.