Model-based Clustering of Individuals' Ecological Momentary Assessment
Time-series Data for Improving Forecasting Performance
- URL: http://arxiv.org/abs/2310.07491v1
- Date: Wed, 11 Oct 2023 13:39:04 GMT
- Title: Model-based Clustering of Individuals' Ecological Momentary Assessment
Time-series Data for Improving Forecasting Performance
- Authors: Mandani Ntekouli, Gerasimos Spanakis, Lourens Waldorp, Anne Roefs
- Abstract summary: It is believed that additional information of similar individuals is likely to enhance these models leading to better individuals' description.
Two model-based clustering approaches are examined, where the first is using model-extracted parameters of personalized models.
The superiority of clustering-based methods is confirmed, indicating that the utilization of group-based information could be effectively enhance the overall performance of all individuals' data.
- Score: 5.312303275762104
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Through Ecological Momentary Assessment (EMA) studies, a number of
time-series data is collected across multiple individuals, continuously
monitoring various items of emotional behavior. Such complex data is commonly
analyzed in an individual level, using personalized models. However, it is
believed that additional information of similar individuals is likely to
enhance these models leading to better individuals' description. Thus,
clustering is investigated with an aim to group together the most similar
individuals, and subsequently use this information in group-based models in
order to improve individuals' predictive performance. More specifically, two
model-based clustering approaches are examined, where the first is using
model-extracted parameters of personalized models, whereas the second is
optimized on the model-based forecasting performance. Both methods are then
analyzed using intrinsic clustering evaluation measures (e.g. Silhouette
coefficients) as well as the performance of a downstream forecasting scheme,
where each forecasting group-model is devoted to describe all individuals
belonging to one cluster. Among these, clustering based on performance shows
the best results, in terms of all examined evaluation measures. As another
level of evaluation, those group-models' performance is compared to three
baseline scenarios, the personalized, the all-in-one group and the random
group-based concept. According to this comparison, the superiority of
clustering-based methods is again confirmed, indicating that the utilization of
group-based information could be effectively enhance the overall performance of
all individuals' data.
Related papers
- A structured regression approach for evaluating model performance across intersectional subgroups [53.91682617836498]
Disaggregated evaluation is a central task in AI fairness assessment, where the goal is to measure an AI system's performance across different subgroups.
We introduce a structured regression approach to disaggregated evaluation that we demonstrate can yield reliable system performance estimates even for very small subgroups.
arXiv Detail & Related papers (2024-01-26T14:21:45Z) - Time series clustering based on prediction accuracy of global
forecasting models [0.0]
A novel method to perform model-based clustering of time series is proposed in this paper.
Unlike most techniques proposed in the literature, the method considers the predictive accuracy as the main element for constructing the clustering partition.
An extensive simulation study shows that our method outperforms several alternative techniques concerning both clustering effectiveness and predictive accuracy.
arXiv Detail & Related papers (2023-04-30T13:12:19Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - Using Explainable Boosting Machine to Compare Idiographic and Nomothetic
Approaches for Ecological Momentary Assessment Data [2.0824228840987447]
This paper explores the use of non-linear interpretable machine learning (ML) models in classification problems.
Various ensembles of trees are compared to linear models using imbalanced synthetic and real-world datasets.
In one of the two real-world datasets, knowledge distillation method achieves improved AUC scores.
arXiv Detail & Related papers (2022-04-04T17:56:37Z) - Multi-objective Semi-supervised Clustering for Finding Predictive
Clusters [0.5371337604556311]
This study focuses on clustering problems and aims to find compact clusters that are informative regarding the outcome variable.
The main goal is partitioning data points so that observations in each cluster are similar and the outcome variable can be predicated using these clusters simultaneously.
arXiv Detail & Related papers (2022-01-26T06:24:38Z) - Spectral Clustering with Variance Information for Group Structure
Estimation in Panel Data [7.712669451925186]
We first conduct a local analysis which reveals that the variances of the individual coefficient estimates contain useful information for the estimation of group structure.
We then propose a method to estimate unobserved groupings for general panel data models that explicitly account for the variance information.
arXiv Detail & Related papers (2022-01-05T19:16:16Z) - Cluster-and-Conquer: A Framework For Time-Series Forecasting [94.63501563413725]
We propose a three-stage framework for forecasting high-dimensional time-series data.
Our framework is highly general, allowing for any time-series forecasting and clustering method to be used in each step.
When instantiated with simple linear autoregressive models, we are able to achieve state-of-the-art results on several benchmark datasets.
arXiv Detail & Related papers (2021-10-26T20:41:19Z) - Test-time Collective Prediction [73.74982509510961]
Multiple parties in machine learning want to jointly make predictions on future test points.
Agents wish to benefit from the collective expertise of the full set of agents, but may not be willing to release their data or model parameters.
We explore a decentralized mechanism to make collective predictions at test time, leveraging each agent's pre-trained model.
arXiv Detail & Related papers (2021-06-22T18:29:58Z) - Characterizing Fairness Over the Set of Good Models Under Selective
Labels [69.64662540443162]
We develop a framework for characterizing predictive fairness properties over the set of models that deliver similar overall performance.
We provide tractable algorithms to compute the range of attainable group-level predictive disparities.
We extend our framework to address the empirically relevant challenge of selectively labelled data.
arXiv Detail & Related papers (2021-01-02T02:11:37Z) - Temporal Phenotyping using Deep Predictive Clustering of Disease
Progression [97.88605060346455]
We develop a deep learning approach for clustering time-series data, where each cluster comprises patients who share similar future outcomes of interest.
Experiments on two real-world datasets show that our model achieves superior clustering performance over state-of-the-art benchmarks.
arXiv Detail & Related papers (2020-06-15T20:48:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.