Spectral Clustering with Variance Information for Group Structure
Estimation in Panel Data
- URL: http://arxiv.org/abs/2201.01793v2
- Date: Thu, 8 Feb 2024 16:02:52 GMT
- Title: Spectral Clustering with Variance Information for Group Structure
Estimation in Panel Data
- Authors: Lu Yu, Jiaying Gu, Stanislav Volgushev
- Abstract summary: We first conduct a local analysis which reveals that the variances of the individual coefficient estimates contain useful information for the estimation of group structure.
We then propose a method to estimate unobserved groupings for general panel data models that explicitly account for the variance information.
- Score: 7.712669451925186
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Consider a panel data setting where repeated observations on individuals are
available. Often it is reasonable to assume that there exist groups of
individuals that share similar effects of observed characteristics, but the
grouping is typically unknown in advance. We first conduct a local analysis
which reveals that the variances of the individual coefficient estimates
contain useful information for the estimation of group structure. We then
propose a method to estimate unobserved groupings for general panel data models
that explicitly account for the variance information. Our proposed method
remains computationally feasible with a large number of individuals and/or
repeated measurements on each individual. The developed ideas can also be
applied even when individual-level data are not available and only parameter
estimates together with some quantification of estimation uncertainty are given
to the researcher. A thorough simulation study demonstrates superior
performance of our method than existing methods and we apply the method to two
empirical applications.
Related papers
- A structured regression approach for evaluating model performance across intersectional subgroups [53.91682617836498]
Disaggregated evaluation is a central task in AI fairness assessment, where the goal is to measure an AI system's performance across different subgroups.
We introduce a structured regression approach to disaggregated evaluation that we demonstrate can yield reliable system performance estimates even for very small subgroups.
arXiv Detail & Related papers (2024-01-26T14:21:45Z) - Model-based Clustering of Individuals' Ecological Momentary Assessment
Time-series Data for Improving Forecasting Performance [5.312303275762104]
It is believed that additional information of similar individuals is likely to enhance these models leading to better individuals' description.
Two model-based clustering approaches are examined, where the first is using model-extracted parameters of personalized models.
The superiority of clustering-based methods is confirmed, indicating that the utilization of group-based information could be effectively enhance the overall performance of all individuals' data.
arXiv Detail & Related papers (2023-10-11T13:39:04Z) - Beyond Normal: On the Evaluation of Mutual Information Estimators [52.85079110699378]
We show how to construct a diverse family of distributions with known ground-truth mutual information.
We provide guidelines for practitioners on how to select appropriate estimator adapted to the difficulty of problem considered.
arXiv Detail & Related papers (2023-06-19T17:26:34Z) - Estimating Structural Disparities for Face Models [54.062512989859265]
In machine learning, disparity metrics are often defined by measuring the difference in the performance or outcome of a model, across different sub-populations.
We explore performing such analysis on computer vision models trained on human faces, and on tasks such as face attribute prediction and affect estimation.
arXiv Detail & Related papers (2022-04-13T05:30:53Z) - Combining Observational and Randomized Data for Estimating Heterogeneous
Treatment Effects [82.20189909620899]
Estimating heterogeneous treatment effects is an important problem across many domains.
Currently, most existing works rely exclusively on observational data.
We propose to estimate heterogeneous treatment effects by combining large amounts of observational data and small amounts of randomized data.
arXiv Detail & Related papers (2022-02-25T18:59:54Z) - Recommendations for Bayesian hierarchical model specifications for
case-control studies in mental health [0.0]
Researchers must choose whether to assume all subjects are drawn from a common population, or to model them as deriving from separate populations.
We ran systematic simulations on synthetic multi-group behavioural data from a commonly used bandit task.
We found that fitting groups separately provided the most accurate and robust inference across all conditions.
arXiv Detail & Related papers (2020-11-03T14:19:59Z) - Robust Recursive Partitioning for Heterogeneous Treatment Effects with
Uncertainty Quantification [84.53697297858146]
Subgroup analysis of treatment effects plays an important role in applications from medicine to public policy to recommender systems.
Most of the current methods of subgroup analysis begin with a particular algorithm for estimating individualized treatment effects (ITE)
This paper develops a new method for subgroup analysis, R2P, that addresses all these weaknesses.
arXiv Detail & Related papers (2020-06-14T14:50:02Z) - Performance metrics for intervention-triggering prediction models do not
reflect an expected reduction in outcomes from using the model [71.9860741092209]
Clinical researchers often select among and evaluate risk prediction models.
Standard metrics calculated from retrospective data are only related to model utility under certain assumptions.
When predictions are delivered repeatedly throughout time, the relationship between standard metrics and utility is further complicated.
arXiv Detail & Related papers (2020-06-02T16:26:49Z) - Adaptive Discrete Smoothing for High-Dimensional and Nonlinear Panel
Data [4.550919471480445]
We develop a data-driven smoothing technique for high-dimensional and non-linear panel data models.
The weights are determined by a data-driven way and depend on the similarity between the corresponding functions.
We conduct a simulation study which shows that the prediction can be greatly improved by using our estimator.
arXiv Detail & Related papers (2019-12-30T09:50:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.