Developing Federated Time-to-Event Scores Using Heterogeneous Real-World
Survival Data
- URL: http://arxiv.org/abs/2403.05229v1
- Date: Fri, 8 Mar 2024 11:32:00 GMT
- Title: Developing Federated Time-to-Event Scores Using Heterogeneous Real-World
Survival Data
- Authors: Siqi Li, Yuqing Shang, Ziwen Wang, Qiming Wu, Chuan Hong, Yilin Ning,
Di Miao, Marcus Eng Hock Ong, Bibhas Chakraborty, Nan Liu
- Abstract summary: Existing methods for constructing survival scores presume that data originates from a single source.
We propose a novel framework for building federated scoring systems for multi-site survival outcomes.
In testing datasets from each participant site, our proposed federated scoring system consistently outperformed all local models.
- Score: 8.3734832069509
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Survival analysis serves as a fundamental component in numerous healthcare
applications, where the determination of the time to specific events (such as
the onset of a certain disease or death) for patients is crucial for clinical
decision-making. Scoring systems are widely used for swift and efficient risk
prediction. However, existing methods for constructing survival scores presume
that data originates from a single source, posing privacy challenges in
collaborations with multiple data owners. We propose a novel framework for
building federated scoring systems for multi-site survival outcomes, ensuring
both privacy and communication efficiency. We applied our approach to sites
with heterogeneous survival data originating from emergency departments in
Singapore and the United States. Additionally, we independently developed local
scores at each site. In testing datasets from each participant site, our
proposed federated scoring system consistently outperformed all local models,
evidenced by higher integrated area under the receiver operating characteristic
curve (iAUC) values, with a maximum improvement of 11.6%. Additionally, the
federated score's time-dependent AUC(t) values showed advantages over local
scores, exhibiting narrower confidence intervals (CIs) across most time points.
The model developed through our proposed method exhibits effective performance
on each local site, signifying noteworthy implications for healthcare research.
Sites participating in our proposed federated scoring model training gained
benefits by acquiring survival models with enhanced prediction accuracy and
efficiency. This study demonstrates the effectiveness of our privacy-preserving
federated survival score generation framework and its applicability to
real-world heterogeneous survival data.
Related papers
- Differentially Private Multi-Site Treatment Effect Estimation [28.13660104055298]
Most patient data remains in silo in separate hospitals, preventing the design of data-driven healthcare AI systems.
We look at estimating the average treatment effect (ATE), an important task in causal inference for healthcare applications.
We address this through a class of per-site estimation algorithms that reports the ATE estimate and its variance as a quality measure.
arXiv Detail & Related papers (2023-10-10T01:21:01Z) - Source-Free Collaborative Domain Adaptation via Multi-Perspective
Feature Enrichment for Functional MRI Analysis [55.03872260158717]
Resting-state MRI functional (rs-fMRI) is increasingly employed in multi-site research to aid neurological disorder analysis.
Many methods have been proposed to reduce fMRI heterogeneity between source and target domains.
But acquiring source data is challenging due to concerns and/or data storage burdens in multi-site studies.
We design a source-free collaborative domain adaptation framework for fMRI analysis, where only a pretrained source model and unlabeled target data are accessible.
arXiv Detail & Related papers (2023-08-24T01:30:18Z) - Scaling Survival Analysis in Healthcare with Federated Survival Forests:
A Comparative Study on Heart Failure and Breast Cancer Genomics [7.967995669387532]
In real-world applications, survival data are often incomplete, censored, distributed, and confidential.
We propose an extension of the Federated Survival Forest algorithm, called FedSurF++.
arXiv Detail & Related papers (2023-08-04T15:25:56Z) - GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models [60.48306899271866]
We present a new framework, called GREAT Score, for global robustness evaluation of adversarial perturbation using generative models.
We show high correlation and significantly reduced cost of GREAT Score when compared to the attack-based model ranking on RobustBench.
GREAT Score can be used for remote auditing of privacy-sensitive black-box models.
arXiv Detail & Related papers (2023-04-19T14:58:27Z) - Federated Survival Forests [9.413131350284083]
We present a novel algorithm for survival analysis based on one of the most successful survival models, the random survival forest.
With a single communication round, FedSurF obtains a discriminative power comparable to deep-learning-based federated models trained over hundreds of iterations.
arXiv Detail & Related papers (2023-02-06T14:31:51Z) - Decentralized Distributed Learning with Privacy-Preserving Data
Synthesis [9.276097219140073]
In the medical field, multi-center collaborations are often sought to yield more generalizable findings by leveraging the heterogeneity of patient and clinical data.
Recent privacy regulations hinder the possibility to share data, and consequently, to come up with machine learning-based solutions that support diagnosis and prognosis.
We present a decentralized distributed method that integrates features from local nodes, providing models able to generalize across multiple datasets while maintaining privacy.
arXiv Detail & Related papers (2022-06-20T23:49:38Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - A Tree-based Federated Learning Approach for Personalized Treatment
Effect Estimation from Heterogeneous Data Sources [5.049057348282933]
Federated learning is an appealing framework for analyzing sensitive data from distributed health data networks.
We develop an efficient and interpretable tree-based ensemble of personalized treatment effect estimators to join results across hospital sites.
arXiv Detail & Related papers (2021-03-10T18:51:30Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Survival Cluster Analysis [93.50540270973927]
There is an unmet need in survival analysis for identifying subpopulations with distinct risk profiles.
An approach that addresses this need is likely to improve characterization of individual outcomes.
arXiv Detail & Related papers (2020-02-29T22:41:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.