Scaling Survival Analysis in Healthcare with Federated Survival Forests:
A Comparative Study on Heart Failure and Breast Cancer Genomics
- URL: http://arxiv.org/abs/2308.02382v1
- Date: Fri, 4 Aug 2023 15:25:56 GMT
- Title: Scaling Survival Analysis in Healthcare with Federated Survival Forests:
A Comparative Study on Heart Failure and Breast Cancer Genomics
- Authors: Alberto Archetti, Francesca Ieva, Matteo Matteucci
- Abstract summary: In real-world applications, survival data are often incomplete, censored, distributed, and confidential.
We propose an extension of the Federated Survival Forest algorithm, called FedSurF++.
- Score: 7.967995669387532
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Survival analysis is a fundamental tool in medicine, modeling the time until
an event of interest occurs in a population. However, in real-world
applications, survival data are often incomplete, censored, distributed, and
confidential, especially in healthcare settings where privacy is critical. The
scarcity of data can severely limit the scalability of survival models to
distributed applications that rely on large data pools. Federated learning is a
promising technique that enables machine learning models to be trained on
multiple datasets without compromising user privacy, making it particularly
well-suited for addressing the challenges of survival data and large-scale
survival applications. Despite significant developments in federated learning
for classification and regression, many directions remain unexplored in the
context of survival analysis. In this work, we propose an extension of the
Federated Survival Forest algorithm, called FedSurF++. This federated ensemble
method constructs random survival forests in heterogeneous federations.
Specifically, we investigate several new tree sampling methods from client
forests and compare the results with state-of-the-art survival models based on
neural networks. The key advantage of FedSurF++ is its ability to achieve
comparable performance to existing methods while requiring only a single
communication round to complete. The extensive empirical investigation results
in a significant improvement from the algorithmic and privacy preservation
perspectives, making the original FedSurF algorithm more efficient, robust, and
private. We also present results on two real-world datasets demonstrating the
success of FedSurF++ in real-world healthcare studies. Our results underscore
the potential of FedSurF++ to improve the scalability and effectiveness of
survival analysis in distributed settings while preserving user privacy.
Related papers
- HACSurv: A Hierarchical Copula-based Approach for Survival Analysis with Dependent Competing Risks [51.95824566163554]
HACSurv is a survival analysis method that learns structures and cause-specific survival functions from data with competing risks.
By capturing the dependencies between risks and censoring, HACSurv achieves better survival predictions.
arXiv Detail & Related papers (2024-10-19T18:52:18Z) - Deep End-to-End Survival Analysis with Temporal Consistency [49.77103348208835]
We present a novel Survival Analysis algorithm designed to efficiently handle large-scale longitudinal data.
A central idea in our method is temporal consistency, a hypothesis that past and future outcomes in the data evolve smoothly over time.
Our framework uniquely incorporates temporal consistency into large datasets by providing a stable training signal.
arXiv Detail & Related papers (2024-10-09T11:37:09Z) - Developing Federated Time-to-Event Scores Using Heterogeneous Real-World
Survival Data [8.3734832069509]
Existing methods for constructing survival scores presume that data originates from a single source.
We propose a novel framework for building federated scoring systems for multi-site survival outcomes.
In testing datasets from each participant site, our proposed federated scoring system consistently outperformed all local models.
arXiv Detail & Related papers (2024-03-08T11:32:00Z) - Source-Free Collaborative Domain Adaptation via Multi-Perspective
Feature Enrichment for Functional MRI Analysis [55.03872260158717]
Resting-state MRI functional (rs-fMRI) is increasingly employed in multi-site research to aid neurological disorder analysis.
Many methods have been proposed to reduce fMRI heterogeneity between source and target domains.
But acquiring source data is challenging due to concerns and/or data storage burdens in multi-site studies.
We design a source-free collaborative domain adaptation framework for fMRI analysis, where only a pretrained source model and unlabeled target data are accessible.
arXiv Detail & Related papers (2023-08-24T01:30:18Z) - SurvivalGAN: Generating Time-to-Event Data for Survival Analysis [121.84429525403694]
Imbalances in censoring and time horizons cause generative models to experience three new failure modes specific to survival analysis.
We propose SurvivalGAN, a generative model that handles survival data by addressing the imbalance in the censoring and event horizons.
We evaluate this method via extensive experiments on medical datasets.
arXiv Detail & Related papers (2023-02-24T17:03:51Z) - Federated Survival Forests [9.413131350284083]
We present a novel algorithm for survival analysis based on one of the most successful survival models, the random survival forest.
With a single communication round, FedSurF obtains a discriminative power comparable to deep-learning-based federated models trained over hundreds of iterations.
arXiv Detail & Related papers (2023-02-06T14:31:51Z) - Heterogeneous Datasets for Federated Survival Analysis Simulation [6.489759672413373]
This work proposes a novel technique for constructing realistic heterogeneous datasets by starting from existing non-federated datasets in a reproducible way.
Specifically, we provide two novel dataset-splitting algorithms based on the Dirichlet distribution to assign each data sample to a carefully chosen client.
The implementation of the proposed methods is publicly available in favor of and to encourage common practices to simulate federated environments for survival analysis.
arXiv Detail & Related papers (2023-01-28T11:37:07Z) - FedPseudo: Pseudo value-based Deep Learning Models for Federated
Survival Analysis [9.659041001051415]
We propose a first-of-its-kind, pseudo value-based deep learning model for federated survival analysis called FedPseudo.
Our proposed FL framework achieves similar performance as the best centrally trained deep survival analysis model.
arXiv Detail & Related papers (2022-07-12T01:10:36Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.