Federated Survival Forests
- URL: http://arxiv.org/abs/2302.02807v2
- Date: Mon, 7 Aug 2023 07:43:37 GMT
- Title: Federated Survival Forests
- Authors: Alberto Archetti, Matteo Matteucci
- Abstract summary: We present a novel algorithm for survival analysis based on one of the most successful survival models, the random survival forest.
With a single communication round, FedSurF obtains a discriminative power comparable to deep-learning-based federated models trained over hundreds of iterations.
- Score: 9.413131350284083
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Survival analysis is a subfield of statistics concerned with modeling the
occurrence time of a particular event of interest for a population. Survival
analysis found widespread applications in healthcare, engineering, and social
sciences. However, real-world applications involve survival datasets that are
distributed, incomplete, censored, and confidential. In this context, federated
learning can tremendously improve the performance of survival analysis
applications. Federated learning provides a set of privacy-preserving
techniques to jointly train machine learning models on multiple datasets
without compromising user privacy, leading to a better generalization
performance. However, despite the widespread development of federated learning
in recent AI research, few studies focus on federated survival analysis. In
this work, we present a novel federated algorithm for survival analysis based
on one of the most successful survival models, the random survival forest. We
call the proposed method Federated Survival Forest (FedSurF). With a single
communication round, FedSurF obtains a discriminative power comparable to
deep-learning-based federated models trained over hundreds of federated
iterations. Moreover, FedSurF retains all the advantages of random forests,
namely low computational cost and natural handling of missing values and
incomplete datasets. These advantages are especially desirable in real-world
federated environments with multiple small datasets stored on devices with low
computational capabilities. Numerical experiments compare FedSurF with
state-of-the-art survival models in federated networks, showing how FedSurF
outperforms deep-learning-based federated algorithms in realistic environments
with non-identically distributed data.
Related papers
- Developing Federated Time-to-Event Scores Using Heterogeneous Real-World
Survival Data [8.3734832069509]
Existing methods for constructing survival scores presume that data originates from a single source.
We propose a novel framework for building federated scoring systems for multi-site survival outcomes.
In testing datasets from each participant site, our proposed federated scoring system consistently outperformed all local models.
arXiv Detail & Related papers (2024-03-08T11:32:00Z) - FedRSU: Federated Learning for Scene Flow Estimation on Roadside Units [50.930389484343515]
Roadside unit (RSU) can significantly improve the safety and robustness of autonomous vehicles through Vehicle-to-Everything (V2X) communication.
Currently, the usage of a single RSU mainly focuses on real-time inference and V2X collaboration.
Integrating the vast amounts of data from numerous RSUs can provide a rich source of data for model training.
We introduce FedRSU, an innovative federated learning framework for self-supervised scene flow estimation.
arXiv Detail & Related papers (2024-01-23T15:52:57Z) - Factor-Assisted Federated Learning for Personalized Optimization with
Heterogeneous Data [6.024145412139383]
Federated learning is an emerging distributed machine learning framework aiming at protecting data privacy.
Data in different clients contain both common knowledge and personalized knowledge.
We develop a novel personalized federated learning framework for heterogeneous data, which we refer to as FedSplit.
arXiv Detail & Related papers (2023-12-07T13:05:47Z) - Scaling Survival Analysis in Healthcare with Federated Survival Forests:
A Comparative Study on Heart Failure and Breast Cancer Genomics [7.967995669387532]
In real-world applications, survival data are often incomplete, censored, distributed, and confidential.
We propose an extension of the Federated Survival Forest algorithm, called FedSurF++.
arXiv Detail & Related papers (2023-08-04T15:25:56Z) - Tackling Computational Heterogeneity in FL: A Few Theoretical Insights [68.8204255655161]
We introduce and analyse a novel aggregation framework that allows for formalizing and tackling computational heterogeneous data.
Proposed aggregation algorithms are extensively analyzed from a theoretical, and an experimental prospective.
arXiv Detail & Related papers (2023-07-12T16:28:21Z) - Heterogeneous Datasets for Federated Survival Analysis Simulation [6.489759672413373]
This work proposes a novel technique for constructing realistic heterogeneous datasets by starting from existing non-federated datasets in a reproducible way.
Specifically, we provide two novel dataset-splitting algorithms based on the Dirichlet distribution to assign each data sample to a carefully chosen client.
The implementation of the proposed methods is publicly available in favor of and to encourage common practices to simulate federated environments for survival analysis.
arXiv Detail & Related papers (2023-01-28T11:37:07Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - Local Learning Matters: Rethinking Data Heterogeneity in Federated
Learning [61.488646649045215]
Federated learning (FL) is a promising strategy for performing privacy-preserving, distributed learning with a network of clients (i.e., edge devices)
arXiv Detail & Related papers (2021-11-28T19:03:39Z) - Concept drift detection and adaptation for federated and continual
learning [55.41644538483948]
Smart devices can collect vast amounts of data from their environment.
This data is suitable for training machine learning models, which can significantly improve their behavior.
In this work, we present a new method, called Concept-Drift-Aware Federated Averaging.
arXiv Detail & Related papers (2021-05-27T17:01:58Z) - WAFFLe: Weight Anonymized Factorization for Federated Learning [88.44939168851721]
In domains where data are sensitive or private, there is great value in methods that can learn in a distributed manner without the data ever leaving the local devices.
We propose Weight Anonymized Factorization for Federated Learning (WAFFLe), an approach that combines the Indian Buffet Process with a shared dictionary of weight factors for neural networks.
arXiv Detail & Related papers (2020-08-13T04:26:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.