Collaborative Training of Medical Artificial Intelligence Models with
non-uniform Labels
- URL: http://arxiv.org/abs/2211.13606v2
- Date: Thu, 13 Apr 2023 16:08:06 GMT
- Title: Collaborative Training of Medical Artificial Intelligence Models with
non-uniform Labels
- Authors: Soroosh Tayebi Arasteh, Peter Isfort, Marwin Saehn, Gustav
Mueller-Franzes, Firas Khader, Jakob Nikolas Kather, Christiane Kuhl, Sven
Nebelung, Daniel Truhn
- Abstract summary: Building powerful and robust deep learning models requires training with large multi-party datasets.
We propose flexible federated learning (FFL) for collaborative training on such data.
We demonstrate that having heterogeneously labeled datasets, FFL-based training leads to significant performance increase.
- Score: 0.07176066267895696
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Due to the rapid advancements in recent years, medical image analysis is
largely dominated by deep learning (DL). However, building powerful and robust
DL models requires training with large multi-party datasets. While multiple
stakeholders have provided publicly available datasets, the ways in which these
data are labeled vary widely. For Instance, an institution might provide a
dataset of chest radiographs containing labels denoting the presence of
pneumonia, while another institution might have a focus on determining the
presence of metastases in the lung. Training a single AI model utilizing all
these data is not feasible with conventional federated learning (FL). This
prompts us to propose an extension to the widespread FL process, namely
flexible federated learning (FFL) for collaborative training on such data.
Using 695,000 chest radiographs from five institutions from across the globe -
each with differing labels - we demonstrate that having heterogeneously labeled
datasets, FFL-based training leads to significant performance increase compared
to conventional FL training, where only the uniformly annotated images are
utilized. We believe that our proposed algorithm could accelerate the process
of bringing collaborative training methods from research and simulation phase
to the real-world applications in healthcare.
Related papers
- FedCVD: The First Real-World Federated Learning Benchmark on Cardiovascular Disease Data [52.55123685248105]
Cardiovascular diseases (CVDs) are currently the leading cause of death worldwide, highlighting the critical need for early diagnosis and treatment.
Machine learning (ML) methods can help diagnose CVDs early, but their performance relies on access to substantial data with high quality.
This paper presents the first real-world FL benchmark for cardiovascular disease detection, named FedCVD.
arXiv Detail & Related papers (2024-10-28T02:24:01Z) - Future-Proofing Medical Imaging with Privacy-Preserving Federated Learning and Uncertainty Quantification: A Review [14.88874727211064]
AI could soon become routine in clinical practice for disease diagnosis, prognosis, treatment planning, and post-treatment surveillance.
Privacy concerns surrounding patient data present a major barrier to the widespread adoption of AI in medical imaging.
Federated Learning (FL) offers a solution that enables organizations to train AI models collaboratively without sharing sensitive data.
arXiv Detail & Related papers (2024-09-24T16:55:32Z) - FedMM: Federated Multi-Modal Learning with Modality Heterogeneity in
Computational Pathology [3.802258033231335]
Federated Multi-Modal (FedMM) is a learning framework that trains multiple single-modal feature extractors to enhance subsequent classification performance.
FedMM notably outperforms two baselines in accuracy and AUC metrics.
arXiv Detail & Related papers (2024-02-24T16:58:42Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Federated Multilingual Models for Medical Transcript Analysis [11.877236847857336]
We present a federated learning system for training a large-scale multi-lingual model.
None of the training data is ever transmitted to any central location.
We show that the global model performance can be further improved by a training step performed locally.
arXiv Detail & Related papers (2022-11-04T01:07:54Z) - FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in
Realistic Healthcare Settings [51.09574369310246]
Federated Learning (FL) is a novel approach enabling several clients holding sensitive data to collaboratively train machine learning models.
We propose a novel cross-silo dataset suite focused on healthcare, FLamby, to bridge the gap between theory and practice of cross-silo FL.
Our flexible and modular suite allows researchers to easily download datasets, reproduce results and re-use the different components for their research.
arXiv Detail & Related papers (2022-10-10T12:17:30Z) - Label-Efficient Self-Supervised Federated Learning for Tackling Data
Heterogeneity in Medical Imaging [23.08596805950814]
We present a robust and label-efficient self-supervised FL framework for medical image analysis.
Specifically, we introduce a novel distributed self-supervised pre-training paradigm into the existing FL pipeline.
We show that our self-supervised FL algorithm generalizes well to out-of-distribution data and learns federated models more effectively in limited label scenarios.
arXiv Detail & Related papers (2022-05-17T18:33:43Z) - Federated Cross Learning for Medical Image Segmentation [23.075410916203005]
Federated learning (FL) can collaboratively train deep learning models using isolated patient data owned by different hospitals for various clinical applications.
A major problem of FL is its performance degradation when dealing with data that are not independently and identically distributed (non-iid)
arXiv Detail & Related papers (2022-04-05T18:55:02Z) - Federated Cycling (FedCy): Semi-supervised Federated Learning of
Surgical Phases [57.90226879210227]
FedCy is a semi-supervised learning (FSSL) method that combines FL and self-supervised learning to exploit a decentralized dataset of both labeled and unlabeled videos.
We demonstrate significant performance gains over state-of-the-art FSSL methods on the task of automatic recognition of surgical phases.
arXiv Detail & Related papers (2022-03-14T17:44:53Z) - BERT WEAVER: Using WEight AVERaging to enable lifelong learning for
transformer-based models in biomedical semantic search engines [49.75878234192369]
We present WEAVER, a simple, yet efficient post-processing method that infuses old knowledge into the new model.
We show that applying WEAVER in a sequential manner results in similar word embedding distributions as doing a combined training on all data at once.
arXiv Detail & Related papers (2022-02-21T10:34:41Z) - Differentially private federated deep learning for multi-site medical
image segmentation [56.30543374146002]
Collaborative machine learning techniques such as federated learning (FL) enable the training of models on effectively larger datasets without data transfer.
Recent initiatives have demonstrated that segmentation models trained with FL can achieve performance similar to locally trained models.
However, FL is not a fully privacy-preserving technique and privacy-centred attacks can disclose confidential patient data.
arXiv Detail & Related papers (2021-07-06T12:57:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.