Federated Learning in Distributed Medical Databases: Meta-Analysis of Large-Scale Subcortical Brain Data
- URL: http://arxiv.org/abs/1810.08553v4
- Date: Tue, 28 Jan 2025 21:14:39 GMT
- Title: Federated Learning in Distributed Medical Databases: Meta-Analysis of Large-Scale Subcortical Brain Data
- Authors: Santiago Silva, Boris Gutman, Eduardo Romero, Paul M Thompson, Andre Altmann, Marco Lorenzi,
- Abstract summary: We propose a federated learning framework for securely accessing and meta-analyzing any biomedical data without sharing individual information.
We illustrate our framework by investigating brain structural relationships across diseases and clinical cohorts.
- Score: 4.380066835476896
- License:
- Abstract: At this moment, databanks worldwide contain brain images of previously unimaginable numbers. Combined with developments in data science, these massive data provide the potential to better understand the genetic underpinnings of brain diseases. However, different datasets, which are stored at different institutions, cannot always be shared directly due to privacy and legal concerns, thus limiting the full exploitation of big data in the study of brain disorders. Here we propose a federated learning framework for securely accessing and meta-analyzing any biomedical data without sharing individual information. We illustrate our framework by investigating brain structural relationships across diseases and clinical cohorts. The framework is first tested on synthetic data and then applied to multi-centric, multi-database studies including ADNI, PPMI, MIRIAD and UK Biobank, showing the potential of the approach for further applications in distributed analysis of multi-centric cohorts
Related papers
- BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature [73.39593644054865]
BIOMEDICA is a scalable, open-source framework to extract, annotate, and serialize the entirety of the PubMed Central Open Access subset into an easy-to-use, publicly accessible dataset.
Our framework produces a comprehensive archive with over 24 million unique image-text pairs from over 6 million articles.
BMCA-CLIP is a suite of CLIP-style models continuously pretrained on the BIOMEDICA dataset via streaming, eliminating the need to download 27 TB of data locally.
arXiv Detail & Related papers (2025-01-13T09:58:03Z) - Federated brain tumor segmentation: an extensive benchmark [2.515027627030043]
We propose an extensive benchmark of federated learning algorithms from all three classes on this task.
We show that some methods from each category can bring a slight performance improvement and potentially limit the final model(s) bias toward the predominant data distribution of the federation.
arXiv Detail & Related papers (2024-10-07T09:32:19Z) - Machine Learning Techniques for MRI Data Processing at Expanding Scale [0.5221459608786241]
Imaging sites around the world generate growing amounts of medical scan data with ever more versatile and affordable technology.
These large datasets encode substantial information about human health and hold considerable potential for machine learning training and analysis.
This chapter examines ongoing large-scale studies and the challenge of distribution shifts between them.
arXiv Detail & Related papers (2024-04-22T16:38:41Z) - UniBrain: Universal Brain MRI Diagnosis with Hierarchical
Knowledge-enhanced Pre-training [66.16134293168535]
We propose a hierarchical knowledge-enhanced pre-training framework for the universal brain MRI diagnosis, termed as UniBrain.
Specifically, UniBrain leverages a large-scale dataset of 24,770 imaging-report pairs from routine diagnostics.
arXiv Detail & Related papers (2023-09-13T09:22:49Z) - Incomplete Multimodal Learning for Complex Brain Disorders Prediction [65.95783479249745]
We propose a new incomplete multimodal data integration approach that employs transformers and generative adversarial networks.
We apply our new method to predict cognitive degeneration and disease outcomes using the multimodal imaging genetic data from Alzheimer's Disease Neuroimaging Initiative cohort.
arXiv Detail & Related papers (2023-05-25T16:29:16Z) - Evaluation and Analysis of Different Aggregation and Hyperparameter
Selection Methods for Federated Brain Tumor Segmentation [2.294014185517203]
We focus on the federated learning paradigm, a distributed learning approach for decentralized data.
Studies show that federated learning can provide competitive performance with conventional central training.
We explore different strategies for faster convergence and better performance which can also work on strong Non-IID cases.
arXiv Detail & Related papers (2022-02-16T07:49:04Z) - 2021 BEETL Competition: Advancing Transfer Learning for Subject
Independence & Heterogenous EEG Data Sets [89.84774119537087]
We design two transfer learning challenges around diagnostics and Brain-Computer-Interfacing (BCI)
Task 1 is centred on medical diagnostics, addressing automatic sleep stage annotation across subjects.
Task 2 is centred on Brain-Computer Interfacing (BCI), addressing motor imagery decoding across both subjects and data sets.
arXiv Detail & Related papers (2022-02-14T12:12:20Z) - FedMed-GAN: Federated Domain Translation on Unsupervised Cross-Modality
Brain Image Synthesis [55.939957482776194]
We propose a new benchmark for federated domain translation on unsupervised brain image synthesis (termed as FedMed-GAN)
FedMed-GAN mitigates the mode collapse without sacrificing the performance of generators.
A comprehensive evaluation is provided for comparing FedMed-GAN and other centralized methods.
arXiv Detail & Related papers (2022-01-22T02:50:29Z) - FedDis: Disentangled Federated Learning for Unsupervised Brain Pathology
Segmentation [12.863361425822314]
We propose Federated Disentanglement (FedDis) to disentangle the parameter space into shape and appearance.
FedDis is based on the assumption that the anatomical structure in brain MRI images is similar across multiple institutions.
We demonstrate a superior performance of FedDis on real pathological databases containing 109 subjects.
arXiv Detail & Related papers (2021-03-05T14:29:52Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.