Federated Learning for Pediatric Pneumonia Detection: Enabling Collaborative Diagnosis Without Sharing Patient Data
- URL: http://arxiv.org/abs/2511.11714v1
- Date: Wed, 12 Nov 2025 18:17:06 GMT
- Title: Federated Learning for Pediatric Pneumonia Detection: Enabling Collaborative Diagnosis Without Sharing Patient Data
- Authors: Daniel M. Jimenez-Gutierrez, Enrique Zuazua, Joaquin Del Rio, Oleksii Sliusarenko, Xabi Uribe-Etxebarria
- Abstract summary: Early and accurate pneumonia detection from chest X-rays is clinically critical to expedite treatment and isolation, reduce complications, and curb unnecessary antibiotic use. Development of CXR-based detection is hindered by globally distributed data, high inter-hospital variability, and strict privacy regulations. In this paper, we evaluate Federated Learning (FL) using the Sherpa.ai FL platform. FL delivers high-performing, generalizable, secure and private pneumonia detection across healthcare networks.
- Score: 0.20878272814614088
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Early and accurate pneumonia detection from chest X-rays (CXRs) is clinically critical to expedite treatment and isolation, reduce complications, and curb unnecessary antibiotic use. Although artificial intelligence (AI) substantially improves CXR-based detection, development is hindered by globally distributed data, high inter-hospital variability, and strict privacy regulations (e.g., HIPAA, GDPR) that make centralization impractical. These constraints are compounded by heterogeneous imaging protocols, uneven data availability, and the costs of transferring large medical images across geographically dispersed sites. In this paper, we evaluate Federated Learning (FL) using the Sherpa.ai FL platform, enabling multiple hospitals (nodes) to collaboratively train a CXR classifier for pneumonia while keeping data in place and private. Using the Pediatric Pneumonia Chest X-ray dataset, we simulate cross-hospital collaboration with non-independent and non-identically distributed (non-IID) data, reproducing real-world variability across institutions and jurisdictions. Our experiments demonstrate that collaborative and privacy-preserving training across multiple hospitals via FL led to a dramatic performance improvement achieving 0.900 Accuracy and 0.966 ROC-AUC, corresponding to 47.5% and 50.0% gains over single-hospital models (0.610; 0.644), without transferring any patient CXR. These results indicate that FL delivers high-performing, generalizable, secure and private pneumonia detection across healthcare networks, with data kept local. This is especially relevant for rare diseases, where FL enables secure multi-institutional collaboration without data movement, representing a breakthrough for accelerating diagnosis and treatment development in low-data domains.
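The percentage gains quoted in the abstract appear to be relative improvements over the single-hospital baseline, not absolute differences. The short check below reproduces them from the four reported metric values; the function name `relative_gain` is a hypothetical helper introduced here for illustration and is not taken from the paper.

```python
def relative_gain(federated: float, single: float) -> float:
    """Relative improvement of the federated score over the baseline, in percent."""
    return (federated - single) / single * 100.0


# Metric values as reported in the abstract.
acc_gain = relative_gain(0.900, 0.610)  # Accuracy: federated vs. single-hospital
auc_gain = relative_gain(0.966, 0.644)  # ROC-AUC: federated vs. single-hospital

print(f"Accuracy gain: {acc_gain:.1f}%")
print(f"ROC-AUC gain: {auc_gain:.1f}%")
```

Read this way, the absolute improvements are 0.290 in Accuracy and 0.322 in ROC-AUC, which correspond to the 47.5% and 50.0% relative gains stated above.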
Related papers
- A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine [59.78991974851707]
Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. Most medical LLMs are trained on data from a single institution, which faces limitations in generalizability and safety in heterogeneous systems. We introduce the model-agnostic and parameter-efficient federated learning framework for adapting LLMs to medical applications.
arXiv Detail & Related papers (2026-01-29T18:48:21Z)
- Federated Proximal Optimization for Privacy-Preserving Heart Disease Prediction: A Controlled Simulation Study on Non-IID Clinical Data [1.620240963217448]
This paper presents a comprehensive simulation study of Federated Proximal Optimization (FedProx) for heart disease prediction based on the UCI Heart Disease dataset. We generate realistic non-IID data partitions by simulating four heterogeneous hospital clients from the Cleveland Clinic. Our results are directly transferable to hospital IT administrators implementing privacy-preserving collaborative learning.
arXiv Detail & Related papers (2026-01-23T21:18:08Z)
- FedCVD: The First Real-World Federated Learning Benchmark on Cardiovascular Disease Data [52.55123685248105]
Cardiovascular diseases (CVDs) are currently the leading cause of death worldwide, highlighting the critical need for early diagnosis and treatment.
Machine learning (ML) methods can help diagnose CVDs early, but their performance relies on access to substantial data with high quality.
This paper presents the first real-world FL benchmark for cardiovascular disease detection, named FedCVD.
arXiv Detail & Related papers (2024-10-28T02:24:01Z)
- Comparing Federated Stochastic Gradient Descent and Federated Averaging for Predicting Hospital Length of Stay [0.0]
Predicting hospital length of stay (LOS) reliably is an essential need for efficient resource allocation at hospitals.
Traditional predictive modeling tools frequently have difficulty acquiring sufficient and diverse data because healthcare institutions have privacy rules in place.
This modeling approach facilitates collaborative model training by modeling decentralized data sources from different hospitals without extracting sensitive data outside of hospitals.
arXiv Detail & Related papers (2024-07-17T17:00:20Z)
- Leveraging Federated Learning for Automatic Detection of Clopidogrel Treatment Failures [0.8132630541462695]
In this study, we leverage federated learning strategies to address clopidogrel treatment failure detection.
We partitioned the data based on geographic centers and evaluated the performance of federated learning.
Our findings underscore the potential of federated learning in addressing clopidogrel treatment failure detection.
arXiv Detail & Related papers (2024-03-05T23:31:07Z)
- Learning Federated Visual Prompt in Null Space for MRI Reconstruction [83.71117888610547]
We propose a new algorithm, FedPR, to learn federated visual prompts in the null space of global prompt for MRI reconstruction.
FedPR significantly outperforms state-of-the-art FL algorithms with 6% of communication costs when given the limited amount of local training data.
arXiv Detail & Related papers (2023-03-28T17:46:16Z)
- When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning Framework in Classification of Medical Images on Limited Data: A COVID-19 Case Study [77.34726150561087]
The COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNNs have been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z)
- Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence [79.038671794961]
We launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the AI model can be distributedly trained and independently executed at each host institution.
Our study is based on 9,573 chest computed tomography scans (CTs) from 3,336 patients collected from 23 hospitals located in China and the UK.
arXiv Detail & Related papers (2021-11-18T00:43:41Z)
- The pitfalls of using open data to develop deep learning solutions for COVID-19 detection in chest X-rays [64.02097860085202]
Deep learning models have been developed to identify COVID-19 from chest X-rays.
Results have been exceptional when training and testing on open-source data.
Data analysis and model evaluations show that the popular open-source dataset COVIDx is not representative of the real clinical problem.
arXiv Detail & Related papers (2021-09-14T10:59:11Z)
- Development of a Multi-Task Learning V-Net for Pulmonary Lobar Segmentation on Computed Tomography and Application to Diseased Lungs [0.19573380763700707]
Diseased lung regions often produce high-density zones on CT images, limiting an algorithm's execution to specify damaged lobes.
This impact motivated developing an improved machine learning method to segment lung lobes.
The approach can be readily adopted in the clinical setting as a robust tool for radiologists.
arXiv Detail & Related papers (2021-05-11T17:10:25Z)
- FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named Federated Learning on Medical Datasets using Partial Networks (FLOP).
arXiv Detail & Related papers (2021-02-10T01:56:58Z)
- BS-Net: learning COVID-19 pneumonia severity on a large Chest X-Ray dataset [6.5800499500032705]
We design an end-to-end deep learning architecture for predicting, on chest X-ray (CXR) images, a multi-regional score conveying the degree of lung compromise in COVID-19 patients.
We exploit a clinical dataset of almost 5,000 CXR annotated images collected in the same hospital.
Our solution outperforms single human annotators in rating accuracy and consistency.
arXiv Detail & Related papers (2020-06-08T13:55:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.