Federated Learning Enables Big Data for Rare Cancer Boundary Detection
- URL: http://arxiv.org/abs/2204.10836v2
- Date: Mon, 25 Apr 2022 20:07:43 GMT
- Title: Federated Learning Enables Big Data for Rare Cancer Boundary Detection
- Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han
Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada,
Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash
Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix
Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan
Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer, Soonmee Cha, Madhura
Ingalhalikar, Manali Jadhav, Umang Pandey, Jitender Saini, John Garrett,
Matthew Larson, Robert Jeraj, Stuart Currie, Russell Frood, Kavi Fatania,
Raymond Y Huang, Ken Chang, Carmen Balana, Jaume Capellades, Josep Puig,
Johannes Trenkler, Josef Pichler, Georg Necker, Andreas Haunschmidt, Stephan
Meckel, Gaurav Shukla, Spencer Liem, Gregory S Alexander, Joseph Lombardo,
Joshua D Palmer, Adam E Flanders, Adam P Dicker, Haris I Sair, Craig K Jones,
Archana Venkataraman, Meirui Jiang, Tiffany Y So, Cheng Chen, Pheng Ann Heng,
Qi Dou, Michal Kozubek, Filip Lux, Jan Mich\'alek, Petr Matula, Milo\v{s}
Ke\v{r}kovsk\'y, Tereza Kop\v{r}ivov\'a, Marek Dost\'al, V\'aclav Vyb\'ihal,
Michael A Vogelbaum, J Ross Mitchell, Joaquim Farinhas, Joseph A Maldjian,
Chandan Ganesh Bangalore Yogananda, Marco C Pinho, Divya Reddy, James
Holcomb, Benjamin C Wagner, Benjamin M Ellingson, Timothy F Cloughesy,
Catalina Raymond, Talia Oughourlian, Akifumi Hagiwara, Chencai Wang, Minh-Son
To, Sargam Bhardwaj, Chee Chong, Marc Agzarian, Alexandre Xavier Falc\~ao,
Samuel B Martins, Bernardo C A Teixeira, Fl\'avia Sprenger, David Menotti,
Diego R Lucio, Pamela LaMontagne, Daniel Marcus, Benedikt Wiestler, Florian
Kofler, Ivan Ezhov, Marie Metz, Rajan Jain, Matthew Lee, Yvonne W Lui,
Richard McKinley, Johannes Slotboom, Piotr Radojewski, Raphael Meier, Roland
Wiest, Derrick Murcia, Eric Fu, Rourke Haas, John Thompson, David Ryan
Ormond, Chaitra Badve, Andrew E Sloan, Vachan Vadmal, Kristin Waite, Rivka R
Colen, Linmin Pei, Murat Ak, Ashok Srinivasan, J Rajiv Bapuraj, Arvind Rao,
Nicholas Wang, Ota Yoshiaki, Toshio Moritani, Sevcan Turk, Joonsang Lee,
Snehal Prabhudesai, Fanny Mor\'on, Jacob Mandel, Konstantinos Kamnitsas, Ben
Glocker, Luke V M Dixon, Matthew Williams, Peter Zampakis, Vasileios
Panagiotopoulos, Panagiotis Tsiganos, Sotiris Alexiou, Ilias Haliassos,
Evangelia I Zacharaki, Konstantinos Moustakas, Christina Kalogeropoulou,
Dimitrios M Kardamakis, Yoon Seong Choi, Seung-Koo Lee, Jong Hee Chang, Sung
Soo Ahn, Bing Luo, Laila Poisson, Ning Wen, Pallavi Tiwari, Ruchika Verma,
Rohan Bareja, Ipsa Yadav, Jonathan Chen, Neeraj Kumar, Marion Smits,
Sebastian R van der Voort, Ahmed Alafandi, Fatih Incekara, Maarten MJ
Wijnenga, Georgios Kapsas, Renske Gahrmann, Joost W Schouten, Hendrikus J
Dubbink, Arnaud JPE Vincent, Martin J van den Bent, Pim J French, Stefan
Klein, Yading Yuan, Sonam Sharma, Tzu-Chi Tseng, Saba Adabi, Simone P Niclou,
Olivier Keunen, Ann-Christin Hau, Martin Valli\`eres, David Fortin, Martin
Lepage, Bennett Landman, Karthik Ramadass, Kaiwen Xu, Silky Chotai, Lola B
Chambless, Akshitkumar Mistry, Reid C Thompson, Yuriy Gusev, Krithika
Bhuvaneshwar, Anousheh Sayah, Camelia Bencheqroun, Anas Belouali, Subha
Madhavan, Thomas C Booth, Alysha Chelliah, Marc Modat, Haris Shuaib, Carmen
Dragos, Aly Abayazeed, Kenneth Kolodziej, Michael Hill, Ahmed Abbassy, Shady
Gamal, Mahmoud Mekhaimar, Mohamed Qayati, Mauricio Reyes, Ji Eun Park, Jihye
Yun, Ho Sung Kim, Abhishek Mahajan, Mark Muzi, Sean Benson, Regina G H
Beets-Tan, Jonas Teuwen, Alejandro Herrera-Trujillo, Maria Trujillo, William
Escobar, Ana Abello, Jose Bernal, Jhon G\'omez, Joseph Choi, Stephen Baek,
Yusung Kim, Heba Ismael, Bryan Allen, John M Buatti, Aikaterini Kotrotsou,
Hongwei Li, Tobias Weiss, Michael Weller, Andrea Bink, Bertrand Pouymayou,
Hassan F Shaykh, Joel Saltz, Prateek Prasanna, Sampurna Shrestha, Kartik M
Mani, David Payne, Tahsin Kurc, Enrique Pelaez, Heydy Franco-Maldonado,
Francis Loayza, Sebastian Quevedo, Pamela Guevara, Esteban Torche, Cristobal
Mendoza, Franco Vera, Elvis R\'ios, Eduardo L\'opez, Sergio A Velastin,
Godwin Ogbole, Dotun Oyekunle, Olubunmi Odafe-Oyibotha, Babatunde Osobu,
Mustapha Shu'aibu, Adeleye Dorcas, Mayowa Soneye, Farouk Dako, Amber L
Simpson, Mohammad Hamghalam, Jacob J Peoples, Ricky Hu, Anh Tran, Danielle
Cutler, Fabio Y Moraes, Michael A Boss, James Gimpel, Deepak Kattil Veettil,
Kendall Schmidt, Brian Bialecki, Sailaja Marella, Cynthia Price, Lisa Cimino,
Charles Apgar, Prashant Shah, Bjoern Menze, Jill S Barnholtz-Sloan, Jason
Martin, Spyridon Bakas
- Abstract summary: We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
- Score: 98.5549882883963
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Although machine learning (ML) has shown promise in numerous domains, there
are concerns about generalizability to out-of-sample data. This is currently
addressed by centrally sharing ample, and importantly diverse, data from
multiple sites. However, such centralization is challenging to scale (or even
not feasible) due to various limitations. Federated ML (FL) provides an
alternative to train accurate and generalizable ML models, by only sharing
numerical model updates. Here we present findings from the largest FL study
to-date, involving data from 71 healthcare institutions across 6 continents, to
generate an automatic tumor boundary detector for the rare disease of
glioblastoma, utilizing the largest dataset of such patients ever used in the
literature (25,256 MRI scans from 6,314 patients). We demonstrate a 33%
improvement over a publicly trained model to delineate the surgically
targetable tumor, and 23% improvement over the tumor's entire extent. We
anticipate our study to: 1) enable more studies in healthcare informed by large
and diverse data, ensuring meaningful results for rare diseases and
underrepresented populations, 2) facilitate further quantitative analyses for
glioblastoma via performance optimization of our consensus model for eventual
public release, and 3) demonstrate the effectiveness of FL at such scale and
task complexity as a paradigm shift for multi-site collaborations, alleviating
the need for data sharing.
Related papers
- Weakly supervised deep learning model with size constraint for prostate cancer detection in multiparametric MRI and generalization to unseen domains [0.90668179713299]
We show that the model achieves on-par performance with strong fully supervised baseline models.
We also observe a performance decrease for both fully supervised and weakly supervised models when tested on unseen data domains.
arXiv Detail & Related papers (2024-11-04T12:24:33Z) - Generating 3D Brain Tumor Regions in MRI using Vector-Quantization
Generative Adversarial Networks [5.380977479547755]
We present a novel framework that uses vector-quantization GAN and a transformer incorporating masked token modeling to generate high-resolution and diverse 3D brain tumor ROIs.
Our proposed method has the potential to facilitate an accurate diagnosis of rare brain tumors using MRI scans.
arXiv Detail & Related papers (2023-10-02T14:39:10Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Generative models improve fairness of medical classifiers under
distribution shifts [49.10233060774818]
We show that learning realistic augmentations automatically from data is possible in a label-efficient manner using generative models.
We demonstrate that these learned augmentations can surpass ones by making models more robust and statistically fair in- and out-of-distribution.
arXiv Detail & Related papers (2023-04-18T18:15:38Z) - Fader Networks for domain adaptation on fMRI: ABIDE-II study [68.5481471934606]
We use 3D convolutional autoencoders to build the domain irrelevant latent space image representation and demonstrate this method to outperform existing approaches on ABIDE data.
arXiv Detail & Related papers (2020-10-14T16:50:50Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Heterogeneity Loss to Handle Intersubject and Intrasubject Variability
in Cancer [11.440201348567681]
Deep learning (DL) models have shown impressive results in medical domain.
These AI methods can provide immense support to developing nations as affordable healthcare solutions.
This work is focused on one such application of blood cancer diagnosis.
arXiv Detail & Related papers (2020-03-06T16:16:23Z) - MS-Net: Multi-Site Network for Improving Prostate Segmentation with
Heterogeneous MRI Data [75.73881040581767]
We propose a novel multi-site network (MS-Net) for improving prostate segmentation by learning robust representations.
Our MS-Net improves the performance across all datasets consistently, and outperforms state-of-the-art methods for multi-site learning.
arXiv Detail & Related papers (2020-02-09T14:11:50Z) - Multi-site fMRI Analysis Using Privacy-preserving Federated Learning and
Domain Adaptation: ABIDE Results [13.615292855384729]
To train a high-quality deep learning model, the aggregation of a significant amount of patient information is required.
Due to the need to protect the privacy of patient data, it is hard to assemble a central database from multiple institutions.
Federated learning allows for population-level models to be trained without centralizing entities' data.
arXiv Detail & Related papers (2020-01-16T04:49:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.