ManyDG: Many-domain Generalization for Healthcare Applications
- URL: http://arxiv.org/abs/2301.08834v1
- Date: Sat, 21 Jan 2023 00:26:33 GMT
- Title: ManyDG: Many-domain Generalization for Healthcare Applications
- Authors: Chaoqi Yang, M Brandon Westover, Jimeng Sun
- Abstract summary: We develop a new domain generalization method ManyDG, that can scale to such many-domain problems.
Experiments show that ManyDG can boost the generalization performance on multiple real-world healthcare tasks.
- Score: 36.22753628246332
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The vast amount of health data has been continuously collected for each
patient, providing opportunities to support diverse healthcare predictive tasks
such as seizure detection and hospitalization prediction. Existing models are
mostly trained on other patients data and evaluated on new patients. Many of
them might suffer from poor generalizability. One key reason can be overfitting
due to the unique information related to patient identities and their data
collection environments, referred to as patient covariates in the paper. These
patient covariates usually do not contribute to predicting the targets but are
often difficult to remove. As a result, they can bias the model training
process and impede generalization. In healthcare applications, most existing
domain generalization methods assume a small number of domains. In this paper,
considering the diversity of patient covariates, we propose a new setting by
treating each patient as a separate domain (leading to many domains). We
develop a new domain generalization method ManyDG, that can scale to such
many-domain problems. Our method identifies the patient domain covariates by
mutual reconstruction and removes them via an orthogonal projection step.
Extensive experiments show that ManyDG can boost the generalization performance
on multiple real-world healthcare tasks (e.g., 3.7% Jaccard improvements on
MIMIC drug recommendation) and support realistic but challenging settings such
as insufficient data and continuous learning.
Related papers
- Continually Evolved Multimodal Foundation Models for Cancer Prognosis [50.43145292874533]
Cancer prognosis is a critical task that involves predicting patient outcomes and survival rates.
Previous studies have integrated diverse data modalities, such as clinical notes, medical images, and genomic data, leveraging their complementary information.
Existing approaches face two major limitations. First, they struggle to incorporate newly arrived data with varying distributions into training, such as patient records from different hospitals.
Second, most multimodal integration methods rely on simplistic concatenation or task-specific pipelines, which fail to capture the complex interdependencies across modalities.
arXiv Detail & Related papers (2025-01-30T06:49:57Z) - A Contrastive Pretrain Model with Prompt Tuning for Multi-center Medication Recommendation [50.43785306804359]
We introduce a novel conTrastive prEtrain Model with Prompt Tuning (TEMPT) for multi-center medication recommendation.
We devise a novel prompt tuning method to capture the specific information of each hospital rather than adopting the common finetuning.
To validate the proposed model, we conduct extensive experiments on the public eICU multi-center medical dataset.
arXiv Detail & Related papers (2024-12-28T06:12:02Z) - Generalization in medical AI: a perspective on developing scalable
models [3.003979691986621]
Many prestigious journals now require reporting results both on the local hidden test set as well as on external datasets.
This is because of the variability encountered in intended use and specificities across hospital cultures.
We establish a hierarchical three-level scale system reflecting the generalization level of a medical AI algorithm.
arXiv Detail & Related papers (2023-11-09T14:54:28Z) - DGM-DR: Domain Generalization with Mutual Information Regularized
Diabetic Retinopathy Classification [40.35834579068518]
Domain shift between training and testing data presents a significant challenge for training general deep learning models.
We introduce a DG method that re-establishes the model objective function as a pretrained model to the medical imaging field.
Our proposed method consistently outperforms the previous state-of-the-art by a margin of 5.25% in average accuracy and a lower standard deviation.
arXiv Detail & Related papers (2023-09-18T11:17:13Z) - Domain Generalization with Adversarial Intensity Attack for Medical
Image Segmentation [27.49427483473792]
In real-world scenarios, it is common for models to encounter data from new and different domains to which they were not exposed to during training.
domain generalization (DG) is a promising direction as it enables models to handle data from previously unseen domains.
We introduce a novel DG method called Adversarial Intensity Attack (AdverIN), which leverages adversarial training to generate training data with an infinite number of styles.
arXiv Detail & Related papers (2023-04-05T19:40:51Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - Privacy-Preserving Constrained Domain Generalization via Gradient
Alignment [37.916630896194334]
Deep neural networks (DNN) have demonstrated unprecedented success for medical imaging applications.
Due to the issue of limited dataset availability and the strict legal and ethical requirements for patient privacy protection, the broad applications of medical imaging classification have been largely hindered.
In this paper, we aim to tackle this problem by developing the privacy-preserving constrained domain generalization method.
arXiv Detail & Related papers (2021-05-14T15:21:13Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Domain Generalization for Medical Imaging Classification with
Linear-Dependency Regularization [59.5104563755095]
We introduce a simple but effective approach to improve the generalization capability of deep neural networks in the field of medical imaging classification.
Motivated by the observation that the domain variability of the medical images is to some extent compact, we propose to learn a representative feature space through variational encoding.
arXiv Detail & Related papers (2020-09-27T12:30:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.