Adapting Machine Learning Diagnostic Models to New Populations Using a
Small Amount of Data: Results from Clinical Neuroscience
- URL: http://arxiv.org/abs/2308.03175v1
- Date: Sun, 6 Aug 2023 18:05:39 GMT
- Title: Adapting Machine Learning Diagnostic Models to New Populations Using a
Small Amount of Data: Results from Clinical Neuroscience
- Authors: Rongguang Wang, Guray Erus, Pratik Chaudhari, Christos Davatzikos
- Abstract summary: We develop a weighted empirical risk minimization approach that optimally combines data from a source group to make predictions on a target group.
We apply this method to multi-source data of 15,363 individuals from 20 neuroimaging studies to build ML models for diagnosis of Alzheimer's disease and estimation of brain age.
- Score: 17.161866044628205
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine learning (ML) has shown great promise for revolutionizing a number of
areas, including healthcare. However, it is also facing a reproducibility
crisis, especially in medicine. ML models that are carefully constructed from
and evaluated on a training set might not generalize well on data from
different patient populations or acquisition instrument settings and protocols.
We tackle this problem in the context of neuroimaging of Alzheimer's disease
(AD), schizophrenia (SZ) and brain aging. We develop a weighted empirical risk
minimization approach that optimally combines data from a source group, e.g.,
subjects are stratified by attributes such as sex, age group, race and clinical
cohort to make predictions on a target group, e.g., other sex, age group, etc.
using a small fraction (10%) of data from the target group. We apply this
method to multi-source data of 15,363 individuals from 20 neuroimaging studies
to build ML models for diagnosis of AD and SZ, and estimation of brain age. We
found that this approach achieves substantially better accuracy than existing
domain adaptation techniques: it obtains area under curve greater than 0.95 for
AD classification, area under curve greater than 0.7 for SZ classification and
mean absolute error less than 5 years for brain age prediction on all target
groups, achieving robustness to variations of scanners, protocols, and
demographic or clinical characteristics. In some cases, it is even better than
training on all data from the target group, because it leverages the diversity
and size of a larger training set. We also demonstrate the utility of our
models for prognostic tasks such as predicting disease progression in
individuals with mild cognitive impairment. Critically, our brain age
prediction models lead to new clinical insights regarding correlations with
neurophysiological tests.
Related papers
- Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - A Hybrid Transfer Learning Assisted Decision Support System for Accurate
Prediction of Alzheimer Disease [0.0]
Alzheimer's disease is the most common long-term illness in elderly people.
Deep neural model is more accurate and effective than general machine learning.
arXiv Detail & Related papers (2023-10-13T06:48:38Z) - Incomplete Multimodal Learning for Complex Brain Disorders Prediction [65.95783479249745]
We propose a new incomplete multimodal data integration approach that employs transformers and generative adversarial networks.
We apply our new method to predict cognitive degeneration and disease outcomes using the multimodal imaging genetic data from Alzheimer's Disease Neuroimaging Initiative cohort.
arXiv Detail & Related papers (2023-05-25T16:29:16Z) - Machine Learning Models Are Not Necessarily Biased When Constructed
Properly: Evidence from Neuroimaging Studies [19.288217559980545]
We provide experimental data which support that when properly trained, machine learning models can generalize well across diverse conditions.
Specifically, by using multi-study magnetic resonance imaging consortia for diagnosing Alzheimer's disease, schizophrenia, and autism spectrum disorder, we find that, the accuracy of well-trained models is consistent across different subgroups.
arXiv Detail & Related papers (2022-05-26T15:24:39Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Application of Machine Learning to Predict the Risk of Alzheimer's
Disease: An Accurate and Practical Solution for Early Diagnostics [1.1470070927586016]
Alzheimer's Disease (AD) ravages the cognitive ability of more than 5 million Americans and creates an enormous strain on the health care system.
This paper proposes a machine learning predictive model for AD development without medical imaging and with fewer clinical visits and tests.
Our model is trained and validated using demographic, biomarker and cognitive test data from two prominent research studies.
arXiv Detail & Related papers (2020-06-02T14:52:51Z) - Risk of Training Diagnostic Algorithms on Data with Demographic Bias [0.5599792629509227]
We conduct a survey of the MICCAI 2018 proceedings to investigate the common practice in medical image analysis applications.
Surprisingly, we found that papers focusing on diagnosis rarely describe the demographics of the datasets used.
We show that it is possible to learn unbiased features by explicitly using demographic variables in an adversarial training setup.
arXiv Detail & Related papers (2020-05-20T13:51:01Z) - 1-D Convlutional Neural Networks for the Analysis of Pupil Size
Variations in Scotopic Conditions [79.71065005161566]
1-D convolutional neural network models are trained for classification of short-range sequences.
Model provides prediction with high average accuracy on a hold out test set.
arXiv Detail & Related papers (2020-02-06T17:25:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.