Using Machine Learning Techniques to Identify Key Risk Factors for
Diabetes and Undiagnosed Diabetes
- URL: http://arxiv.org/abs/2105.09379v1
- Date: Wed, 19 May 2021 20:02:35 GMT
- Title: Using Machine Learning Techniques to Identify Key Risk Factors for
Diabetes and Undiagnosed Diabetes
- Authors: Avraham Adler
- Abstract summary: This paper reviews a wide selection of machine learning models built to predict the presence of diabetes and the presence of undiagnosed diabetes.
The most relevant variables of the best performing models are then compared.
Blood osmolality, family history, the prevalance of various compounds, and hypertension are key indicators for all diabetes risk.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper reviews a wide selection of machine learning models built to
predict both the presence of diabetes and the presence of undiagnosed diabetes
using eight years of National Health and Nutrition Examination Survey (NHANES)
data. Models are tuned and compared via their Brier Scores. The most relevant
variables of the best performing models are then compared. A Support Vector
Machine with a linear kernel performed best for predicting diabetes, returning
a Brier score of 0.0654 and an AUROC of 0.9235 on the test set. An elastic net
regression performed best for predicting undiagnosed diabetes with a Brier
score of 0.0294 and an AUROC of 0.9439 on the test set. Similar features appear
prominently in the models for both sets of models. Blood osmolality, family
history, the prevalance of various compounds, and hypertension are key
indicators for all diabetes risk. For undiagnosed diabetes in particular, there
are ethnicity or genetic components which arise as strong correlates as well.
Related papers
- Brain Tumor Classification on MRI in Light of Molecular Markers [61.77272414423481]
Co-deletion of the 1p/19q gene is associated with clinical outcomes in low-grade gliomas.
This study aims to utilize a specially MRI-based convolutional neural network for brain cancer detection.
arXiv Detail & Related papers (2024-09-29T07:04:26Z) - From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis [50.80532910808962]
We present GluFormer, a generative foundation model on biomedical temporal data based on a transformer architecture.
GluFormer generalizes to 15 different external datasets, including 4936 individuals across 5 different geographical regions.
It can also predict onset of future health outcomes even 4 years in advance.
arXiv Detail & Related papers (2024-08-20T13:19:06Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - DiabetesNet: A Deep Learning Approach to Diabetes Diagnosis [6.095029229301643]
Experimental results on three datasets show significant improvements in overall accuracy, sensitivity, and specificity compared to traditional methods.
This underscores the potential of deep learning models for robust diabetes diagnosis.
arXiv Detail & Related papers (2024-03-12T10:18:59Z) - Supervised Learning Models for Early Detection of Albuminuria Risk in
Type-2 Diabetes Mellitus Patients [0.0]
This study aimed to develop a supervised learning model to predict the risk of developing albuminuria in T2DM patients.
It consisted of 10 attributes as features and 1 attribute as the target (albuminuria)
It achieved accuracy and f1-score values as high as 0.74 and 0.75, respectively, making it suitable for screening purposes in predicting albuminuria in T2DM.
arXiv Detail & Related papers (2023-09-28T08:41:12Z) - Prognosis and Treatment Prediction of Type-2 Diabetes Using Deep Neural
Network and Machine Learning Classifiers [1.1470070927586016]
The motion of this research is a comparative study of seven machine learning classifiers and an artificial neural network method to prognosticate the detection and treatment of diabetes with high accuracy.
Our training and test dataset is an accumulation of 9483 diabetes patients information.
Our high-performing model can be used by hospitals to predict diabetes and drive research into more accurate prediction models.
arXiv Detail & Related papers (2023-01-08T19:10:20Z) - On the explainability of hospitalization prediction on a large COVID-19
patient dataset [45.82374977939355]
We develop various AI models to predict hospitalization on a large (over 110$k$) cohort of COVID-19 positive-tested US patients.
Despite high data unbalance, the models reach average precision 0.96-0.98 (0.75-0.85), recall 0.96-0.98 (0.74-0.85), and $F_score 0.97-0.98 (0.79-0.83) on the non-hospitalized (or hospitalized) class.
arXiv Detail & Related papers (2021-10-28T10:23:38Z) - Development of a dynamic type 2 diabetes risk prediction tool: a UK
Biobank study [0.8620335948752806]
We developed a predictive 10-year type 2 diabetes risk score using 301 features from the UK Biobank dataset.
A Cox proportional hazards model slightly overperformed a DeepSurv model trained using the same features.
This tool can be used for clinical screening of individuals at risk of developing type 2 diabetes and to foster patient empowerment.
arXiv Detail & Related papers (2021-04-20T16:37:26Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - GLYFE: Review and Benchmark of Personalized Glucose Predictive Models in
Type-1 Diabetes [4.17510581764131]
GLYFE is a benchmark of machine-learning-based glucose-predictive models.
The results of nine different models coming from the glucose-prediction literature are presented.
arXiv Detail & Related papers (2020-06-29T11:34:41Z) - Short Term Blood Glucose Prediction based on Continuous Glucose
Monitoring Data [53.01543207478818]
This study explores the use of Continuous Glucose Monitoring (CGM) data as input for digital decision support tools.
We investigate how Recurrent Neural Networks (RNNs) can be used for Short Term Blood Glucose (STBG) prediction.
arXiv Detail & Related papers (2020-02-06T16:39:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.