ANOVA-based Automatic Attribute Selection and a Predictive Model for
Heart Disease Prognosis
- URL: http://arxiv.org/abs/2208.00296v1
- Date: Sat, 30 Jul 2022 19:29:18 GMT
- Title: ANOVA-based Automatic Attribute Selection and a Predictive Model for
Heart Disease Prognosis
- Authors: Mohammed Nowshad Ruhani Chowdhury, Wandong Zhang, Thangarajah Akilan
- Abstract summary: This work proposes an information fusion technique that combines key attributes of a person through analysis of variance (ANOVA) and domain experts' knowledge.
The proposed approach can achieve a competitive mean average accuracy (mAA) of 99.2% and a mean average AUC of 97.9%.
- Score: 8.258177970935085
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Studies show that Studies that cardiovascular diseases (CVDs) are malignant
for human health. Thus, it is important to have an efficient way of CVD
prognosis. In response to this, the healthcare industry has adopted machine
learning-based smart solutions to alleviate the manual process of CVD
prognosis. Thus, this work proposes an information fusion technique that
combines key attributes of a person through analysis of variance (ANOVA) and
domain experts' knowledge. It also introduces a new collection of CVD data
samples for emerging research. There are thirty-eight experiments conducted
exhaustively to verify the performance of the proposed framework on four
publicly available benchmark datasets and the newly created dataset in this
work. The ablation study shows that the proposed approach can achieve a
competitive mean average accuracy (mAA) of 99.2% and a mean average AUC of
97.9%.
Related papers
- Enhancing Readmission Prediction with Deep Learning: Extracting Biomedical Concepts from Clinical Texts [0.26813152817733554]
This study focuses on predicting patient readmission within less than 30 days using text mining techniques.
Various machine learning and deep learning methods were employed to develop a classification model for this purpose.
arXiv Detail & Related papers (2024-03-12T09:03:44Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Improving VTE Identification through Adaptive NLP Model Selection and
Clinical Expert Rule-based Classifier from Radiology Reports [2.0637891440066363]
Venous thromboembolism (VTE) is a severe cardiovascular condition including deep vein thrombosis (DVT) and pulmonary embolism (PE)
automated methods have shown promising advancements in identifying VTE events from retrospective data cohorts or aiding clinical experts in identifying VTE events from radiology reports.
However, effectively training Deep Learning (DL) and the NLP models is challenging due to limited labeled medical text data, the complexity and heterogeneity of radiology reports, and data imbalance.
This study proposes novel method combinations of DL methods, along with data augmentation, adaptive pre-trained NLP model selection, and a clinical expert NLP rule-based
arXiv Detail & Related papers (2023-09-21T17:29:37Z) - AI Framework for Early Diagnosis of Coronary Artery Disease: An
Integration of Borderline SMOTE, Autoencoders and Convolutional Neural
Networks Approach [0.44998333629984877]
We develop a methodology for balancing and augmenting data for more accurate prediction when the data is imbalanced and the sample size is small.
The experimental results revealed that the average accuracy of our proposed method for CAD prediction was 95.36, and was higher than random forest (RF), decision tree (DT), support vector machine (SVM), logistic regression (LR), and artificial neural network (ANN)
arXiv Detail & Related papers (2023-08-29T14:33:38Z) - Supervised multi-specialist topic model with applications on large-scale
electronic health record data [3.322262654060203]
We present MixEHR-S to jointly infer specialist-disease topics from the EHR data.
For efficient inference, we developed a closed-form collapsed variational inference algorithm.
In three applications, MixEHR-S conferred clinically meaningful latent topics among the most predictive latent topics.
arXiv Detail & Related papers (2021-05-04T01:27:11Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.