Predicting Heart Disease and Reducing Survey Time Using Machine Learning
Algorithms
- URL: http://arxiv.org/abs/2306.00023v1
- Date: Tue, 30 May 2023 21:15:21 GMT
- Title: Predicting Heart Disease and Reducing Survey Time Using Machine Learning
Algorithms
- Authors: Salahaldeen Rababa, Asma Yamin, Shuxia Lu and Ashraf Obaidat
- Abstract summary: The Centers for Disease Control and Prevention (CDC) conducts a health-related telephone survey yearly from over 400,000 participants.
This study aims to utilize several machine learning techniques, such as support vector machines and logistic regression, to investigate the accuracy of the CDC's heart disease survey in the United States.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Currently, many researchers and analysts are working toward medical diagnosis
enhancement for various diseases. Heart disease is one of the common diseases
that can be considered a significant cause of mortality worldwide. Early
detection of heart disease significantly helps in reducing the risk of heart
failure. Consequently, the Centers for Disease Control and Prevention (CDC)
conducts a health-related telephone survey yearly from over 400,000
participants. However, several concerns arise regarding the reliability of the
data in predicting heart disease and whether all of the survey questions are
strongly related. This study aims to utilize several machine learning
techniques, such as support vector machines and logistic regression, to
investigate the accuracy of the CDC's heart disease survey in the United
States. Furthermore, we use various feature selection methods to identify the
most relevant subset of questions that can be utilized to forecast heart
conditions. To reach a robust conclusion, we perform stability analysis by
randomly sampling the data 300 times. The experimental results show that the
survey data can be useful up to 80% in terms of predicting heart disease, which
significantly improves the diagnostic process before bloodwork and tests. In
addition, the amount of time spent conducting the survey can be reduced by 77%
while maintaining the same level of performance.
Related papers
- Multi-class heart disease Detection, Classification, and Prediction using Machine Learning Models [0.5018974919510383]
Heart disease is a leading cause of premature death worldwide, particularly among middle-aged and older adults.
Non-communicable diseases, including heart disease, account for 25% (17.9 million) of global deaths, with over 43,204 annual fatalities in Bangladesh.
arXiv Detail & Related papers (2024-12-06T05:55:41Z) - Predicting Coronary Heart Disease Using a Suite of Machine Learning Models [0.1979158763744267]
Coronary Heart Disease affects millions of people worldwide and is a well-studied area of healthcare.
There are many viable and accurate methods for the diagnosis and prediction of heart disease, but they have limiting points such as invasiveness, late detection, or cost.
Supervised learning via machine learning algorithms presents a low-cost (computationally speaking), non-invasive solution that can be a precursor for early diagnosis.
arXiv Detail & Related papers (2024-09-21T19:22:41Z) - Ensemble Framework for Cardiovascular Disease Prediction [0.0]
Heart disease is the major cause of non-communicable and silent death worldwide.
We have proposed a framework with a stacked ensemble using several machine learning algorithms including ExtraTrees, Random Forest, XGBoost, and so on.
Our proposed framework attained an accuracy of 92.34% which is higher than the existing literature.
arXiv Detail & Related papers (2023-06-16T17:37:43Z) - Analyzing the impact of feature selection on the accuracy of heart
disease prediction [0.5249805590164902]
The aim of this research is to identify the most important risk-factors from a highly dimensional dataset which helps in the accurate classification of heart disease with less complications.
The performance of the classification models improved significantly with a reduced training time as compared with models trained on full feature set.
arXiv Detail & Related papers (2022-06-07T12:51:37Z) - Investigating myocardial infarction and its effects in patients with
urgent medical problems using advanced data mining tools [0.0]
Myocardial infarction is a serious risk factor in mortality.
The purpose of the present study is to utilize data analysis algorithms and compare their accuracy in patients with a heart attack.
arXiv Detail & Related papers (2021-12-15T05:10:55Z) - A random shuffle method to expand a narrow dataset and overcome the
associated challenges in a clinical study: a heart failure cohort example [50.591267188664666]
The aim of this study was to design a random shuffle method to enhance the cardinality of an HF dataset while it is statistically legitimate.
The proposed random shuffle method was able to enhance the HF dataset cardinality circa 10 times and circa 21 times when followed by a random repeated-measures approach.
arXiv Detail & Related papers (2020-12-12T10:59:38Z) - Identification of Ischemic Heart Disease by using machine learning
technique based on parameters measuring Heart Rate Variability [50.591267188664666]
In this study, 18 non-invasive features (age, gender, left ventricular ejection fraction and 15 obtained from HRV) of 243 subjects were used to train and validate a series of several ANN.
The best result was obtained using 7 input parameters and 7 hidden nodes with an accuracy of 98.9% and 82% for the training and validation dataset.
arXiv Detail & Related papers (2020-10-29T19:14:41Z) - Predicting Parkinson's Disease with Multimodal Irregularly Collected
Longitudinal Smartphone Data [75.23250968928578]
Parkinsons Disease is a neurological disorder and prevalent in elderly people.
Traditional ways to diagnose the disease rely on in-person subjective clinical evaluations on the quality of a set of activity tests.
We propose a novel time-series based approach to predicting Parkinson's Disease with raw activity test data collected by smartphones in the wild.
arXiv Detail & Related papers (2020-09-25T01:50:15Z) - Prediction of the onset of cardiovascular diseases from electronic
health records using multi-task gated recurrent units [51.14334174570822]
We propose a multi-task recurrent neural network with attention mechanism for predicting cardiovascular events from electronic health records.
The proposed approach is compared to a standard clinical risk predictor (QRISK) and machine learning alternatives using 5-year data from a NHS Foundation Trust.
arXiv Detail & Related papers (2020-07-16T17:43:13Z) - Robustness to Spurious Correlations via Human Annotations [100.63051542531171]
We present a framework for making models robust to spurious correlations by leveraging humans' common sense knowledge of causality.
Specifically, we use human annotation to augment each training example with a potential unmeasured variable.
We then introduce a new distributionally robust optimization objective over unmeasured variables (UV-DRO) to control the worst-case loss over possible test-time shifts.
arXiv Detail & Related papers (2020-07-13T20:05:19Z) - Joint Prediction and Time Estimation of COVID-19 Developing Severe
Symptoms using Chest CT Scan [49.209225484926634]
We propose a joint classification and regression method to determine whether the patient would develop severe symptoms in the later time.
To do this, the proposed method takes into account 1) the weight for each sample to reduce the outliers' influence and explore the problem of imbalance classification.
Our proposed method yields 76.97% of accuracy for predicting the severe cases, 0.524 of the correlation coefficient, and 0.55 days difference for the converted time.
arXiv Detail & Related papers (2020-05-07T12:16:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.