Analyzing the impact of feature selection on the accuracy of heart
disease prediction
- URL: http://arxiv.org/abs/2206.03239v1
- Date: Tue, 7 Jun 2022 12:51:37 GMT
- Title: Analyzing the impact of feature selection on the accuracy of heart
disease prediction
- Authors: Muhammad Salman Pathan, Avishek Nag, Muhammad Mohisn Pathan, and
Soumyabrata Dev
- Abstract summary: The aim of this research is to identify the most important risk-factors from a highly dimensional dataset which helps in the accurate classification of heart disease with less complications.
The performance of the classification models improved significantly with a reduced training time as compared with models trained on full feature set.
- Score: 0.5249805590164902
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Heart Disease has become one of the most serious diseases that has a
significant impact on human life. It has emerged as one of the leading causes
of mortality among the people across the globe during the last decade. In order
to prevent patients from further damage, an accurate diagnosis of heart disease
on time is an essential factor. Recently we have seen the usage of non-invasive
medical procedures, such as artificial intelligence-based techniques in the
field of medical. Specially machine learning employs several algorithms and
techniques that are widely used and are highly useful in accurately diagnosing
the heart disease with less amount of time. However, the prediction of heart
disease is not an easy task. The increasing size of medical datasets has made
it a complicated task for practitioners to understand the complex feature
relations and make disease predictions. Accordingly, the aim of this research
is to identify the most important risk-factors from a highly dimensional
dataset which helps in the accurate classification of heart disease with less
complications. For a broader analysis, we have used two heart disease datasets
with various medical features. The classification results of the benchmarked
models proved that there is a high impact of relevant features on the
classification accuracy. Even with a reduced number of features, the
performance of the classification models improved significantly with a reduced
training time as compared with models trained on full feature set.
Related papers
- A data balancing approach towards design of an expert system for Heart Disease Prediction [0.9895793818721335]
Heart disease is a serious global health issue that claims millions of lives every year.
We employed five machine learning methods in this paper: Decision Tree (DT), Random Forest (RF), Linear Discriminant Analysis, Extra TreeBoost, and AdaBoost.
The accuracy of the Random Forest and Decision Tree model was 99.83%.
arXiv Detail & Related papers (2024-07-26T08:56:13Z) - Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary
Task Integration [54.76511683427566]
This research introduces a novel multimodal method for classifying skin lesions, integrating smartphone-captured images with essential clinical and demographic information.
A distinctive aspect of this method is the integration of an auxiliary task focused on super-resolution image prediction.
The experimental evaluations have been conducted using the PAD-UFES20 dataset, applying various deep-learning architectures.
arXiv Detail & Related papers (2024-02-16T05:16:20Z) - Evaluating The Accuracy of Classification Algorithms for Detecting Heart
Disease Risk [0.0]
This work utilizes the classification algorithms with a medical dataset of heart disease.
The performance of the utilized algorithms was evaluated using standard metrics such as accuracy, sensitivity and specificity.
The results indicate that the best algorithm for predicting heart disease was Random Forest with an accuracy of 99.24%.
arXiv Detail & Related papers (2023-12-06T06:41:48Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Ensemble Framework for Cardiovascular Disease Prediction [0.0]
Heart disease is the major cause of non-communicable and silent death worldwide.
We have proposed a framework with a stacked ensemble using several machine learning algorithms including ExtraTrees, Random Forest, XGBoost, and so on.
Our proposed framework attained an accuracy of 92.34% which is higher than the existing literature.
arXiv Detail & Related papers (2023-06-16T17:37:43Z) - Deep Reinforcement Learning Framework for Thoracic Diseases
Classification via Prior Knowledge Guidance [49.87607548975686]
The scarcity of labeled data for related diseases poses a huge challenge to an accurate diagnosis.
We propose a novel deep reinforcement learning framework, which introduces prior knowledge to direct the learning of diagnostic agents.
Our approach's performance was demonstrated using the well-known NIHX-ray 14 and CheXpert datasets.
arXiv Detail & Related papers (2023-06-02T01:46:31Z) - Novel Deep Learning Architecture for Heart Disease Prediction using
Convolutional Neural Network [0.0]
Heart disease is one of the deadliest diseases which is hampering the lives of many people around the world.
This paper proposes a novel deep learning architecture using a 1D convolutional neural network for classification between healthy and non-healthy persons.
The proposed network achieves over 97% training accuracy and 96% test accuracy on the dataset.
arXiv Detail & Related papers (2021-05-22T22:00:57Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z) - Feature selection for medical diagnosis: Evaluation for using a hybrid
Stacked-Genetic approach in the diagnosis of heart disease [0.0]
Heart disease has been one of the most important causes of death in the last 10 years.
We propose an ensemble-genetic learning method using wrapper feature reduction to select features in disease classification.
arXiv Detail & Related papers (2021-03-15T07:31:14Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Self-Training with Improved Regularization for Sample-Efficient Chest
X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios.
Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.