Identifying Stroke Indicators Using Rough Sets
- URL: http://arxiv.org/abs/2110.10152v1
- Date: Tue, 19 Oct 2021 06:04:48 GMT
- Title: Identifying Stroke Indicators Using Rough Sets
- Authors: Muhammad Salman Pathan, Jianbiao Zhang, Deepu John, Avishek Nag, and
Soumyabrata Dev
- Abstract summary: We propose a novel rough-set based technique for ranking the importance of the various EHR records in detecting stroke.
Age, average glucose level, heart disease, and hypertension were the most essential attributes for detecting stroke in patients.
- Score: 0.7340017786387767
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Stroke is widely considered as the second most common cause of mortality. The
adverse consequences of stroke have led to global interest and work for
improving the management and diagnosis of stroke. Various techniques for data
mining have been used globally for accurate prediction of occurrence of stroke
based on the risk factors that are associated with the electronic health care
records (EHRs) of the patients. In particular, EHRs routinely contain several
thousands of features and most of them are redundant and irrelevant that need
to be discarded to enhance the prediction accuracy. The choice of
feature-selection methods can help in improving the prediction accuracy of the
model and efficient data management of the archived input features. In this
paper, we systematically analyze the various features in EHR records for the
detection of stroke. We propose a novel rough-set based technique for ranking
the importance of the various EHR records in detecting stroke. Unlike the
conventional rough-set techniques, our proposed technique can be applied on any
dataset that comprises binary feature sets. We evaluated our proposed method in
a publicly available dataset of EHR, and concluded that age, average glucose
level, heart disease, and hypertension were the most essential attributes for
detecting stroke in patients. Furthermore, we benchmarked the proposed
technique with other popular feature-selection techniques. We obtained the best
performance in ranking the importance of individual features in detecting
stroke.
Related papers
- Machine Learning for ALSFRS-R Score Prediction: Making Sense of the Sensor Data [44.99833362998488]
Amyotrophic Lateral Sclerosis (ALS) is a rapidly progressive neurodegenerative disease that presents individuals with limited treatment options.
The present investigation, spearheaded by the iDPP@CLEF 2024 challenge, focuses on utilizing sensor-derived data obtained through an app.
arXiv Detail & Related papers (2024-07-10T19:17:23Z) - ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems [40.838320650137625]
This paper presents ERASE, a comprehensive bEnchmaRk for feAture SElection for Deep Recommender Systems (DRS)
ERASE comprises a thorough evaluation of eleven feature selection methods, covering both traditional and deep learning approaches.
Our code is available online for ease of reproduction.
arXiv Detail & Related papers (2024-03-19T11:49:35Z) - Improving Diffusion Models for ECG Imputation with an Augmented Template
Prior [43.6099225257178]
noisy and poor-quality recordings are a major issue for signals collected using mobile health systems.
Recent studies have explored the imputation of missing values in ECG with probabilistic time-series models.
We present a template-guided denoising diffusion probabilistic model (DDPM), PulseDiff, which is conditioned on an informative prior for a range of health conditions.
arXiv Detail & Related papers (2023-10-24T11:34:15Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - A predictive analytics approach for stroke prediction using machine
learning and neural networks [4.984181486695979]
This paper systematically analyzes the various factors in electronic health records for effective stroke prediction.
Age, heart disease, average glucose level, and hypertension are the most important factors for detecting stroke in patients.
A perceptron neural network using these four attributes provides the highest accuracy rate and lowest miss rate.
arXiv Detail & Related papers (2022-03-01T14:45:15Z) - A Graph-based Imputation Method for Sparse Medical Records [3.136861161060886]
We propose a graph-based imputation method that is robust to sparsity and to unreliable unmeasured events.
Results indicate that the model learns to embed different event types in a clinically meaningful way.
arXiv Detail & Related papers (2021-11-17T13:06:08Z) - A Profile-Based Binary Feature Extraction Method Using Frequent Itemsets
for Improving Coronary Artery Disease Diagnosis [0.0]
This paper introduces a CAD diagnosis method with a novel feature extraction technique called the Profile-Based Binary Feature Extraction (PBBFE)
In this method, after partitioning numerical features, frequent itemsets are extracted by the Apriori algorithm and then used as features to increase the CAD diagnosis accuracy.
The proposed method was tested on the Z-Alizadeh Sani dataset, which is one the richest databases in the field of CAD.
arXiv Detail & Related papers (2021-09-22T18:33:45Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Curvature-based Feature Selection with Application in Classifying
Electronic Health Records [13.427883408828642]
We propose an efficient curvature-based feature selection method for supporting more precise diagnosis.
Our method achieves state-of-the-art performance on four benchmark healthcare data sets.
arXiv Detail & Related papers (2021-01-10T16:55:40Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - An efficient representation of chronological events in medical texts [9.118144540451514]
We proposed a systematic methodology for learning from chronological events available in clinical notes.
The proposed methodological it path signature framework creates a non-parametric hierarchical representation of sequential events of any type.
The methodology was developed and externally validated using the largest in the UK secondary care mental health EHR data.
arXiv Detail & Related papers (2020-10-16T14:54:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.