A Novel Cluster Detection of COVID-19 Patients and Medical Disease
Conditions Using Improved Evolutionary Clustering Algorithm Star
- URL: http://arxiv.org/abs/2109.09492v1
- Date: Mon, 20 Sep 2021 12:47:09 GMT
- Title: A Novel Cluster Detection of COVID-19 Patients and Medical Disease
Conditions Using Improved Evolutionary Clustering Algorithm Star
- Authors: Bryar A. Hassan, Tarik A. Rashid, Hozan K. Hamarashid
- Abstract summary: We improve the current evolutionary clustering algorithm star (ECA*), called iECA*, in three manners.
Experiments were conducted to examine the performance of iECA* against state-of-the-art algorithms.
- Score: 0.9990687944474739
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: With the increasing number of samples, the manual clustering of COVID-19 and
medical disease data samples becomes time-consuming and requires highly skilled
labour. Recently, several algorithms have been used for clustering medical
datasets deterministically; however, these definitions have not been effective
in grouping and analysing medical diseases. The use of evolutionary clustering
algorithms may help to effectively cluster these diseases. On this presumption,
we improved the current evolutionary clustering algorithm star (ECA*), called
iECA*, in three manners: (i) utilising the elbow method to find the correct
number of clusters; (ii) cleaning and processing data as part of iECA* to apply
it to multivariate and domain-theory datasets; (iii) using iECA* for real-world
applications in clustering COVID-19 and medical disease datasets. Experiments
were conducted to examine the performance of iECA* against state-of-the-art
algorithms using performance and validation measures (validation measures,
statistical benchmarking, and performance ranking framework). The results
demonstrate three primary findings. First, iECA* was more effective than other
algorithms in grouping the chosen medical disease datasets according to the
cluster validation criteria. Second, iECA* exhibited the lower execution time
and memory consumption for clustering all the datasets, compared to the current
clustering methods analysed. Third, an operational framework was proposed to
rate the effectiveness of iECA* against other algorithms in the datasets
analysed, and the results indicated that iECA* exhibited the best performance
in clustering all medical datasets. Further research is required on real-world
multi-dimensional data containing complex knowledge fields for experimental
verification of iECA* compared to evolutionary algorithms.
Related papers
- Artificial Data Point Generation in Clustered Latent Space for Small
Medical Datasets [4.542616945567623]
This paper introduces a novel method, Artificial Data Point Generation in Clustered Latent Space (AGCL)
AGCL is designed to enhance classification performance on small medical datasets through synthetic data generation.
It was applied to Parkinson's disease screening, utilizing facial expression data.
arXiv Detail & Related papers (2024-09-26T09:51:08Z) - A3S: A General Active Clustering Method with Pairwise Constraints [66.74627463101837]
A3S features strategic active clustering adjustment on the initial cluster result, which is obtained by an adaptive clustering algorithm.
In extensive experiments across diverse real-world datasets, A3S achieves desired results with significantly fewer human queries.
arXiv Detail & Related papers (2024-07-14T13:37:03Z) - Machine Learning Based Missing Values Imputation in Categorical Datasets [2.5611256859404983]
This research looked into the use of machine learning algorithms to fill in the gaps in categorical datasets.
The emphasis was on ensemble models constructed using the Error Correction Output Codes framework.
Deep learning for missing data imputation has obstacles despite these encouraging results, including the requirement for large amounts of labeled data.
arXiv Detail & Related papers (2023-06-10T03:29:48Z) - Significance-Based Categorical Data Clustering [7.421725101465365]
We use the likelihood ratio test to derive a test statistic that can serve as a significance-based objective function in categorical data clustering.
A new clustering algorithm is proposed in which the significance-based objective function is optimized via a Monte Carlo search procedure.
arXiv Detail & Related papers (2022-11-08T02:06:31Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - Detection and Evaluation of Clusters within Sequential Data [58.720142291102135]
Clustering algorithms for Block Markov Chains possess theoretical optimality guarantees.
In particular, our sequential data is derived from human DNA, written text, animal movement data and financial markets.
It is found that the Block Markov Chain model assumption can indeed produce meaningful insights in exploratory data analyses.
arXiv Detail & Related papers (2022-10-04T15:22:39Z) - Performance evaluation results of evolutionary clustering algorithm star
for clustering heterogeneous datasets [15.154538450706474]
This article presents the data used to evaluate the performance of evolutionary clustering algorithm star (ECA*)
Two experimental methods are employed to examine the performance of ECA* against five traditional and modern clustering algorithms.
arXiv Detail & Related papers (2021-04-30T08:17:19Z) - Deep learning-based COVID-19 pneumonia classification using chest CT
images: model generalizability [54.86482395312936]
Deep learning (DL) classification models were trained to identify COVID-19-positive patients on 3D computed tomography (CT) datasets from different countries.
We trained nine identical DL-based classification models by using combinations of the datasets with a 72% train, 8% validation, and 20% test data split.
The models trained on multiple datasets and evaluated on a test set from one of the datasets used for training performed better.
arXiv Detail & Related papers (2021-02-18T21:14:52Z) - A Multi-disciplinary Ensemble Algorithm for Clustering Heterogeneous
Datasets [0.76146285961466]
We propose a new evolutionary clustering algorithm (ECAStar) based on social class ranking and meta-heuristic algorithms.
ECAStar is integrated with recombinational evolutionary operators, Levy flight optimisation, and some statistical techniques.
Experiments are conducted to evaluate the ECAStar against five conventional approaches.
arXiv Detail & Related papers (2021-01-01T07:20:50Z) - Deep Semi-Supervised Embedded Clustering (DSEC) for Stratification of
Heart Failure Patients [50.48904066814385]
In this work we apply deep semi-supervised embedded clustering to determine data-driven patient subgroups of heart failure.
We find clinically relevant clusters from an embedded space derived from heterogeneous data.
The proposed algorithm can potentially find new undiagnosed subgroups of patients that have different outcomes.
arXiv Detail & Related papers (2020-12-24T12:56:46Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.