Fuzzy Gene Selection and Cancer Classification Based on Deep Learning
Model
- URL: http://arxiv.org/abs/2305.04883v1
- Date: Thu, 4 May 2023 21:52:57 GMT
- Title: Fuzzy Gene Selection and Cancer Classification Based on Deep Learning
Model
- Authors: Mahmood Khalsan, Mu Mu, Eman Salih Al-Shamery, Lee Machado, Suraj
Ajit, Michael Opoku Agyeman
- Abstract summary: We developed a new fuzzy gene selection technique (FGS) to identify informative genes to facilitate cancer classification.
With our FGS-enhanced method, the cancer classification model achieved 96.5%,96.2%,96%, and 95.9% for accuracy, precision, recall, and f1-score respectively.
In examining the six datasets that were used, the proposed model demonstrates it's capacity to classify cancer effectively.
- Score: 1.3072222152900117
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Machine learning (ML) approaches have been used to develop highly accurate
and efficient applications in many fields including bio-medical science.
However, even with advanced ML techniques, cancer classification using gene
expression data is still complicated because of the high dimensionality of the
datasets employed. We developed a new fuzzy gene selection technique (FGS) to
identify informative genes to facilitate cancer classification and reduce the
dimensionality of the available gene expression data. Three feature selection
methods (Mutual Information, F-ClassIf, and Chi-squared) were evaluated and
employed to obtain the score and rank for each gene. Then, using Fuzzification
and Defuzzification methods to obtain the best single score for each gene,
which aids in the identification of significant genes. Our study applied the
fuzzy measures to six gene expression datasets including four Microarray and
two RNA-seq datasets for evaluating the proposed algorithm. With our
FGS-enhanced method, the cancer classification model achieved 96.5%,96.2%,96%,
and 95.9% for accuracy, precision, recall, and f1-score respectively, which is
significantly higher than 69.2% accuracy, 57.8% precision, 66% recall, and
58.2% f1-score when the standard MLP method was used. In examining the six
datasets that were used, the proposed model demonstrates it's capacity to
classify cancer effectively.
Related papers
- Precision Cancer Classification and Biomarker Identification from mRNA Gene Expression via Dimensionality Reduction and Explainable AI [0.9423257767158634]
This research presents a comprehensive pipeline designed to accurately identify 33 distinct cancer types and their corresponding gene sets.
It incorporates a combination of normalization and feature selection techniques to reduce dataset dimensionality effectively.
We leverage Explainable AI to elucidate the biological significance of the identified cancer-specific genes.
arXiv Detail & Related papers (2024-10-08T18:56:31Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Genetic InfoMax: Exploring Mutual Information Maximization in
High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z) - A Hybrid Machine Learning Model for Classifying Gene Mutations in Cancer using LSTM, BiLSTM, CNN, GRU, and GloVe [0.0]
We introduce a novel hybrid ensemble model that synergistically combines LSTM, BiLSTM, CNN, GRU, and GloVe embeddings for the classification of gene mutations in cancer.
Our approach achieved a training accuracy of 80.6%, precision of 81.6%, recall of 80.6%, and an F1 score of 83.1%, alongside a significantly reduced Mean Squared Error (MSE) of 2.596.
arXiv Detail & Related papers (2023-07-24T21:01:46Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Gene selection from microarray expression data: A Multi-objective PSO
with adaptive K-nearest neighborhood [0.0]
This paper deals with the classification problem of human cancer diseases by using gene expression data.
It is presented a new methodology to analyze microarray datasets and efficiently classify cancer diseases.
arXiv Detail & Related papers (2022-05-27T04:22:10Z) - Cancer Gene Profiling through Unsupervised Discovery [49.28556294619424]
We introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarkers.
Our method is based on the LP-Stability algorithm, a high dimensional center-based unsupervised clustering algorithm.
Our signature reports promising results on distinguishing immune inflammatory and immune desert tumors.
arXiv Detail & Related papers (2021-02-11T09:04:45Z) - Feature reduction for machine learning on molecular features: The
GeneScore [58.720142291102135]
The GeneScore is a concept of feature reduction for Machine Learning analysis of biomedical data.
We show that the GeneScore is superior to a binary matrix in the classification of cancer entities.
arXiv Detail & Related papers (2021-01-14T10:58:39Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - A New Gene Selection Algorithm using Fuzzy-Rough Set Theory for Tumor
Classification [0.0]
We present a new technique for gene selection using a discernibility matrix of fuzzy-rough sets.
The proposed technique takes into account the similarity of those instances that have the same and different class labels to improve the gene selection results.
Experimental results demonstrate that this technique provides better efficiency compared to the state-of-the-art approaches.
arXiv Detail & Related papers (2020-03-26T13:43:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.