Machine learning methods for prediction of cancer driver genes: a survey
paper
- URL: http://arxiv.org/abs/2109.13685v1
- Date: Tue, 28 Sep 2021 13:00:07 GMT
- Title: Machine learning methods for prediction of cancer driver genes: a survey
paper
- Authors: Renan Andrades, Mariana Recamonde-Mendoza
- Abstract summary: This survey aims to perform a comprehensive analysis of machine learning (ML)-based approaches to identify cancer driver mutations and genes.
We discuss how the interactions among data types and ML algorithms have been explored in previous solutions.
We hope that by helping readers become more familiar with significant developments in the field brought by ML, we may inspire new researchers to address open problems and advance our knowledge towards cancer driver discovery.
- Score: 1.713291434132985
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identifying the genes and mutations that drive the emergence of tumors is a
major step to improve understanding of cancer and identify new directions for
disease diagnosis and treatment. Despite the large volume of genomics data, the
precise detection of driver mutations and their carrying genes, known as cancer
driver genes, from the millions of possible somatic mutations remains a
challenge. Computational methods play an increasingly important role in
identifying genomic patterns associated with cancer drivers and developing
models to predict driver events. Machine learning (ML) has been the engine
behind many of these efforts and provides excellent opportunities for tackling
remaining gaps in the field. Thus, this survey aims to perform a comprehensive
analysis of ML-based computational approaches to identify cancer driver
mutations and genes, providing an integrated, panoramic view of the broad data
and algorithmic landscape within this scientific problem. We discuss how the
interactions among data types and ML algorithms have been explored in previous
solutions and outline current analytical limitations that deserve further
attention from the scientific community. We hope that by helping readers become
more familiar with significant developments in the field brought by ML, we may
inspire new researchers to address open problems and advance our knowledge
towards cancer driver discovery.
Related papers
- Variational and Explanatory Neural Networks for Encoding Cancer Profiles and Predicting Drug Responses [40.80133767939435]
Existing AI models face challenges due to noise in transcriptomics data and lack of biological interpretability.
We introduce VETE, a novel neural network framework that incorporates a variational component to mitigate noise effects.
VETE bridges the gap between AI-driven predictions and biologically meaningful insights in cancer research.
arXiv Detail & Related papers (2024-07-05T13:13:02Z) - Knowledge-Informed Machine Learning for Cancer Diagnosis and Prognosis:
A review [2.2268038840298714]
We review the state-of-the-art machine learning studies that adopted the fusion of biomedical knowledge and data.
We provide an overview of diverse forms of knowledge representation and current strategies of knowledge integration into machine learning pipelines.
arXiv Detail & Related papers (2024-01-12T07:01:36Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Genetic InfoMax: Exploring Mutual Information Maximization in
High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z) - Cancer-inspired Genomics Mapper Model for the Generation of Synthetic
DNA Sequences with Desired Genomics Signatures [0.0]
Cancer-inspired genomics mapper model (CGMM) combines genetic algorithm (GA) and deep learning (DL) methods.
We demonstrate that CGMM can generate synthetic genomes of selected phenotypes such as ancestry and cancer.
arXiv Detail & Related papers (2023-05-01T07:16:40Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - SurvODE: Extrapolating Gene Expression Distribution for Early Cancer
Identification [8.868473888198597]
We propose a novel method that can simulate the gene expression distribution at any given time point.
Our visualization results and further analysis indicate how our method can be used to simulate expression at the early cancer stage.
arXiv Detail & Related papers (2021-11-30T02:49:11Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - The scalable Birth-Death MCMC Algorithm for Mixed Graphical Model
Learning with Application to Genomic Data Integration [0.0]
We propose a novel mixed graphical model approach to analyze multi-omic data of different types.
We find that our method is superior in terms of both computational efficiency and the accuracy of the model selection results.
arXiv Detail & Related papers (2020-05-08T16:34:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.