Recent Advances in Network-based Methods for Disease Gene Prediction
- URL: http://arxiv.org/abs/2007.10848v2
- Date: Thu, 17 Dec 2020 22:47:07 GMT
- Title: Recent Advances in Network-based Methods for Disease Gene Prediction
- Authors: Sezin Kircali Ata, Min Wu, Yuan Fang, Le Ou-Yang, Chee Keong Kwoh and
Xiao-Li Li
- Abstract summary: Disease-gene association through Genome-wide association study (GWAS) is an arduous task for researchers.
To provide the researchers with alternative low-cost disease-gene association evidence, computational approaches come into play.
Since molecular networks are able to capture complex interplay among molecules in diseases, they become one of the most extensively used data for disease-gene association prediction.
- Score: 15.625526953844638
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Disease-gene association through Genome-wide association study (GWAS) is an
arduous task for researchers. Investigating single nucleotide polymorphisms
(SNPs) that correlate with specific diseases needs statistical analysis of
associations. Considering the huge number of possible mutations, in addition to
its high cost, another important drawback of GWAS analysis is the large number
of false-positives. Thus, researchers search for more evidence to cross-check
their results through different sources. To provide the researchers with
alternative low-cost disease-gene association evidence, computational
approaches come into play. Since molecular networks are able to capture complex
interplay among molecules in diseases, they become one of the most extensively
used data for disease-gene association prediction. In this survey, we aim to
provide a comprehensive and an up-to-date review of network-based methods for
disease gene prediction. We also conduct an empirical analysis on 14
state-of-the-art methods. To summarize, we first elucidate the task definition
for disease gene prediction. Secondly, we categorize existing network-based
efforts into network diffusion methods, traditional machine learning methods
with handcrafted graph features and graph representation learning methods.
Thirdly, an empirical analysis is conducted to evaluate the performance of the
selected methods across seven diseases. We also provide distinguishing findings
about the discussed methods based on our empirical analysis. Finally, we
highlight potential research directions for future studies on disease gene
prediction.
Related papers
- Survey and Improvement Strategies for Gene Prioritization with Large Language Models [61.24568051916653]
Large language models (LLMs) have performed well in medical exams, but their effectiveness in diagnosing rare genetic diseases has not been assessed.
We used multi-agent and Human Phenotype Ontology (HPO) classification to categorized patients based on phenotypes and solvability levels.
At baseline, GPT-4 outperformed other LLMs, achieving near 30% accuracy in ranking causal genes correctly.
arXiv Detail & Related papers (2025-01-30T23:03:03Z) - Stacked ensemble\-based mutagenicity prediction model using multiple modalities with graph attention network [0.9736758288065405]
Mutagenicity is a concern due to its association with genetic mutations which can result in a variety of negative consequences.
In this work, we introduce a novel stacked ensemble based mutagenicity prediction model.
arXiv Detail & Related papers (2024-09-03T09:14:21Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Data-Driven Logistic Regression Ensembles With Applications in Genomics [0.0]
We propose a new approach for dealing with high-dimensional binary classification problems that combines ideas from regularization and ensembling.
We demonstrate the good performance of our method in terms of prediction accuracy and identification of key biomarkers using several medical datasets involving common diseases such as cancer, multiple sclerosis and psoriasis.
arXiv Detail & Related papers (2021-02-17T05:57:26Z) - Relation-weighted Link Prediction for Disease Gene Identification [0.3078691410268859]
We propose a novel machine learning method that identifies disease genes on such graphs.
We show that our algorithms outperform its closest state-of-the-art competitor in disease gene identification by 24.1%.
We also show that we achieve higher precision than Open Targets, the leading initiative for target identification, with respect to predicting drug targets in clinical trials for Parkinson's disease.
arXiv Detail & Related papers (2020-11-10T15:09:33Z) - Expectile Neural Networks for Genetic Data Analysis of Complex Diseases [3.0088453915399747]
We develop an expectile neural network (ENN) method for genetic data analyses of complex diseases.
Similar to expectile regression, ENN provides a comprehensive view of relationships between genetic variants and disease phenotypes.
We show that the proposed method outperformed an existing expectile regression when there exist complex relationships between genetic variants and disease phenotypes.
arXiv Detail & Related papers (2020-10-26T21:07:40Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.