Relation-weighted Link Prediction for Disease Gene Identification
- URL: http://arxiv.org/abs/2011.05138v3
- Date: Fri, 13 Nov 2020 14:48:00 GMT
- Title: Relation-weighted Link Prediction for Disease Gene Identification
- Authors: Srivamshi Pittala, William Koehler, Jonathan Deans, Daniel Salinas,
Martin Bringmann, Katharina Sophia Volz, Berk Kapicioglu
- Abstract summary: We propose a novel machine learning method that identifies disease genes on such graphs.
We show that our algorithms outperform its closest state-of-the-art competitor in disease gene identification by 24.1%.
We also show that we achieve higher precision than Open Targets, the leading initiative for target identification, with respect to predicting drug targets in clinical trials for Parkinson's disease.
- Score: 0.3078691410268859
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identification of disease genes, which are a set of genes associated with a
disease, plays an important role in understanding and curing diseases. In this
paper, we present a biomedical knowledge graph designed specifically for this
problem, propose a novel machine learning method that identifies disease genes
on such graphs by leveraging recent advances in network biology and graph
representation learning, study the effects of various relation types on
prediction performance, and empirically demonstrate that our algorithms
outperform its closest state-of-the-art competitor in disease gene
identification by 24.1%. We also show that we achieve higher precision than
Open Targets, the leading initiative for target identification, with respect to
predicting drug targets in clinical trials for Parkinson's disease.
Related papers
- Improving Disease Comorbidity Prediction Based on Human Interactome with Biologically Supervised Graph Embedding [0.0]
Comorbidity carries significant implications for disease understanding and management.
Human interactome, as a large incomplete graph, presents its own challenges to extracting useful features for comorbidity prediction.
Biologically Supervised Graph Embedding (BSE) allows for selecting most relevant features to enhance the prediction accuracy of comorbid disease pairs.
arXiv Detail & Related papers (2024-10-08T03:52:12Z) - Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification [119.13058298388101]
We develop a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances.
BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules.
BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules.
arXiv Detail & Related papers (2024-06-05T06:42:27Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Genetic InfoMax: Exploring Mutual Information Maximization in
High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z) - Knowledge Graph Completion based on Tensor Decomposition for Disease
Gene Prediction [2.838553480267889]
We construct a biological knowledge graph centered on diseases and genes, and develop an end-to-end Knowledge graph completion model for Disease Gene Prediction.
KDGene introduces an interaction module between the embeddings of entities and relations to tensor decomposition, which can effectively enhance the information interaction in biological knowledge.
arXiv Detail & Related papers (2023-02-18T13:57:44Z) - An Information-Theoretic Framework for Identifying Age-Related Genes
Using Human Dermal Fibroblast Transcriptome Data [0.8122270502556371]
We develop an information-theoretic framework for identifying genes that are associated with aging.
We use unsupervised and semi-supervised learning techniques on human dermal fibroblast gene expression data.
Performance assessment for both unsupervised and semi-supervised methods show the effectiveness of the framework.
arXiv Detail & Related papers (2021-11-04T02:41:33Z) - Data-Driven Logistic Regression Ensembles With Applications in Genomics [0.0]
We propose a new approach for dealing with high-dimensional binary classification problems that combines ideas from regularization and ensembling.
We demonstrate the good performance of our method in terms of prediction accuracy and identification of key biomarkers using several medical datasets involving common diseases such as cancer, multiple sclerosis and psoriasis.
arXiv Detail & Related papers (2021-02-17T05:57:26Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Recent Advances in Network-based Methods for Disease Gene Prediction [15.625526953844638]
Disease-gene association through Genome-wide association study (GWAS) is an arduous task for researchers.
To provide the researchers with alternative low-cost disease-gene association evidence, computational approaches come into play.
Since molecular networks are able to capture complex interplay among molecules in diseases, they become one of the most extensively used data for disease-gene association prediction.
arXiv Detail & Related papers (2020-07-19T14:13:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.