Knowledge Graph Completion based on Tensor Decomposition for Disease
Gene Prediction
- URL: http://arxiv.org/abs/2302.09335v1
- Date: Sat, 18 Feb 2023 13:57:44 GMT
- Title: Knowledge Graph Completion based on Tensor Decomposition for Disease
Gene Prediction
- Authors: Xinyan Wang, Ting Jia, Chongyu Wang, Kuan Xu, Zixin Shu, Kuo Yang,
Xuezhong Zhou
- Abstract summary: We construct a biological knowledge graph centered on diseases and genes, and develop an end-to-end Knowledge graph completion model for Disease Gene Prediction.
KDGene introduces an interaction module between the embeddings of entities and relations to tensor decomposition, which can effectively enhance the information interaction in biological knowledge.
- Score: 2.838553480267889
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate identification of disease genes has consistently been one of the
keys to decoding a disease's molecular mechanism. Most current approaches focus
on constructing biological networks and utilizing machine learning, especially,
deep learning to identify disease genes, but ignore the complex relations
between entities in the biological knowledge graph. In this paper, we construct
a biological knowledge graph centered on diseases and genes, and develop an
end-to-end Knowledge graph completion model for Disease Gene Prediction using
interactional tensor decomposition (called KDGene). KDGene introduces an
interaction module between the embeddings of entities and relations to tensor
decomposition, which can effectively enhance the information interaction in
biological knowledge. Experimental results show that KDGene significantly
outperforms state-of-the-art algorithms. Furthermore, the comprehensive
biological analysis of the case of diabetes mellitus confirms KDGene's ability
for identifying new and accurate candidate genes. This work proposes a scalable
knowledge graph completion framework to identify disease candidate genes, from
which the results are promising to provide valuable references for further wet
experiments.
Related papers
- Improving Disease Comorbidity Prediction Based on Human Interactome with Biologically Supervised Graph Embedding [0.0]
Comorbidity carries significant implications for disease understanding and management.
Human interactome, as a large incomplete graph, presents its own challenges to extracting useful features for comorbidity prediction.
Biologically Supervised Graph Embedding (BSE) allows for selecting most relevant features to enhance the prediction accuracy of comorbid disease pairs.
arXiv Detail & Related papers (2024-10-08T03:52:12Z) - Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification [119.13058298388101]
We develop a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances.
BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules.
BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules.
arXiv Detail & Related papers (2024-06-05T06:42:27Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Tertiary Lymphoid Structures Generation through Graph-based Diffusion [54.37503714313661]
In this work, we leverage state-of-the-art graph-based diffusion models to generate biologically meaningful cell-graphs.
We show that the adopted graph diffusion model is able to accurately learn the distribution of cells in terms of their tertiary lymphoid structures (TLS) content.
arXiv Detail & Related papers (2023-10-10T14:37:17Z) - Genetic InfoMax: Exploring Mutual Information Maximization in
High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z) - Applying BioBERT to Extract Germline Gene-Disease Associations for Building a Knowledge Graph from the Biomedical Literature [0.0]
This paper presents SimpleGermKG, an automatic knowledge graph construction approach that connects germline genes and diseases.
For the extraction of genes and diseases, we employ BioBERT, a pre-trained BERT model on biomedical corpora.
For semantic relationships between articles, genes, and diseases, we implemented a part-whole relation approach.
Our knowledge graph contains 297 genes, 130 diseases, and 46,747 triples.
arXiv Detail & Related papers (2023-09-11T18:05:12Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Unsupervised ensemble-based phenotyping helps enhance the
discoverability of genes related to heart morphology [57.25098075813054]
We propose a new framework for gene discovery entitled Un Phenotype Ensembles.
It builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner.
These phenotypes are then analyzed via (GWAS), retaining only highly confident and stable associations.
arXiv Detail & Related papers (2023-01-07T18:36:44Z) - Graph Representation Learning on Tissue-Specific Multi-Omics [0.0]
We leverage a graph embedding model (i.e VGAE) to perform link prediction on tissue-specific Gene-Gene Interaction (GGI) networks.
We prove that the combination of multiple biological modalities (i.e multi-omics) leads to powerful embeddings and better link prediction performances.
arXiv Detail & Related papers (2021-07-25T17:38:45Z) - Predicting Gene-Disease Associations with Knowledge Graph Embeddings
over Multiple Ontologies [0.0]
Ontology-based approaches for predicting gene-disease associations include knowledge graph embeddings.
We investigate the impact of knowledge graph embeddings on complex tasks such as gene-disease association.
arXiv Detail & Related papers (2021-05-11T11:20:38Z) - Relation-weighted Link Prediction for Disease Gene Identification [0.3078691410268859]
We propose a novel machine learning method that identifies disease genes on such graphs.
We show that our algorithms outperform its closest state-of-the-art competitor in disease gene identification by 24.1%.
We also show that we achieve higher precision than Open Targets, the leading initiative for target identification, with respect to predicting drug targets in clinical trials for Parkinson's disease.
arXiv Detail & Related papers (2020-11-10T15:09:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.