Gene Function Prediction with Gene Interaction Networks: A Context Graph
Kernel Approach
- URL: http://arxiv.org/abs/2204.10473v1
- Date: Fri, 22 Apr 2022 02:54:01 GMT
- Title: Gene Function Prediction with Gene Interaction Networks: A Context Graph
Kernel Approach
- Authors: Xin Li, Hsinchun Chen, Jiexun Li, Zhu Zhang
- Abstract summary: We propose to use a gene's context graph, i.e., the gene interaction network associated with the focal gene, to infer its functions.
In a kernel-based machine-learning framework, we design a context graph kernel to capture the information in context graphs.
- Score: 24.234645183601998
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Predicting gene functions is a challenge for biologists in the post genomic
era. Interactions among genes and their products compose networks that can be
used to infer gene functions. Most previous studies adopt a linkage assumption,
i.e., they assume that gene interactions indicate functional similarities
between connected genes. In this study, we propose to use a gene's context
graph, i.e., the gene interaction network associated with the focal gene, to
infer its functions. In a kernel-based machine-learning framework, we design a
context graph kernel to capture the information in context graphs. Our
experimental study on a testbed of p53-related genes demonstrates the advantage
of using indirect gene interactions and shows the empirical superiority of the
proposed approach over linkage-assumption-based methods, such as the algorithm
to minimize inconsistent connected genes and diffusion kernels.
Related papers
- Learning to Discover Regulatory Elements for Gene Expression Prediction [59.470991831978516]
Seq2Exp is a Sequence to Expression network designed to discover and extract regulatory elements that drive target gene expression.
Our approach captures the causal relationship between epigenomic signals, DNA sequences and their associated regulatory elements.
arXiv Detail & Related papers (2025-02-19T03:25:49Z) - GoBERT: Gene Ontology Graph Informed BERT for Universal Gene Function Prediction [33.42820754967865]
We propose to tackle the gene function prediction problem by exploring Gene Ontology graph and annotation with BERT (GoBERT)
Two pre-train tasks are designed to jointly train GoBERT to capture both explicit and implicit relations of functions.
GoBERT possess the ability to predict novel functions for various gene and gene products based on known functional annotations.
arXiv Detail & Related papers (2025-01-03T18:02:50Z) - Gene-Level Representation Learning via Interventional Style Transfer in Optical Pooled Screening [3.7038542578642715]
We employ a style-transfer approach to learn gene-level feature representations from images of genetically perturbed cells obtained via Optical pooled screening (OPS)
Our method outperforms widely used engineered features in clustering gene representations according to gene function, demonstrating its utility for uncovering latent biological relationships.
arXiv Detail & Related papers (2024-06-11T22:56:50Z) - Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification [119.13058298388101]
We develop a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances.
BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules.
BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules.
arXiv Detail & Related papers (2024-06-05T06:42:27Z) - FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics [46.189419603576084]
FGBERT is a novel metagenomic pre-trained model that employs a protein-based gene representation as a context-aware tokenizer.
It demonstrates superior performance on metagenomic datasets at four levels, spanning gene, functional, bacterial, and environmental levels.
arXiv Detail & Related papers (2024-02-24T13:13:17Z) - GENER: A Parallel Layer Deep Learning Network To Detect Gene-Gene
Interactions From Gene Expression Data [0.7660368798066375]
We introduce a parallel-layer deep learning network designed exclusively for the identification of gene-gene relationships using gene expression data.
Our model achieved an average AUROC score of 0.834 on the combined BioGRID&DREAM5 dataset, outperforming competing methods in predicting gene-gene interactions.
arXiv Detail & Related papers (2023-10-05T15:45:53Z) - MuSe-GNN: Learning Unified Gene Representation From Multimodal
Biological Graph Data [22.938437500266847]
We introduce a novel model called Multimodal Similarity Learning Graph Neural Network.
It combines Multimodal Machine Learning and Deep Graph Neural Networks to learn gene representations from single-cell sequencing and spatial transcriptomic data.
Our model efficiently produces unified gene representations for the analysis of gene functions, tissue functions, diseases, and species evolution.
arXiv Detail & Related papers (2023-09-29T13:33:53Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Unsupervised ensemble-based phenotyping helps enhance the
discoverability of genes related to heart morphology [57.25098075813054]
We propose a new framework for gene discovery entitled Un Phenotype Ensembles.
It builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner.
These phenotypes are then analyzed via (GWAS), retaining only highly confident and stable associations.
arXiv Detail & Related papers (2023-01-07T18:36:44Z) - Granger causal inference on DAGs identifies genomic loci regulating
transcription [77.58911272503771]
GrID-Net is a framework based on graph neural networks with lagged message passing for Granger causal inference on DAG-structured systems.
Our application is the analysis of single-cell multimodal data to identify genomic loci that mediate the regulation of specific genes.
arXiv Detail & Related papers (2022-10-18T21:15:10Z) - SimpleChrome: Encoding of Combinatorial Effects for Predicting Gene
Expression [8.326669256957352]
We present SimpleChrome, a deep learning model that learns the histone modification representations of genes.
The features learned from the model allow us to better understand the latent effects of cross-gene interactions and direct gene regulation on the target gene expression.
arXiv Detail & Related papers (2020-12-15T23:30:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.