Global Mapping of Gene/Protein Interactions in PubMed Abstracts: A
Framework and an Experiment with P53 Interactions
- URL: http://arxiv.org/abs/2204.10476v1
- Date: Fri, 22 Apr 2022 03:04:19 GMT
- Title: Global Mapping of Gene/Protein Interactions in PubMed Abstracts: A
Framework and an Experiment with P53 Interactions
- Authors: Xin Li, Hsinchun Chen, Zan Huang, Hua Su, Jesse D. Martinez
- Abstract summary: The large body of biomedical literature is an important source of gene/protein interaction information.
Recent advances in text mining tools have made it possible to automatically extract such documented interactions from free-text literature.
We propose a comprehensive framework for constructing and analyzing large-scale gene functional networks based on the gene/protein interactions extracted from biomedical literature repositories.
- Score: 7.361249273831739
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Gene/protein interactions provide critical information for a thorough
understanding of cellular processes. Recently, considerable interest and effort
has been focused on the construction and analysis of genome-wide gene networks.
The large body of biomedical literature is an important source of gene/protein
interaction information. Recent advances in text mining tools have made it
possible to automatically extract such documented interactions from free-text
literature. In this paper, we propose a comprehensive framework for
constructing and analyzing large-scale gene functional networks based on the
gene/protein interactions extracted from biomedical literature repositories
using text mining tools. Our proposed framework consists of analyses of the
network topology, network topology-gene function relationship, and temporal
network evolution to distill valuable information embedded in the gene
functional interactions in literature. We demonstrate the application of the
proposed framework using a testbed of P53-related PubMed abstracts, which shows
that literature-based P53 networks exhibit small-world and scale-free
properties. We also found that high degree genes in the literature-based
networks have a high probability of appearing in the manually curated database
and genes in the same pathway tend to form local clusters in our
literature-based networks. Temporal analysis showed that genes interacting with
many other genes tend to be involved in a large number of newly discovered
interactions.
Related papers
- BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments [112.25067497985447]
We introduce BioDiscoveryAgent, an agent that designs new experiments, reasons about their outcomes, and efficiently navigates the hypothesis space to reach desired solutions.
BioDiscoveryAgent can uniquely design new experiments without the need to train a machine learning model.
It achieves an average of 21% improvement in predicting relevant genetic perturbations across six datasets.
arXiv Detail & Related papers (2024-05-27T19:57:17Z) - Leveraging Biomolecule and Natural Language through Multi-Modal
Learning: A Survey [75.47055414002571]
The integration of biomolecular modeling with natural language (BL) has emerged as a promising interdisciplinary area at the intersection of artificial intelligence, chemistry and biology.
We provide an analysis of recent advancements achieved through cross modeling of biomolecules and natural language.
arXiv Detail & Related papers (2024-03-03T14:59:47Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - DDeMON: Ontology-based function prediction by Deep Learning from Dynamic
Multiplex Networks [0.7349727826230864]
The goal of this work is to explore how the fusion of systems' level information with temporal dynamics of gene expression can be used to predict novel gene functions.
We propose DDeMON, an approach for scalable, systems-level inference of function annotation using time-dependent multiscale biological information.
arXiv Detail & Related papers (2023-02-08T06:53:02Z) - Unsupervised ensemble-based phenotyping helps enhance the
discoverability of genes related to heart morphology [57.25098075813054]
We propose a new framework for gene discovery entitled Un Phenotype Ensembles.
It builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner.
These phenotypes are then analyzed via (GWAS), retaining only highly confident and stable associations.
arXiv Detail & Related papers (2023-01-07T18:36:44Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - Gene Function Prediction with Gene Interaction Networks: A Context Graph
Kernel Approach [24.234645183601998]
We propose to use a gene's context graph, i.e., the gene interaction network associated with the focal gene, to infer its functions.
In a kernel-based machine-learning framework, we design a context graph kernel to capture the information in context graphs.
arXiv Detail & Related papers (2022-04-22T02:54:01Z) - Graph Representation Learning on Tissue-Specific Multi-Omics [0.0]
We leverage a graph embedding model (i.e VGAE) to perform link prediction on tissue-specific Gene-Gene Interaction (GGI) networks.
We prove that the combination of multiple biological modalities (i.e multi-omics) leads to powerful embeddings and better link prediction performances.
arXiv Detail & Related papers (2021-07-25T17:38:45Z) - Heterogeneous Graph based Deep Learning for Biomedical Network Link
Prediction [7.628651624423363]
We propose a Graph Pair based Link Prediction model (GPLP) for predicting biomedical network links.
InP, 1-hop subgraphs extracted from known network interaction matrix is learnt to predict missing links.
Our method demonstrates the potential applications in other biomedical networks.
arXiv Detail & Related papers (2021-01-28T07:35:29Z) - Predicting Biomedical Interactions with Higher-Order Graph Convolutional
Networks [2.9488233765621295]
We present a higher-order graph convolutional network (HOGCN) to aggregate information from the higher-order neighborhood for biomedical interaction prediction.
Experiments on four interaction networks, including protein-protein, drug-drug, drug-target, and gene-disease interactions, show that HOGCN achieves more accurate and calibrated predictions.
arXiv Detail & Related papers (2020-10-16T17:16:09Z) - Extracting a Knowledge Base of Mechanisms from COVID-19 Papers [50.17242035034729]
We pursue the construction of a knowledge base (KB) of mechanisms.
We develop a broad, unified schema that strikes a balance between relevance and breadth.
Experiments demonstrate the utility of our KB in supporting interdisciplinary scientific search over COVID-19 literature.
arXiv Detail & Related papers (2020-10-08T07:54:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.