KG-Hub -- Building and Exchanging Biological Knowledge Graphs
- URL: http://arxiv.org/abs/2302.10800v1
- Date: Tue, 31 Jan 2023 21:29:35 GMT
- Title: KG-Hub -- Building and Exchanging Biological Knowledge Graphs
- Authors: J Harry Caufield, Tim Putman, Kevin Schaper, Deepak R Unni, Harshad
Hegde, Tiffany J Callahan, Luca Cappelletti, Sierra AT Moxon, Vida Ravanmehr,
Seth Carbon, Lauren E Chan, Katherina Cortes, Kent A Shefchek, Glass
Elsarboukh, James P Balhoff, Tommaso Fontana, Nicolas Matentzoglu, Richard M
Bruskiewich, Anne E Thessen, Nomi L Harris, Monica C Munoz-Torres, Melissa A
Haendel, Peter N Robinson, Marcin P Joachimiak, Christopher J Mungall, Justin
T Reese
- Abstract summary: KG-Hub is a platform that enables standardized construction, exchange, and reuse of knowledge graphs.
Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research.
- Score: 0.5369297590461578
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous
data and making inferences in biology and many other domains, but a coherent
solution for constructing, exchanging, and facilitating the downstream use of
knowledge graphs is lacking. Here we present KG-Hub, a platform that enables
standardized construction, exchange, and reuse of knowledge graphs. Features
include a simple, modular extract-transform-load (ETL) pattern for producing
graphs compliant with Biolink Model (a high-level data model for standardizing
biological data), easy integration of any OBO (Open Biological and Biomedical
Ontologies) ontology, cached downloads of upstream data sources, versioned and
automatically updated builds with stable URLs, web-browsable storage of KG
artifacts on cloud infrastructure, and easy reuse of transformed subgraphs
across projects. Current KG-Hub projects span use cases including COVID-19
research, drug repurposing, microbial-environmental interactions, and rare
disease research. KG-Hub is equipped with tooling to easily analyze and
manipulate knowledge graphs. KG-Hub is also tightly integrated with graph
machine learning (ML) tools which allow automated graph machine learning,
including node embeddings and training of models for link prediction and node
classification.
Related papers
- Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency [59.6772484292295]
Knowledge graphs (KGs) generated by large language models (LLMs) are increasingly valuable for Retrieval-Augmented Generation (RAG) applications.
Existing KG extraction methods rely on prompt-based approaches, which are inefficient for processing large-scale corpora.
We propose SynthKG, a multi-step, document-level synthesis KG workflow based on LLMs.
We also design a novel graph-based retrieval framework for RAG.
arXiv Detail & Related papers (2024-10-22T00:47:54Z) - LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies [3.2221734920470797]
We propose a Vision-Language framework augmented with a Knowledge Graph (KG)-based datastore to generate Natural Language Explanations (NLEs) for medical images.
Our framework employs a KG-based retrieval mechanism that not only improves the precision of the generated explanations but also preserves data privacy by avoiding direct data retrieval.
These frameworks are validated on the MIMIC-NLE dataset, where they achieve state-of-the-art results.
arXiv Detail & Related papers (2024-10-07T04:59:08Z) - Graph Relation Distillation for Efficient Biomedical Instance
Segmentation [80.51124447333493]
We propose a graph relation distillation approach for efficient biomedical instance segmentation.
We introduce two graph distillation schemes deployed at both the intra-image level and the inter-image level.
Experimental results on a number of biomedical datasets validate the effectiveness of our approach.
arXiv Detail & Related papers (2024-01-12T04:41:23Z) - GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts [75.51612253852002]
GraphMETRO is a Graph Neural Network architecture that models natural diversity and captures complex distributional shifts.
GraphMETRO achieves state-of-the-art results on four datasets from the GOOD benchmark.
arXiv Detail & Related papers (2023-12-07T20:56:07Z) - GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent
Space Reconstruction [76.35904458027694]
Masked autoencoder models lack good generalization ability on graph data.
We propose a novel graph masked autoencoder framework called GiGaMAE.
Our results will shed light on the design of foundation models on graph-structured data.
arXiv Detail & Related papers (2023-08-18T16:30:51Z) - An Open-Source Knowledge Graph Ecosystem for the Life Sciences [5.665519167428707]
PheKnowLator is a semantic ecosystem for automating the construction of ontologically grounded knowledge graphs.
The ecosystem includes KG construction resources, analysis tools, and benchmarks.
PheKnowLator enables fully customizable KGs without compromising performance or usability.
arXiv Detail & Related papers (2023-07-11T18:55:09Z) - Scientific Language Models for Biomedical Knowledge Base Completion: An
Empirical Study [62.376800537374024]
We study scientific LMs for KG completion, exploring whether we can tap into their latent knowledge to enhance biomedical link prediction.
We integrate the LM-based models with KG embedding models, using a router method that learns to assign each input example to either type of model and provides a substantial boost in performance.
arXiv Detail & Related papers (2021-06-17T17:55:33Z) - Relational Learning Analysis of Social Politics using Knowledge Graph
Embedding [11.978556412301975]
This paper presents a novel credibility domain-based KG Embedding framework.
It involves capturing a fusion of data obtained from heterogeneous resources into a formal KG representation depicted by a domain.
The framework also embodies a credibility module to ensure data quality and trustworthiness.
arXiv Detail & Related papers (2020-06-02T14:10:28Z) - KGTK: A Toolkit for Large Knowledge Graph Manipulation and Analysis [9.141014703209494]
KGTK is a data science-centric toolkit designed to represent, create, transform, enhance and analyze KGs.
We illustrate the framework with real-world scenarios where we have used KGTK to integrate and manipulate large KGs, such as Wikidata, DBpedia and ConceptNet.
arXiv Detail & Related papers (2020-05-29T21:29:14Z) - ENT-DESC: Entity Description Generation by Exploring Knowledge Graph [53.03778194567752]
In practice, the input knowledge could be more than enough, since the output description may only cover the most significant knowledge.
We introduce a large-scale and challenging dataset to facilitate the study of such a practical scenario in KG-to-text.
We propose a multi-graph structure that is able to represent the original graph information more comprehensively.
arXiv Detail & Related papers (2020-04-30T14:16:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.