A Biomedical Knowledge Graph for Biomarker Discovery in Cancer
- URL: http://arxiv.org/abs/2302.04737v1
- Date: Thu, 9 Feb 2023 16:17:57 GMT
- Title: A Biomedical Knowledge Graph for Biomarker Discovery in Cancer
- Authors: Md. Rezaul Karim and Lina Comet and Oya Beyan and Michael Cochez and
Dietrich Rebholz-Schuhmann and Stefan Decker
- Abstract summary: A domain-specific knowledge graph(KG) is an explicit conceptualization of a specific subject-matter domain.
The KG is constructed by integrating cancer-related knowledge and facts from multiple sources.
We listed down some queries and some examples of QA and deducing knowledge based on the KG.
- Score: 1.7860709946876898
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Structured and unstructured data and facts about drugs, genes, protein,
viruses, and their mechanism are spread across a huge number of scientific
articles. These articles are a large-scale knowledge source and can have a huge
impact on disseminating knowledge about the mechanisms of certain biological
processes. A domain-specific knowledge graph~(KG) is an explicit
conceptualization of a specific subject-matter domain represented w.r.t
semantically interrelated entities and relations. A KG can be constructed by
integrating such facts and data and be used for data integration, exploration,
and federated queries. However, exploration and querying large-scale KGs is
tedious for certain groups of users due to a lack of knowledge about underlying
data assets or semantic technologies. Such a KG will not only allow deducing
new knowledge and question answering(QA) but also allows domain experts to
explore. Since cross-disciplinary explanations are important for accurate
diagnosis, it is important to query the KG to provide interactive explanations
about learned biomarkers. Inspired by these, we construct a domain-specific KG,
particularly for cancer-specific biomarker discovery. The KG is constructed by
integrating cancer-related knowledge and facts from multiple sources. First, we
construct a domain-specific ontology, which we call OncoNet Ontology (ONO). The
ONO ontology is developed to enable semantic reasoning for verification of the
predictions for relations between diseases and genes. The KG is then developed
and enriched by harmonizing the ONO, additional metadata schemas, ontologies,
controlled vocabularies, and additional concepts from external sources using a
BERT-based information extraction method. BioBERT and SciBERT are finetuned
with the selected articles crawled from PubMed. We listed down some queries and
some examples of QA and deducing knowledge based on the KG.
Related papers
- Diversifying Knowledge Enhancement of Biomedical Language Models using
Adapter Modules and Knowledge Graphs [54.223394825528665]
We develop an approach that uses lightweight adapter modules to inject structured biomedical knowledge into pre-trained language models.
We use two large KGs, the biomedical knowledge system UMLS and the novel biochemical OntoChem, with two prominent biomedical PLMs, PubMedBERT and BioLinkBERT.
We show that our methodology leads to performance improvements in several instances while keeping requirements in computing power low.
arXiv Detail & Related papers (2023-12-21T14:26:57Z) - From Large Language Models to Knowledge Graphs for Biomarker Discovery
in Cancer [0.9437165725355702]
A challenging scenarios for artificial intelligence (AI) is using biomedical data to provide diagnosis and treatment recommendations for cancerous conditions.
A large-scale knowledge graph (KG) can be constructed by integrating and extracting facts about semantically interrelated entities and relations.
In this paper, we develop a domain KG to leverage cancer-specific biomarker discovery and interactive QA.
arXiv Detail & Related papers (2023-10-12T14:36:13Z) - Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical
Knowledge Graphs [45.53337864477857]
Know2BIO is a general-purpose heterogeneous KG benchmark for the biomedical domain.
It integrates data from 30 diverse sources, capturing intricate relationships across 11 biomedical categories.
Know2BIO is capable of user-directed automated updating to reflect the latest knowledge in biomedical science.
arXiv Detail & Related papers (2023-10-05T00:34:56Z) - Biomedical Multi-hop Question Answering Using Knowledge Graph Embeddings
and Language Models [0.0]
We have created a multi-hop biomedical question-answering dataset in natural language for testing the biomedical multi-hop question-answering system.
The major contribution of this research is an integrated system that combines language models with KG embeddings to give highly relevant answers to free-form questions.
arXiv Detail & Related papers (2022-11-10T05:43:57Z) - Question Answering Over Biological Knowledge Graph via Amazon Alexa [0.4462334751640166]
This paper is about using Amazon Alexa's voice-enabled interface for QA over knowledge graph (KG)
As a proof-of-concept, we use the well-known DisgeNET KG, which contains knowledge covering 1.13 million gene-disease associations.
Our study shows how Alex could be of help to find facts about certain biological entities from large-scale knowledge bases.
arXiv Detail & Related papers (2022-10-12T09:17:06Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - BertNet: Harvesting Knowledge Graphs with Arbitrary Relations from
Pretrained Language Models [65.51390418485207]
We propose a new approach of harvesting massive KGs of arbitrary relations from pretrained LMs.
With minimal input of a relation definition, the approach efficiently searches in the vast entity pair space to extract diverse accurate knowledge.
We deploy the approach to harvest KGs of over 400 new relations from different LMs.
arXiv Detail & Related papers (2022-06-28T19:46:29Z) - Scientific Language Models for Biomedical Knowledge Base Completion: An
Empirical Study [62.376800537374024]
We study scientific LMs for KG completion, exploring whether we can tap into their latent knowledge to enhance biomedical link prediction.
We integrate the LM-based models with KG embedding models, using a router method that learns to assign each input example to either type of model and provides a substantial boost in performance.
arXiv Detail & Related papers (2021-06-17T17:55:33Z) - COVID-19 Literature Knowledge Graph Construction and Drug Repurposing
Report Generation [79.33545724934714]
We have developed a novel and comprehensive knowledge discovery framework, COVID-KG, to extract fine-grained multimedia knowledge elements from scientific literature.
Our framework also provides detailed contextual sentences, subfigures, and knowledge subgraphs as evidence.
arXiv Detail & Related papers (2020-07-01T16:03:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.