The Digitalization of Bioassays in the Open Research Knowledge Graph
- URL: http://arxiv.org/abs/2203.14574v1
- Date: Mon, 28 Mar 2022 08:35:01 GMT
- Title: The Digitalization of Bioassays in the Open Research Knowledge Graph
- Authors: Jennifer D'Souza and Anita Monteverdi and Muhammad Haris and Marco
Anteghini and Kheir Eddine Farfar and Markus Stocker and Vitor A.P. Martins
dos Santos and S\"oren Auer
- Abstract summary: ORKG-assays is an AI micro-service written in Python designed to assist scientists obtain semantified bioassays as a set of triples.
It uses an AI-based clustering algorithm which on gold-standard evaluations over 900 bioassays with 5,514 unique property-value pairs for 103 predicates shows competitive performance.
- Score: 6.508148285794385
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Background: Recent years are seeing a growing impetus in the semantification
of scholarly knowledge at the fine-grained level of scientific entities in
knowledge graphs. The Open Research Knowledge Graph (ORKG)
https://www.orkg.org/ represents an important step in this direction, with
thousands of scholarly contributions as structured, fine-grained,
machine-readable data. There is a need, however, to engender change in
traditional community practices of recording contributions as unstructured,
non-machine-readable text. For this in turn, there is a strong need for AI
tools designed for scientists that permit easy and accurate semantification of
their scholarly contributions. We present one such tool, ORKG-assays.
Implementation: ORKG-assays is a freely available AI micro-service in ORKG
written in Python designed to assist scientists obtain semantified bioassays as
a set of triples. It uses an AI-based clustering algorithm which on
gold-standard evaluations over 900 bioassays with 5,514 unique property-value
pairs for 103 predicates shows competitive performance. Results and Discussion:
As a result, semantified assay collections can be surveyed on the ORKG platform
via tabulation or chart-based visualizations of key property values of the
chemicals and compounds offering smart knowledge access to biochemists and
pharmaceutical researchers in the advancement of drug development.
Related papers
- BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science [43.624608816218505]
BioKGBench is an evaluation benchmark for AI-driven biomedical agents.
We first disentangle "Understanding Literature" into two atomic abilities.
We then formulate a novel agent task, dubbed KGCheck, using KGQA and domain-based Retrieval-Augmented Generation.
We collect over two thousand data for two atomic tasks and 225 high-quality annotated data for the agent task.
arXiv Detail & Related papers (2024-06-29T15:23:28Z) - SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature [80.49349719239584]
We present SciRIFF (Scientific Resource for Instruction-Following and Finetuning), a dataset of 137K instruction-following demonstrations for 54 tasks.
SciRIFF is the first dataset focused on extracting and synthesizing information from research literature across a wide range of scientific fields.
arXiv Detail & Related papers (2024-06-10T21:22:08Z) - Diversifying Knowledge Enhancement of Biomedical Language Models using
Adapter Modules and Knowledge Graphs [54.223394825528665]
We develop an approach that uses lightweight adapter modules to inject structured biomedical knowledge into pre-trained language models.
We use two large KGs, the biomedical knowledge system UMLS and the novel biochemical OntoChem, with two prominent biomedical PLMs, PubMedBERT and BioLinkBERT.
We show that our methodology leads to performance improvements in several instances while keeping requirements in computing power low.
arXiv Detail & Related papers (2023-12-21T14:26:57Z) - From Large Language Models to Knowledge Graphs for Biomarker Discovery
in Cancer [0.9437165725355702]
A challenging scenarios for artificial intelligence (AI) is using biomedical data to provide diagnosis and treatment recommendations for cancerous conditions.
A large-scale knowledge graph (KG) can be constructed by integrating and extracting facts about semantically interrelated entities and relations.
In this paper, we develop a domain KG to leverage cancer-specific biomarker discovery and interactive QA.
arXiv Detail & Related papers (2023-10-12T14:36:13Z) - Neurosymbolic AI for Reasoning on Biomedical Knowledge Graphs [0.9085310904484414]
Biomedical datasets are often modeled as knowledge graphs (KGs) because they capture the multi-relational, heterogeneous, and dynamic natures of biomedical systems.
KG completion (KGC), can, therefore, help researchers make predictions to inform tasks like drug repositioning.
arXiv Detail & Related papers (2023-07-17T11:47:05Z) - A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises [52.31710895034573]
This work provides the first comprehensive review of healthcare knowledge graphs (HKGs)
It summarizes the pipeline and key techniques for HKG construction, as well as the common utilization approaches.
At the application level, we delve into the successful integration of HKGs across various health domains.
arXiv Detail & Related papers (2023-06-07T21:51:56Z) - Relationship extraction for knowledge graph creation from biomedical
literature [0.0]
We present and compare few rule-based and machine learning-based methods for scalable relationship extraction from biomedical literature.
We examine how resilient are these various methods to unbalanced and fairly small datasets.
The best performing model was T5 model fine-tuned on balanced data, with reported F1-score of 0.88.
arXiv Detail & Related papers (2022-01-05T15:09:33Z) - Scientific Language Models for Biomedical Knowledge Base Completion: An
Empirical Study [62.376800537374024]
We study scientific LMs for KG completion, exploring whether we can tap into their latent knowledge to enhance biomedical link prediction.
We integrate the LM-based models with KG embedding models, using a router method that learns to assign each input example to either type of model and provides a substantial boost in performance.
arXiv Detail & Related papers (2021-06-17T17:55:33Z) - COVID-19 Literature Knowledge Graph Construction and Drug Repurposing
Report Generation [79.33545724934714]
We have developed a novel and comprehensive knowledge discovery framework, COVID-KG, to extract fine-grained multimedia knowledge elements from scientific literature.
Our framework also provides detailed contextual sentences, subfigures, and knowledge subgraphs as evidence.
arXiv Detail & Related papers (2020-07-01T16:03:20Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.