Benchmark and Best Practices for Biomedical Knowledge Graph Embeddings
- URL: http://arxiv.org/abs/2006.13774v1
- Date: Wed, 24 Jun 2020 14:47:33 GMT
- Title: Benchmark and Best Practices for Biomedical Knowledge Graph Embeddings
- Authors: David Chang, Ivana Balazevic, Carl Allen, Daniel Chawla, Cynthia
Brandt, Richard Andrew Taylor
- Abstract summary: We train several state-of-the-art knowledge graph embedding models on the SNOMED-CT knowledge graph.
We make a case for the importance of leveraging the multi-relational nature of knowledge graphs for learning biomedical knowledge representation.
- Score: 8.835844347471626
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Much of biomedical and healthcare data is encoded in discrete, symbolic form
such as text and medical codes. There is a wealth of expert-curated biomedical
domain knowledge stored in knowledge bases and ontologies, but the lack of
reliable methods for learning knowledge representation has limited their
usefulness in machine learning applications. While text-based representation
learning has significantly improved in recent years through advances in natural
language processing, attempts to learn biomedical concept embeddings so far
have been lacking. A recent family of models called knowledge graph embeddings
have shown promising results on general domain knowledge graphs, and we explore
their capabilities in the biomedical domain. We train several state-of-the-art
knowledge graph embedding models on the SNOMED-CT knowledge graph, provide a
benchmark with comparison to existing methods and in-depth discussion on best
practices, and make a case for the importance of leveraging the
multi-relational nature of knowledge graphs for learning biomedical knowledge
representation. The embeddings, code, and materials will be made available to
the communitY.
Related papers
- The Role of Graph Topology in the Performance of Biomedical Knowledge Graph Completion Models [3.1666540219908272]
We conduct a comprehensive investigation into the properties of publicly available biomedical Knowledge Graphs.
We establish links to the accuracy observed in real-world applications.
We release all model predictions and a new suite of analysis tools.
arXiv Detail & Related papers (2024-09-06T08:09:15Z) - A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis [48.84443450990355]
Deep networks have achieved broad success in analyzing natural images, when applied to medical scans, they often fail in unexcepted situations.
We investigate this challenge and focus on model sensitivity to domain shifts, such as data sampled from different hospitals or data confounded by demographic variables such as sex, race, etc, in the context of chest X-rays and skin lesion images.
Taking inspiration from medical training, we propose giving deep networks a prior grounded in explicit medical knowledge communicated in natural language.
arXiv Detail & Related papers (2024-05-23T17:55:02Z) - MLIP: Enhancing Medical Visual Representation with Divergence Encoder
and Knowledge-guided Contrastive Learning [48.97640824497327]
We propose a novel framework leveraging domain-specific medical knowledge as guiding signals to integrate language information into the visual domain through image-text contrastive learning.
Our model includes global contrastive learning with our designed divergence encoder, local token-knowledge-patch alignment contrastive learning, and knowledge-guided category-level contrastive learning with expert knowledge.
Notably, MLIP surpasses state-of-the-art methods even with limited annotated data, highlighting the potential of multimodal pre-training in advancing medical representation learning.
arXiv Detail & Related papers (2024-02-03T05:48:50Z) - Graph Relation Distillation for Efficient Biomedical Instance
Segmentation [80.51124447333493]
We propose a graph relation distillation approach for efficient biomedical instance segmentation.
We introduce two graph distillation schemes deployed at both the intra-image level and the inter-image level.
Experimental results on a number of biomedical datasets validate the effectiveness of our approach.
arXiv Detail & Related papers (2024-01-12T04:41:23Z) - Diversifying Knowledge Enhancement of Biomedical Language Models using
Adapter Modules and Knowledge Graphs [54.223394825528665]
We develop an approach that uses lightweight adapter modules to inject structured biomedical knowledge into pre-trained language models.
We use two large KGs, the biomedical knowledge system UMLS and the novel biochemical OntoChem, with two prominent biomedical PLMs, PubMedBERT and BioLinkBERT.
We show that our methodology leads to performance improvements in several instances while keeping requirements in computing power low.
arXiv Detail & Related papers (2023-12-21T14:26:57Z) - LLaVA-Med: Training a Large Language-and-Vision Assistant for
Biomedicine in One Day [85.19963303642427]
We propose a cost-efficient approach for training a vision-language conversational assistant that can answer open-ended research questions of biomedical images.
The model first learns to align biomedical vocabulary using the figure-caption pairs as is, then learns to master open-ended conversational semantics.
This enables us to train a Large Language and Vision Assistant for BioMedicine in less than 15 hours (with eight A100s)
arXiv Detail & Related papers (2023-06-01T16:50:07Z) - Knowledge Graph Embeddings in the Biomedical Domain: Are They Useful? A
Look at Link Prediction, Rule Learning, and Downstream Polypharmacy Tasks [12.896135204106423]
This study aims to apply state-of-the-art knowledge graph embedding models in the context of a recent biomedical knowledge graph, BioKG.
We achieve a three-fold improvement in terms of performance based on the HITS@10 score over previous work on the same biomedical knowledge graph.
Results suggest that knowledge learnt from large biomedical knowledge graphs can be transferred to such downstream use cases.
arXiv Detail & Related papers (2023-05-31T16:04:25Z) - BERT Based Clinical Knowledge Extraction for Biomedical Knowledge Graph
Construction and Analysis [0.4893345190925178]
We propose an end-to-end approach for knowledge extraction and analysis from biomedical clinical notes.
The proposed framework can successfully extract relevant structured information with high accuracy.
arXiv Detail & Related papers (2023-04-21T14:45:33Z) - Scientific Language Models for Biomedical Knowledge Base Completion: An
Empirical Study [62.376800537374024]
We study scientific LMs for KG completion, exploring whether we can tap into their latent knowledge to enhance biomedical link prediction.
We integrate the LM-based models with KG embedding models, using a router method that learns to assign each input example to either type of model and provides a substantial boost in performance.
arXiv Detail & Related papers (2021-06-17T17:55:33Z) - A Literature Review of Recent Graph Embedding Techniques for Biomedical
Data [36.446560017794845]
Many graph-based learning methods have been proposed to analyze such type of data.
The main difficulty is how to handle high dimensionality and sparsity of the biomedical graphs.
graph embedding methods provide an effective and efficient way to address the above issues.
arXiv Detail & Related papers (2021-01-17T01:53:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.