Discovering Drug-Target Interaction Knowledge from Biomedical Literature
- URL: http://arxiv.org/abs/2109.13187v1
- Date: Mon, 27 Sep 2021 17:00:14 GMT
- Title: Discovering Drug-Target Interaction Knowledge from Biomedical Literature
- Authors: Yutai Hou, Yingce Xia, Lijun Wu, Shufang Xie, Yang Fan, Jinhua Zhu,
Wanxiang Che, Tao Qin, Tie-Yan Liu
- Abstract summary: The Interaction between Drugs and Targets (DTI) in human body plays a crucial role in biomedical science and applications.
As millions of papers come out every year in the biomedical domain, automatically discovering DTI knowledge from literature becomes an urgent demand in the industry.
We explore the first end-to-end solution for this task by using generative approaches.
We regard the DTI triplets as a sequence and use a Transformer-based model to directly generate them without using the detailed annotations of entities and relations.
- Score: 107.98712673387031
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Interaction between Drugs and Targets (DTI) in human body plays a crucial
role in biomedical science and applications. As millions of papers come out
every year in the biomedical domain, automatically discovering DTI knowledge
from biomedical literature, which are usually triplets about drugs, targets and
their interaction, becomes an urgent demand in the industry. Existing methods
of discovering biological knowledge are mainly extractive approaches that often
require detailed annotations (e.g., all mentions of biological entities,
relations between every two entity mentions, etc.). However, it is difficult
and costly to obtain sufficient annotations due to the requirement of expert
knowledge from biomedical domains. To overcome these difficulties, we explore
the first end-to-end solution for this task by using generative approaches. We
regard the DTI triplets as a sequence and use a Transformer-based model to
directly generate them without using the detailed annotations of entities and
relations. Further, we propose a semi-supervised method, which leverages the
aforementioned end-to-end model to filter unlabeled literature and label them.
Experimental results show that our method significantly outperforms extractive
baselines on DTI discovery. We also create a dataset, KD-DTI, to advance this
task and will release it to the community.
Related papers
- Generalized knowledge-enhanced framework for biomedical entity and relation extraction [0.6856896119187885]
We develop a novel framework to construct a task-independent and reusable background knowledge graph for biomedical entity and relation extraction.
The design of our model is inspired by how humans learn domain-specific topics.
Our framework employs such common-knowledge-sharing mechanism to build a general neural-network knowledge graph that is learning transferable to different domain-specific biomedical texts effectively.
arXiv Detail & Related papers (2024-08-13T04:06:45Z) - Intent Detection and Entity Extraction from BioMedical Literature [14.52164637112797]
Large-language models (LLMs) motivated by endeavours to attain generalized intelligence, their efficacy in replacing task and domain-specific natural language understanding approaches remains questionable.
We show that Supervised Fine Tuned approaches are still relevant and more effective than general-purpose LLMs.
arXiv Detail & Related papers (2024-04-04T17:09:52Z) - Diversifying Knowledge Enhancement of Biomedical Language Models using
Adapter Modules and Knowledge Graphs [54.223394825528665]
We develop an approach that uses lightweight adapter modules to inject structured biomedical knowledge into pre-trained language models.
We use two large KGs, the biomedical knowledge system UMLS and the novel biochemical OntoChem, with two prominent biomedical PLMs, PubMedBERT and BioLinkBERT.
We show that our methodology leads to performance improvements in several instances while keeping requirements in computing power low.
arXiv Detail & Related papers (2023-12-21T14:26:57Z) - High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models [1.9665865095034865]
We formulate the relation extraction task as binary classifications for large language models.
We designate the main title as the tail entity and explicitly incorporate it into the context.
Longer contents are sliced into text chunks, embedded, and retrieved with additional embedding models.
arXiv Detail & Related papers (2023-12-13T16:43:41Z) - SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction
and Drug Design [64.69434941796904]
We propose a novel setting and models for in-context drug synergy learning.
We are given a small "personalized dataset" of 10-20 drug synergy relationships in the context of specific cancer cell targets.
Our goal is to predict additional drug synergy relationships in that context.
arXiv Detail & Related papers (2023-06-19T17:03:46Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - Knowledge-Rich Self-Supervised Entity Linking [58.838404666183656]
Knowledge-RIch Self-Supervision ($tt KRISSBERT$) is a universal entity linker for four million UMLS entities.
Our approach subsumes zero-shot and few-shot methods, and can easily incorporate entity descriptions and gold mention labels if available.
Without using any labeled information, our method produces $tt KRISSBERT$, a universal entity linker for four million UMLS entities.
arXiv Detail & Related papers (2021-12-15T05:05:12Z) - R-BERT-CNN: Drug-target interactions extraction from biomedical
literature [1.8814209805277506]
We present our participation for the DrugProt task BioCreative VII challenge.
Drug-target interactions (DTIs) are critical for drug discovery and repurposing.
There are >32M biomedical articles on PubMed and manually extracting DTIs from such a huge knowledge base is challenging.
arXiv Detail & Related papers (2021-10-31T22:50:33Z) - Towards Incorporating Entity-specific Knowledge Graph Information in
Predicting Drug-Drug Interactions [1.14219428942199]
We propose a new method, BERTKG-DDI, to combine drug embeddings obtained from its interaction with other biomedical entities along with domain-specific BioBERT embedding-based RC architecture.
Experiments conducted on the DDIExtraction 2013 corpus clearly indicate that this strategy improves other baselines architectures by 4.1% macro F1-score.
arXiv Detail & Related papers (2020-12-21T06:44:32Z) - MolTrans: Molecular Interaction Transformer for Drug Target Interaction
Prediction [68.5766865583049]
Drug target interaction (DTI) prediction is a foundational task for in silico drug discovery.
Recent years have witnessed promising progress for deep learning in DTI predictions.
We propose a Molecular Interaction Transformer (TransMol) to address these limitations.
arXiv Detail & Related papers (2020-04-23T18:56:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.