Integrating Heterogeneous Domain Information into Relation Extraction: A
Case Study on Drug-Drug Interaction Extraction
- URL: http://arxiv.org/abs/2212.10714v1
- Date: Wed, 21 Dec 2022 01:26:07 GMT
- Title: Integrating Heterogeneous Domain Information into Relation Extraction: A
Case Study on Drug-Drug Interaction Extraction
- Authors: Masaki Asada
- Abstract summary: This thesis works on Drug-Drug Interactions (DDIs) from the literature as a case study.
A deep neural relation extraction model is prepared and its attention mechanism is analyzed.
In order to further exploit the heterogeneous information, drug-related items, such as protein entries, medical terms and pathways are collected.
- Score: 1.0152838128195465
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The development of deep neural networks has improved representation learning
in various domains, including textual, graph structural, and relational triple
representations. This development opened the door to new relation extraction
beyond the traditional text-oriented relation extraction. However, research on
the effectiveness of considering multiple heterogeneous domain information
simultaneously is still under exploration, and if a model can take an advantage
of integrating heterogeneous information, it is expected to exhibit a
significant contribution to many problems in the world. This thesis works on
Drug-Drug Interactions (DDIs) from the literature as a case study and realizes
relation extraction utilizing heterogeneous domain information. First, a deep
neural relation extraction model is prepared and its attention mechanism is
analyzed. Next, a method to combine the drug molecular structure information
and drug description information to the input sentence information is proposed,
and the effectiveness of utilizing drug molecular structures and drug
descriptions for the relation extraction task is shown. Then, in order to
further exploit the heterogeneous information, drug-related items, such as
protein entries, medical terms and pathways are collected from multiple
existing databases and a new data set in the form of a knowledge graph (KG) is
constructed. A link prediction task on the constructed data set is conducted to
obtain embedding representations of drugs that contain the heterogeneous domain
information. Finally, a method that integrates the input sentence information
and the heterogeneous KG information is proposed. The proposed model is trained
and evaluated on a widely used data set, and as a result, it is shown that
utilizing heterogeneous domain information significantly improves the
performance of relation extraction from the literature.
Related papers
- Extracting Protein-Protein Interactions (PPIs) from Biomedical
Literature using Attention-based Relational Context Information [5.456047952635665]
This work presents a unified, multi-source PPI corpora with vetted interaction definitions augmented by binary interaction type labels.
A Transformer-based deep learning method exploits entities' relational context information for relation representation to improve relation classification performance.
The model's performance is evaluated on four widely studied biomedical relation extraction datasets.
arXiv Detail & Related papers (2024-03-08T01:43:21Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Sentence Bag Graph Formulation for Biomedical Distant Supervision
Relation Extraction [13.173870222632454]
We propose a graph view of sentence bags referring to an entity pair, which enables message-passing based aggregation of information related to the entity pair over the sentence bag.
The proposed framework alleviates the common problem of noisy labeling in distantly supervised relation extraction and also effectively incorporates inter-dependencies between sentences within a bag.
arXiv Detail & Related papers (2023-10-29T05:48:04Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - Communicative Subgraph Representation Learning for Multi-Relational
Inductive Drug-Gene Interaction Prediction [17.478102754113294]
We propose a novel Communicative Subgraph representation learning for Multi-relational Inductive drug-Gene interactions prediction (CoSMIG)
The model strengthened the relations on the drug-gene graph through a communicative message passing mechanism.
Our method outperformed state-of-the-art baselines in the transductive scenarios and achieved superior performance in the inductive ones.
arXiv Detail & Related papers (2022-05-12T08:53:45Z) - Graph-in-Graph (GiG): Learning interpretable latent graphs in
non-Euclidean domain for biological and healthcare applications [52.65389473899139]
Graphs are a powerful tool for representing and analyzing unstructured, non-Euclidean data ubiquitous in the healthcare domain.
Recent works have shown that considering relationships between input data samples have a positive regularizing effect for the downstream task.
We propose Graph-in-Graph (GiG), a neural network architecture for protein classification and brain imaging applications.
arXiv Detail & Related papers (2022-04-01T10:01:37Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - Predicting Drug-Drug Interactions from Heterogeneous Data: An Embedding
Approach [20.87835183671462]
We present the first work that uses multiple data sources such as drug structure images, drug structure string representation and relational representation of drug relationships as the input.
Our empirical evaluation against several state-of-the-art methods using standalone different data types for drugs clearly demonstrate the efficacy of combining heterogeneous data in predicting DDIs.
arXiv Detail & Related papers (2021-03-19T17:25:48Z) - Deep Co-Attention Network for Multi-View Subspace Learning [73.3450258002607]
We propose a deep co-attention network for multi-view subspace learning.
It aims to extract both the common information and the complementary information in an adversarial setting.
In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation.
arXiv Detail & Related papers (2021-02-15T18:46:44Z) - A logic-based relational learning approach to relation extraction: The
OntoILPER system [0.9176056742068812]
We present OntoILPER, a logic-based relational learning approach to Relation Extraction.
OntoILPER takes profit of a rich relational representation of examples, which can alleviate the drawbacks.
The proposed relational approach seems to be more suitable for Relation Extraction than statistical ones.
arXiv Detail & Related papers (2020-01-13T12:47:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.