Extracting Protein-Protein Interactions (PPIs) from Biomedical
Literature using Attention-based Relational Context Information
- URL: http://arxiv.org/abs/2403.05602v1
- Date: Fri, 8 Mar 2024 01:43:21 GMT
- Title: Extracting Protein-Protein Interactions (PPIs) from Biomedical
Literature using Attention-based Relational Context Information
- Authors: Gilchan Park, Sean McCorkle, Carlos Soto, Ian Blaby, Shinjae Yoo
- Abstract summary: This work presents a unified, multi-source PPI corpora with vetted interaction definitions augmented by binary interaction type labels.
A Transformer-based deep learning method exploits entities' relational context information for relation representation to improve relation classification performance.
The model's performance is evaluated on four widely studied biomedical relation extraction datasets.
- Score: 5.456047952635665
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Because protein-protein interactions (PPIs) are crucial to understand living
systems, harvesting these data is essential to probe disease development and
discern gene/protein functions and biological processes. Some curated datasets
contain PPI data derived from the literature and other sources (e.g., IntAct,
BioGrid, DIP, and HPRD). However, they are far from exhaustive, and their
maintenance is a labor-intensive process. On the other hand, machine learning
methods to automate PPI knowledge extraction from the scientific literature
have been limited by a shortage of appropriate annotated data. This work
presents a unified, multi-source PPI corpora with vetted interaction
definitions augmented by binary interaction type labels and a Transformer-based
deep learning method that exploits entities' relational context information for
relation representation to improve relation classification performance. The
model's performance is evaluated on four widely studied biomedical relation
extraction datasets, as well as this work's target PPI datasets, to observe the
effectiveness of the representation to relation extraction tasks in various
data. Results show the model outperforms prior state-of-the-art models. The
code and data are available at:
https://github.com/BNLNLP/PPI-Relation-Extraction
Related papers
- Graph Relation Distillation for Efficient Biomedical Instance
Segmentation [80.51124447333493]
We propose a graph relation distillation approach for efficient biomedical instance segmentation.
We introduce two graph distillation schemes deployed at both the intra-image level and the inter-image level.
Experimental results on a number of biomedical datasets validate the effectiveness of our approach.
arXiv Detail & Related papers (2024-01-12T04:41:23Z) - Learning to Denoise Biomedical Knowledge Graph for Robust Molecular Interaction Prediction [50.7901190642594]
We propose BioKDN (Biomedical Knowledge Graph Denoising Network) for robust molecular interaction prediction.
BioKDN refines the reliable structure of local subgraphs by denoising noisy links in a learnable manner.
It maintains consistent and robust semantics by smoothing relations around the target interaction.
arXiv Detail & Related papers (2023-12-09T07:08:00Z) - BioREx: Improving Biomedical Relation Extraction by Leveraging
Heterogeneous Datasets [7.7587371896752595]
Biomedical relation extraction (RE) is a central task in biomedical natural language processing (NLP) research.
We present a novel framework for systematically addressing the data heterogeneity of individual datasets and combining them into a large dataset.
Our evaluation shows that BioREx achieves significantly higher performance than the benchmark system trained on the individual dataset.
arXiv Detail & Related papers (2023-06-19T22:48:18Z) - BioBLP: A Modular Framework for Learning on Multimodal Biomedical
Knowledge Graphs [3.780924717521521]
We propose a modular framework for learning embeddings in knowledge graphs.
It allows encoding attribute data of different modalities while also supporting entities with missing attributes.
We train models using a biomedical KG containing approximately 2 million triples.
arXiv Detail & Related papers (2023-06-06T11:49:38Z) - SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Efficient
and Generalizable Protein-Protein Interaction Prediction [16.203794286288815]
Protein-protein interactions (PPIs) are crucial in various biological processes and their study has significant implications for drug development and disease diagnosis.
Existing deep learning methods suffer from significant performance degradation under complex real-world scenarios.
We propose a self-ensembling multigraph neural network (SemiGNN-PPI) that can effectively predict PPIs while being both efficient and generalizable.
arXiv Detail & Related papers (2023-05-15T03:06:44Z) - Does Synthetic Data Generation of LLMs Help Clinical Text Mining? [51.205078179427645]
We investigate the potential of OpenAI's ChatGPT to aid in clinical text mining.
We propose a new training paradigm that involves generating a vast quantity of high-quality synthetic data.
Our method has resulted in significant improvements in the performance of downstream tasks.
arXiv Detail & Related papers (2023-03-08T03:56:31Z) - Integrating Heterogeneous Domain Information into Relation Extraction: A
Case Study on Drug-Drug Interaction Extraction [1.0152838128195465]
This thesis works on Drug-Drug Interactions (DDIs) from the literature as a case study.
A deep neural relation extraction model is prepared and its attention mechanism is analyzed.
In order to further exploit the heterogeneous information, drug-related items, such as protein entries, medical terms and pathways are collected.
arXiv Detail & Related papers (2022-12-21T01:26:07Z) - Combining Feature and Instance Attribution to Detect Artifacts [62.63504976810927]
We propose methods to facilitate identification of training data artifacts.
We show that this proposed training-feature attribution approach can be used to uncover artifacts in training data.
We execute a small user study to evaluate whether these methods are useful to NLP researchers in practice.
arXiv Detail & Related papers (2021-07-01T09:26:13Z) - Type-augmented Relation Prediction in Knowledge Graphs [65.88395564516115]
We propose a type-augmented relation prediction (TaRP) method, where we apply both the type information and instance-level information for relation prediction.
Our proposed TaRP method achieves significantly better performance than state-of-the-art methods on four benchmark datasets.
arXiv Detail & Related papers (2020-09-16T21:14:18Z) - Assigning function to protein-protein interactions: a weakly supervised
BioBERT based approach using PubMed abstracts [2.208694022993555]
Protein-protein interactions (PPI) are critical to the function of proteins in both normal and diseased cells.
Only a small percentage of PPIs captured in protein interaction databases have annotations of function available.
Here, we aim to label the function type of PPIs by extracting relationships described in PubMed abstracts.
arXiv Detail & Related papers (2020-08-20T01:42:28Z) - Mining Implicit Entity Preference from User-Item Interaction Data for
Knowledge Graph Completion via Adversarial Learning [82.46332224556257]
We propose a novel adversarial learning approach by leveraging user interaction data for the Knowledge Graph Completion task.
Our generator is isolated from user interaction data, and serves to improve the performance of the discriminator.
To discover implicit entity preference of users, we design an elaborate collaborative learning algorithms based on graph neural networks.
arXiv Detail & Related papers (2020-03-28T05:47:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.