A Data-driven Approach for Noise Reduction in Distantly Supervised
Biomedical Relation Extraction
- URL: http://arxiv.org/abs/2005.12565v1
- Date: Tue, 26 May 2020 08:15:32 GMT
- Title: A Data-driven Approach for Noise Reduction in Distantly Supervised
Biomedical Relation Extraction
- Authors: Saadullah Amin, Katherine Ann Dunfield, Anna Vechkaeva and G\"unter
Neumann
- Abstract summary: We extend an entity-enriched relation classification BERT model to the problem of multiple instance learning.
We define a simple data encoding scheme that significantly reduces noise.
Our approach further encodes knowledge about the direction of relation triples, allowing for increased focus on relation learning.
- Score: 2.771933807499954
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Fact triples are a common form of structured knowledge used within the
biomedical domain. As the amount of unstructured scientific texts continues to
grow, manual annotation of these texts for the task of relation extraction
becomes increasingly expensive. Distant supervision offers a viable approach to
combat this by quickly producing large amounts of labeled, but considerably
noisy, data. We aim to reduce such noise by extending an entity-enriched
relation classification BERT model to the problem of multiple instance
learning, and defining a simple data encoding scheme that significantly reduces
noise, reaching state-of-the-art performance for distantly-supervised
biomedical relation extraction. Our approach further encodes knowledge about
the direction of relation triples, allowing for increased focus on relation
learning by reducing noise and alleviating the need for joint learning with
knowledge graph completion.
Related papers
- Learning to Denoise Biomedical Knowledge Graph for Robust Molecular Interaction Prediction [50.7901190642594]
We propose BioKDN (Biomedical Knowledge Graph Denoising Network) for robust molecular interaction prediction.
BioKDN refines the reliable structure of local subgraphs by denoising noisy links in a learnable manner.
It maintains consistent and robust semantics by smoothing relations around the target interaction.
arXiv Detail & Related papers (2023-12-09T07:08:00Z) - Sentence Bag Graph Formulation for Biomedical Distant Supervision
Relation Extraction [13.173870222632454]
We propose a graph view of sentence bags referring to an entity pair, which enables message-passing based aggregation of information related to the entity pair over the sentence bag.
The proposed framework alleviates the common problem of noisy labeling in distantly supervised relation extraction and also effectively incorporates inter-dependencies between sentences within a bag.
arXiv Detail & Related papers (2023-10-29T05:48:04Z) - ReOnto: A Neuro-Symbolic Approach for Biomedical Relation Extraction [3.263873198567265]
ReOnto Relation Extraction (RE) is the task of extracting semantic relationships between entities in a sentence and aligning them to relations defined in a vocabulary.
We present a novel technique that makes use of neuro symbolic knowledge for the RE task.
Experimental results on two public biomedical datasets, BioRel and ADE, show that our method outperforms all the baselines.
arXiv Detail & Related papers (2023-09-04T05:36:58Z) - Boosting Event Extraction with Denoised Structure-to-Text Augmentation [52.21703002404442]
Event extraction aims to recognize pre-defined event triggers and arguments from texts.
Recent data augmentation methods often neglect the problem of grammatical incorrectness.
We propose a denoised structure-to-text augmentation framework for event extraction DAEE.
arXiv Detail & Related papers (2023-05-16T16:52:07Z) - Enhancing Contrastive Learning with Noise-Guided Attack: Towards
Continual Relation Extraction in the Wild [57.468184469589744]
We develop a noise-resistant contrastive framework named as textbfNoise-guided textbfattack in textbfContrative textbfLearning(NaCL)
Compared to direct noise discarding or inaccessible noise relabeling, we present modifying the feature space to match the given noisy labels via attacking.
arXiv Detail & Related papers (2023-05-11T18:48:18Z) - Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report
Generation [92.73584302508907]
We propose a knowledge graph with Dynamic structure and nodes to facilitate medical report generation with Contrastive Learning.
In detail, the fundamental structure of our graph is pre-constructed from general knowledge.
Each image feature is integrated with its very own updated graph before being fed into the decoder module for report generation.
arXiv Detail & Related papers (2023-03-18T03:53:43Z) - BioIE: Biomedical Information Extraction with Multi-head Attention
Enhanced Graph Convolutional Network [9.227487525657901]
We propose Biomedical Information Extraction, a hybrid neural network to extract relations from biomedical text and unstructured medical reports.
We evaluate our model on two major biomedical relationship extraction tasks, chemical-disease relation and chemical-protein interaction, and a cross-hospital pan-cancer pathology report corpus.
arXiv Detail & Related papers (2021-10-26T13:19:28Z) - SAIS: Supervising and Augmenting Intermediate Steps for Document-Level
Relation Extraction [51.27558374091491]
We propose to explicitly teach the model to capture relevant contexts and entity types by supervising and augmenting intermediate steps (SAIS) for relation extraction.
Based on a broad spectrum of carefully designed tasks, our proposed SAIS method not only extracts relations of better quality due to more effective supervision, but also retrieves the corresponding supporting evidence more accurately.
arXiv Detail & Related papers (2021-09-24T17:37:35Z) - Modular Self-Supervision for Document-Level Relation Extraction [17.039775384229355]
We propose decomposing document-level relation extraction into relation detection and argument resolution.
We conduct a thorough evaluation in biomedical machine reading for precision oncology, where cross-paragraph relation mentions are prevalent.
Our method outperforms prior state of the art, such as multi-scale learning and graph neural networks, by over 20 absolute F1 points.
arXiv Detail & Related papers (2021-09-11T20:09:18Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - Relation Extraction from Biomedical and Clinical Text: Unified Multitask
Learning Framework [41.4339656250053]
We study the relation extraction task from three major biomedical and clinical tasks, namely drug-drug interaction, protein-protein interaction, and medical concept relation extraction.
We introduce for the first time the concept of structured self-attentive network complemented with the adversarial learning approach for the prediction of relationships from the biomedical and clinical text.
arXiv Detail & Related papers (2020-09-20T19:50:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.