Jointprop: Joint Semi-supervised Learning for Entity and Relation
Extraction with Heterogeneous Graph-based Propagation
- URL: http://arxiv.org/abs/2305.15872v1
- Date: Thu, 25 May 2023 09:07:04 GMT
- Title: Jointprop: Joint Semi-supervised Learning for Entity and Relation
Extraction with Heterogeneous Graph-based Propagation
- Authors: Yandan Zheng, Anran Hao, Anh Tuan Luu
- Abstract summary: We propose Jointprop, a Heterogeneous Graph-based Propagation framework for joint semi-supervised entity and relation extraction.
We construct a unified span-based heterogeneous graph from entity and relation candidates and propagate class labels based on confidence scores.
We show that our framework outperforms the state-of-the-art semi-supervised approaches on NER and RE tasks.
- Score: 13.418617500641401
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Semi-supervised learning has been an important approach to address challenges
in extracting entities and relations from limited data. However, current
semi-supervised works handle the two tasks (i.e., Named Entity Recognition and
Relation Extraction) separately and ignore the cross-correlation of entity and
relation instances as well as the existence of similar instances across
unlabeled data. To alleviate the issues, we propose Jointprop, a Heterogeneous
Graph-based Propagation framework for joint semi-supervised entity and relation
extraction, which captures the global structure information between individual
tasks and exploits interactions within unlabeled data. Specifically, we
construct a unified span-based heterogeneous graph from entity and relation
candidates and propagate class labels based on confidence scores. We then
employ a propagation learning scheme to leverage the affinities between
labelled and unlabeled samples. Experiments on benchmark datasets show that our
framework outperforms the state-of-the-art semi-supervised approaches on NER
and RE tasks. We show that the joint semi-supervised learning of the two tasks
benefits from their codependency and validates the importance of utilizing the
shared information between unlabeled data.
Related papers
- Enhancing Missing Data Imputation through Combined Bipartite Graph and Complete Directed Graph [18.06658040186476]
We introduce a novel framework named the Bipartite and Complete Directed Graph Neural Network (BCGNN)
Within BCGNN, observations and features are differentiated as two distinct node types, and the values of observed features are converted into attributed edges linking them.
In parallel, the complete directed graph segment adeptly outlines and communicates the complex interdependencies among features.
arXiv Detail & Related papers (2024-11-07T17:48:37Z) - SEG:Seeds-Enhanced Iterative Refinement Graph Neural Network for Entity Alignment [13.487673375206276]
This paper presents a soft label propagation framework that integrates multi-source data and iterative seed enhancement.
A bidirectional weighted joint loss function is implemented, which reduces the distance between positive samples and differentially processes negative samples.
Our method outperforms existing semi-supervised approaches, as evidenced by superior results on multiple datasets.
arXiv Detail & Related papers (2024-10-28T04:50:46Z) - Entity Alignment with Unlabeled Dangling Cases [49.86384156476041]
We propose a novel GNN-based dangling detection and entity alignment framework.
While the two tasks share the same GNN, the detected dangling entities are removed in the alignment.
Our framework is featured by a designed entity and relation attention mechanism for selective neighborhood aggregation in representation learning.
arXiv Detail & Related papers (2024-03-16T17:21:58Z) - Distantly-Supervised Joint Extraction with Noise-Robust Learning [36.23022433465051]
We focus on the problem of joint extraction in distantly-labeled data, whose labels are generated by aligning entity mentions with the corresponding entity and relation tags using a knowledge base (KB)
Existing approaches, either considering only one source of noise or making decisions using external knowledge, cannot well-utilize significant information in the training data.
We propose DENRL, a generalizable framework that incorporates a lightweight transformer backbone into a sequence labeling scheme for joint tagging.
arXiv Detail & Related papers (2023-10-08T03:42:15Z) - CARE: Co-Attention Network for Joint Entity and Relation Extraction [0.0]
We propose a Co-Attention network for joint entity and relation extraction.
Our approach includes adopting a parallel encoding strategy to learn separate representations for each subtask.
At the core of our approach is the co-attention module that captures two-way interaction between the two subtasks.
arXiv Detail & Related papers (2023-08-24T03:40:54Z) - Relation Clustering in Narrative Knowledge Graphs [71.98234178455398]
relational sentences in the original text are embedded (with SBERT) and clustered in order to merge together semantically similar relations.
Preliminary tests show that such clustering might successfully detect similar relations, and provide a valuable preprocessing for semi-supervised approaches.
arXiv Detail & Related papers (2020-11-27T10:43:04Z) - Cross-Supervised Joint-Event-Extraction with Heterogeneous Information
Networks [61.950353376870154]
Joint-event-extraction is a sequence-to-sequence labeling task with a tag set composed of tags of triggers and entities.
We propose a Cross-Supervised Mechanism (CSM) to alternately supervise the extraction of triggers or entities.
Our approach outperforms the state-of-the-art methods in both entity and trigger extraction.
arXiv Detail & Related papers (2020-10-13T11:51:17Z) - Unsupervised Heterogeneous Coupling Learning for Categorical
Representation [50.1603042640492]
This work introduces a UNsupervised heTerogeneous couplIng lEarning (UNTIE) approach for representing coupled categorical data by untying the interactions between couplings.
UNTIE is efficiently optimized w.r.t. a kernel k-means objective function for unsupervised representation learning of heterogeneous and hierarchical value-to-object couplings.
The UNTIE-learned representations make significant performance improvement against the state-of-the-art categorical representations and deep representation models.
arXiv Detail & Related papers (2020-07-21T11:23:27Z) - Dual-Teacher: Integrating Intra-domain and Inter-domain Teachers for
Annotation-efficient Cardiac Segmentation [65.81546955181781]
We propose a novel semi-supervised domain adaptation approach, namely Dual-Teacher.
The student model learns the knowledge of unlabeled target data and labeled source data by two teacher models.
We demonstrate that our approach is able to concurrently utilize unlabeled data and cross-modality data with superior performance.
arXiv Detail & Related papers (2020-07-13T10:00:44Z) - Relabel the Noise: Joint Extraction of Entities and Relations via
Cooperative Multiagents [52.55119217982361]
We propose a joint extraction approach to handle noisy instances with a group of cooperative multiagents.
To handle noisy instances in a fine-grained manner, each agent in the cooperative group evaluates the instance by calculating a continuous confidence score from its own perspective.
A confidence consensus module is designed to gather the wisdom of all agents and re-distribute the noisy training set with confidence-scored labels.
arXiv Detail & Related papers (2020-04-21T12:03:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.