Span-based joint entity and relation extraction augmented with sequence
tagging mechanism
- URL: http://arxiv.org/abs/2210.12720v1
- Date: Sun, 23 Oct 2022 12:39:27 GMT
- Title: Span-based joint entity and relation extraction augmented with sequence
tagging mechanism
- Authors: Bin Ji, Shasha Li, Hao Xu, Jie Yu, Jun Ma, Huijun Liu, Jing Yang
- Abstract summary: We propose a Sequence Tagging augmented Span-based Network (STSN), a span-based joint model that can make use of token-level label information.
Experimental results on three benchmark datasets show that STSN consistently outperforms the strongest baselines in terms of F1.
- Score: 13.782829752102785
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Span-based joint extraction simultaneously conducts named entity recognition
(NER) and relation extraction (RE) in text span form. However, since previous
span-based models rely on span-level classifications, they cannot benefit from
token-level label information, which has been proven advantageous for the task.
In this paper, we propose a Sequence Tagging augmented Span-based Network
(STSN), a span-based joint model that can make use of token-level label
information. In STSN, we construct a core neural architecture by deep stacking
multiple attention layers, each of which consists of three basic attention
units. On the one hand, the core architecture enables our model to learn
token-level label information via the sequence tagging mechanism and then uses
the information in the span-based joint extraction; on the other hand, it
establishes a bi-directional information interaction between NER and RE.
Experimental results on three benchmark datasets show that STSN consistently
outperforms the strongest baselines in terms of F1, creating new
state-of-the-art results.
Related papers
- HIORE: Leveraging High-order Interactions for Unified Entity Relation
Extraction [85.80317530027212]
We propose HIORE, a new method for unified entity relation extraction.
The key insight is to leverage the complex association among word pairs, which contains richer information than the first-order word-by-word interactions.
Experiments show that HIORE achieves the state-of-the-art performance on relation extraction and an improvement of 1.11.8 F1 points over the prior best unified model.
arXiv Detail & Related papers (2023-05-07T14:57:42Z) - Deep Dependency Networks for Multi-Label Classification [24.24496964886951]
We show that the performance of previous approaches that combine Markov Random Fields with neural networks can be modestly improved.
We propose a new modeling framework called deep dependency networks, which augments a dependency network.
Despite its simplicity, jointly learning this new architecture yields significant improvements in performance.
arXiv Detail & Related papers (2023-02-01T17:52:40Z) - ReSel: N-ary Relation Extraction from Scientific Text and Tables by
Learning to Retrieve and Select [53.071352033539526]
We study the problem of extracting N-ary relations from scientific articles.
Our proposed method ReSel decomposes this task into a two-stage procedure.
Our experiments on three scientific information extraction datasets show that ReSel outperforms state-of-the-art baselines significantly.
arXiv Detail & Related papers (2022-10-26T02:28:02Z) - Pack Together: Entity and Relation Extraction with Levitated Marker [61.232174424421025]
We propose a novel span representation approach, named Packed Levitated Markers, to consider the dependencies between the spans (pairs) by strategically packing the markers in the encoder.
Our experiments show that our model with packed levitated markers outperforms the sequence labeling model by 0.4%-1.9% F1 on three flat NER tasks, and beats the token concat model on six NER benchmarks.
arXiv Detail & Related papers (2021-09-13T15:38:13Z) - Boosting Span-based Joint Entity and Relation Extraction via Squence
Tagging Mechanism [10.894755638322]
Span-based joint extraction simultaneously conducts named entity recognition (NER) and relation extraction (RE) in text span form.
Recent studies have shown that token labels can convey crucial task-specific information and enrich token semantics.
We pro-pose Sequence Tagging enhanced Span-based Network (STSN), a span-based joint extrac-tion network that is enhanced by token BIO label information.
arXiv Detail & Related papers (2021-05-21T01:10:03Z) - CaEGCN: Cross-Attention Fusion based Enhanced Graph Convolutional
Network for Clustering [51.62959830761789]
We propose a cross-attention based deep clustering framework, named Cross-Attention Fusion based Enhanced Graph Convolutional Network (CaEGCN)
CaEGCN contains four main modules: cross-attention fusion, Content Auto-encoder, Graph Convolutional Auto-encoder and self-supervised model.
Experimental results on different types of datasets prove the superiority and robustness of the proposed CaEGCN.
arXiv Detail & Related papers (2021-01-18T05:21:59Z) - Few-Shot Named Entity Recognition: A Comprehensive Study [92.40991050806544]
We investigate three schemes to improve the model generalization ability for few-shot settings.
We perform empirical comparisons on 10 public NER datasets with various proportions of labeled data.
We create new state-of-the-art results on both few-shot and training-free settings.
arXiv Detail & Related papers (2020-12-29T23:43:16Z) - Sparse Semi-Supervised Action Recognition with Active Learning [10.558951653323286]
Current state-of-the-art methods for skeleton-based action recognition are supervised and rely on labels.
We propose a novel approach for skeleton-based action recognition, called SESAR, that connects these approaches.
Our results outperform standalone skeleton-based supervised, unsupervised with cluster identification, and active-learning methods for action recognition when applied to sparse labeled samples.
arXiv Detail & Related papers (2020-12-03T07:48:31Z) - A Self-Supervised Gait Encoding Approach with Locality-Awareness for 3D
Skeleton Based Person Re-Identification [65.18004601366066]
Person re-identification (Re-ID) via gait features within 3D skeleton sequences is a newly-emerging topic with several advantages.
This paper proposes a self-supervised gait encoding approach that can leverage unlabeled skeleton data to learn gait representations for person Re-ID.
arXiv Detail & Related papers (2020-09-05T16:06:04Z) - Skeleton-based Action Recognition via Spatial and Temporal Transformer
Networks [12.06555892772049]
We propose a novel Spatial-Temporal Transformer network (ST-TR) which models dependencies between joints using the Transformer self-attention operator.
The proposed ST-TR achieves state-of-the-art performance on all datasets when using joints' coordinates as input, and results on-par with state-of-the-art when adding bones information.
arXiv Detail & Related papers (2020-08-17T15:25:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.