Unified Named Entity Recognition as Word-Word Relation Classification
- URL: http://arxiv.org/abs/2112.10070v1
- Date: Sun, 19 Dec 2021 06:11:07 GMT
- Title: Unified Named Entity Recognition as Word-Word Relation Classification
- Authors: Jingye Li, Hao Fei, Jiang Liu, Shengqiong Wu, Meishan Zhang, Chong
Teng, Donghong Ji, Fei Li
- Abstract summary: We present a novel alternative by modeling the unified NER as word-word relation classification, namely W2NER.
The architecture resolves the kernel bottleneck of unified NER by effectively modeling the neighboring relations between entity words.
Based on the W2NER scheme we develop a neural framework, in which the unified NER is modeled as a 2D grid of word pairs.
- Score: 25.801945832005504
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: So far, named entity recognition (NER) has been involved with three major
types, including flat, overlapped (aka. nested), and discontinuous NER, which
have mostly been studied individually. Recently, a growing interest has been
built for unified NER, tackling the above three jobs concurrently with one
single model. Current best-performing methods mainly include span-based and
sequence-to-sequence models, where unfortunately the former merely focus on
boundary identification and the latter may suffer from exposure bias. In this
work, we present a novel alternative by modeling the unified NER as word-word
relation classification, namely W^2NER. The architecture resolves the kernel
bottleneck of unified NER by effectively modeling the neighboring relations
between entity words with Next-Neighboring-Word (NNW) and Tail-Head-Word-*
(THW-*) relations. Based on the W^2NER scheme we develop a neural framework, in
which the unified NER is modeled as a 2D grid of word pairs. We then propose
multi-granularity 2D convolutions for better refining the grid representations.
Finally, a co-predictor is used to sufficiently reason the word-word relations.
We perform extensive experiments on 14 widely-used benchmark datasets for flat,
overlapped, and discontinuous NER (8 English and 6 Chinese datasets), where our
model beats all the current top-performing baselines, pushing the
state-of-the-art performances of unified NER.
Related papers
- In-Context Learning for Few-Shot Nested Named Entity Recognition [53.55310639969833]
We introduce an effective and innovative ICL framework for the setting of few-shot nested NER.
We improve the ICL prompt by devising a novel example demonstration selection mechanism, EnDe retriever.
In EnDe retriever, we employ contrastive learning to perform three types of representation learning, in terms of semantic similarity, boundary similarity, and label similarity.
arXiv Detail & Related papers (2024-02-02T06:57:53Z) - NERetrieve: Dataset for Next Generation Named Entity Recognition and
Retrieval [49.827932299460514]
We argue that capabilities provided by large language models are not the end of NER research, but rather an exciting beginning.
We present three variants of the NER task, together with a dataset to support them.
We provide a large, silver-annotated corpus of 4 million paragraphs covering 500 entity types.
arXiv Detail & Related papers (2023-10-22T12:23:00Z) - Gaussian Prior Reinforcement Learning for Nested Named Entity
Recognition [52.46740830977898]
We propose a novel seq2seq model named GPRL, which formulates the nested NER task as an entity triplet sequence generation process.
Experiments on three nested NER datasets demonstrate that GPRL outperforms previous nested NER models.
arXiv Detail & Related papers (2023-05-12T05:55:34Z) - Multi-task Transformer with Relation-attention and Type-attention for
Named Entity Recognition [35.44123819012004]
Named entity recognition (NER) is an important research problem in natural language processing.
This paper proposes a multi-task Transformer, which incorporates an entity boundary detection task into the named entity recognition task.
arXiv Detail & Related papers (2023-03-20T05:11:22Z) - TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding
Tag/Word Relations and More Fine-Grained Tags [32.95446649391798]
We propose a competitive grid-tagging model for discontinuous named entity recognition.
We incorporate two kinds of Tag-Oriented Enhancement mechanisms into a state-of-the-art (SOTA) grid-tagging model.
Our model pushes the SOTA results by about 0.83%, 0.05% and 0.66% in F1, demonstrating its effectiveness.
arXiv Detail & Related papers (2022-11-01T18:17:49Z) - Optimizing Bi-Encoder for Named Entity Recognition via Contrastive
Learning [80.36076044023581]
We present an efficient bi-encoder framework for named entity recognition (NER)
We frame NER as a metric learning problem that maximizes the similarity between the vector representations of an entity mention and its type.
A major challenge to this bi-encoder formulation for NER lies in separating non-entity spans from entity mentions.
arXiv Detail & Related papers (2022-08-30T23:19:04Z) - An Embarrassingly Easy but Strong Baseline for Nested Named Entity
Recognition [55.080101447586635]
We propose using Conal Neural Network (CNN) to model spatial relations in the score matrix.
Our model surpasses several recently proposed methods with the same pre-trained encoders.
arXiv Detail & Related papers (2022-08-09T04:33:46Z) - Nested Named Entity Recognition as Holistic Structure Parsing [92.8397338250383]
This work models the full nested NEs in a sentence as a holistic structure, then we propose a holistic structure parsing algorithm to disclose the entire NEs once for all.
Experiments show that our model yields promising results on widely-used benchmarks which approach or even achieve state-of-the-art.
arXiv Detail & Related papers (2022-04-17T12:48:20Z) - A Sequence-to-Set Network for Nested Named Entity Recognition [38.05786148160635]
We propose a novel sequence-to-set neural network for nested NER.
We use a non-autoregressive decoder to predict the final set of entities in one pass.
Experimental results show that our proposed model achieves state-of-the-art on three nested NER corpora.
arXiv Detail & Related papers (2021-05-19T03:10:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.