Locate and Label: A Two-stage Identifier for Nested Named Entity
Recognition
- URL: http://arxiv.org/abs/2105.06804v1
- Date: Fri, 14 May 2021 12:52:34 GMT
- Title: Locate and Label: A Two-stage Identifier for Nested Named Entity
Recognition
- Authors: Yongliang Shen, Xinyin Ma, Zeqi Tan, Shuai Zhang, Wen Wang and Weiming
Lu
- Abstract summary: We propose a two-stage entity identifier for named entity recognition.
First, we generate span proposals by filtering and boundary regression on the seed spans to locate the entities, and then label the boundary-adjusted span proposals with the corresponding categories.
Our method effectively utilizes the boundary information of entities and partially matched spans during training.
- Score: 9.809157050048375
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Named entity recognition (NER) is a well-studied task in natural language
processing. Traditional NER research only deals with flat entities and ignores
nested entities. The span-based methods treat entity recognition as a span
classification task. Although these methods have the innate ability to handle
nested NER, they suffer from high computational cost, ignorance of boundary
information, under-utilization of the spans that partially match with entities,
and difficulties in long entity recognition. To tackle these issues, we propose
a two-stage entity identifier. First we generate span proposals by filtering
and boundary regression on the seed spans to locate the entities, and then
label the boundary-adjusted span proposals with the corresponding categories.
Our method effectively utilizes the boundary information of entities and
partially matched spans during training. Through boundary regression, entities
of any length can be covered theoretically, which improves the ability to
recognize long entities. In addition, many low-quality seed spans are filtered
out in the first stage, which reduces the time complexity of inference.
Experiments on nested NER datasets demonstrate that our proposed method
outperforms previous state-of-the-art models.
Related papers
- A Boundary Offset Prediction Network for Named Entity Recognition [9.885278527023532]
Named entity recognition (NER) is a fundamental task in natural language processing that aims to identify and classify named entities in text.
We propose a novel approach for NER, named the Boundary Offset Prediction Network (BOPN), which predicts the boundary offsets between candidate spans and their nearest entity spans.
Our method integrates entity type and span representations to generate type-aware boundary offsets instead of using entity types as detection targets.
arXiv Detail & Related papers (2023-10-23T05:04:07Z) - Named Entity Recognition via Machine Reading Comprehension: A Multi-Task
Learning Approach [50.12455129619845]
Named Entity Recognition (NER) aims to extract and classify entity mentions in the text into pre-defined types.
We propose to incorporate the label dependencies among entity types into a multi-task learning framework for better MRC-based NER.
arXiv Detail & Related papers (2023-09-20T03:15:05Z) - Gaussian Prior Reinforcement Learning for Nested Named Entity
Recognition [52.46740830977898]
We propose a novel seq2seq model named GPRL, which formulates the nested NER task as an entity triplet sequence generation process.
Experiments on three nested NER datasets demonstrate that GPRL outperforms previous nested NER models.
arXiv Detail & Related papers (2023-05-12T05:55:34Z) - IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named
Entity Recognition using Knowledge Bases [53.054598423181844]
We present a novel NER cascade approach comprising three steps.
We empirically demonstrate the significance of external knowledge bases in accurately classifying fine-grained and emerging entities.
Our system exhibits robust performance in the MultiCoNER2 shared task, even in the low-resource language setting.
arXiv Detail & Related papers (2023-04-20T20:30:34Z) - Nested Named Entity Recognition from Medical Texts: An Adaptive Shared
Network Architecture with Attentive CRF [53.55504611255664]
We propose a novel method, referred to as ASAC, to solve the dilemma caused by the nested phenomenon.
The proposed method contains two key modules: the adaptive shared (AS) part and the attentive conditional random field (ACRF) module.
Our model could learn better entity representations by capturing the implicit distinctions and relationships between different categories of entities.
arXiv Detail & Related papers (2022-11-09T09:23:56Z) - Optimizing Bi-Encoder for Named Entity Recognition via Contrastive
Learning [80.36076044023581]
We present an efficient bi-encoder framework for named entity recognition (NER)
We frame NER as a metric learning problem that maximizes the similarity between the vector representations of an entity mention and its type.
A major challenge to this bi-encoder formulation for NER lies in separating non-entity spans from entity mentions.
arXiv Detail & Related papers (2022-08-30T23:19:04Z) - Propose-and-Refine: A Two-Stage Set Prediction Network for Nested Named
Entity Recognition [13.010064498077863]
We present the Propose-and-Refine Network (PnRNet), a two-stage set prediction network for nested NER.
In the propose stage, we use a span-based predictor to generate some coarse entity predictions as entity proposals.
In the refine stage, proposals interact with each other, and richer contextual information is incorporated into the proposal representations.
Experiments show that PnRNet achieves state-of-the-art performance on four nested NER datasets and one flat NER dataset.
arXiv Detail & Related papers (2022-04-27T06:58:45Z) - Nested Named Entity Recognition as Latent Lexicalized Constituency
Parsing [29.705133932275892]
Recently, (Fu et al, 2021) adapt a span-based constituency to tackle nested NER.
In this work, we resort to more expressive structures, lexicalized constituency trees in which constituents are annotated by headwords.
We leverage the Eisner-Satta algorithm to perform partial marginalization and inference efficiently.
arXiv Detail & Related papers (2022-03-09T12:02:59Z) - BoningKnife: Joint Entity Mention Detection and Typing for Nested NER
via prior Boundary Knowledge [1.5149438988761574]
We propose a joint entity mention detection and typing model via prior boundary knowledge (BoningKnife) to better handle nested NER extraction and recognition tasks.
BoningKnife consists of two modules, MentionTagger and TypeClassifier.
Experiments over different datasets show that our approach outperforms previous state of the art methods and achieves 86.41, 85.46, and 94.2 F1 scores on ACE2004, ACE2005, and NNE, respectively.
arXiv Detail & Related papers (2021-07-20T11:44:36Z) - A Sequence-to-Set Network for Nested Named Entity Recognition [38.05786148160635]
We propose a novel sequence-to-set neural network for nested NER.
We use a non-autoregressive decoder to predict the final set of entities in one pass.
Experimental results show that our proposed model achieves state-of-the-art on three nested NER corpora.
arXiv Detail & Related papers (2021-05-19T03:10:04Z) - Autoregressive Entity Retrieval [55.38027440347138]
Entities are at the center of how we represent and aggregate knowledge.
The ability to retrieve such entities given a query is fundamental for knowledge-intensive tasks such as entity linking and open-domain question answering.
We propose GENRE, the first system that retrieves entities by generating their unique names, left to right, token-by-token in an autoregressive fashion.
arXiv Detail & Related papers (2020-10-02T10:13:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.