Recognizing Nested Entities from Flat Supervision: A New NER Subtask,
Feasibility and Challenges
- URL: http://arxiv.org/abs/2211.00301v1
- Date: Tue, 1 Nov 2022 06:41:42 GMT
- Title: Recognizing Nested Entities from Flat Supervision: A New NER Subtask,
Feasibility and Challenges
- Authors: Enwei Zhu, Yiyang Liu, Ming Jin, Jinpeng Li
- Abstract summary: This study proposes a new subtask, nested-from-flat NER, which corresponds to a realistic application scenario.
We train span-based models and deliberately ignore the spans nested inside labeled entities, since these spans are possibly unlabeled entities.
With nested entities removed from the training data, our model achieves 54.8%, 54.2% and 41.1% F1 scores on the subset of spans within entities on ACE 2004, ACE 2005 and GENIA, respectively.
- Score: 3.614392310669357
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many recent named entity recognition (NER) studies criticize flat NER for its
non-overlapping assumption, and switch to investigating nested NER. However,
existing nested NER models heavily rely on training data annotated with nested
entities, while labeling such data is costly. This study proposes a new
subtask, nested-from-flat NER, which corresponds to a realistic application
scenario: given data annotated with flat entities only, one may still desire
the trained model capable of recognizing nested entities. To address this task,
we train span-based models and deliberately ignore the spans nested inside
labeled entities, since these spans are possibly unlabeled entities. With
nested entities removed from the training data, our model achieves 54.8%, 54.2%
and 41.1% F1 scores on the subset of spans within entities on ACE 2004, ACE
2005 and GENIA, respectively. This suggests the effectiveness of our approach
and the feasibility of the task. In addition, the model's performance on flat
entities is entirely unaffected. We further manually annotate the nested
entities in the test set of CoNLL 2003, creating a nested-from-flat NER
benchmark. Analysis results show that the main challenges stem from the data
and annotation inconsistencies between the flat and nested entities.
Related papers
- Composited-Nested-Learning with Data Augmentation for Nested Named Entity Recognition [5.188242370198818]
Nested Named Entity Recognition (NNER) focuses on addressing overlapped entity recognition.
Data augmentation is an effective approach to address the insufficient annotated corpus.
We propose Composited-Nested-Label Classification (CNLC) in which constituents are combined by nested-word and nested-label, to model nested entities.
arXiv Detail & Related papers (2024-06-18T16:46:18Z) - Gaussian Prior Reinforcement Learning for Nested Named Entity
Recognition [52.46740830977898]
We propose a novel seq2seq model named GPRL, which formulates the nested NER task as an entity triplet sequence generation process.
Experiments on three nested NER datasets demonstrate that GPRL outperforms previous nested NER models.
arXiv Detail & Related papers (2023-05-12T05:55:34Z) - Optimizing Bi-Encoder for Named Entity Recognition via Contrastive
Learning [80.36076044023581]
We present an efficient bi-encoder framework for named entity recognition (NER)
We frame NER as a metric learning problem that maximizes the similarity between the vector representations of an entity mention and its type.
A major challenge to this bi-encoder formulation for NER lies in separating non-entity spans from entity mentions.
arXiv Detail & Related papers (2022-08-30T23:19:04Z) - QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query
Attribute Value Extraction [57.56700153507383]
This paper proposes a unified query attribute value extraction system in e-commerce search named QUEACO.
For the NER phase, QUEACO adopts a novel teacher-student network, where a teacher network that is trained on the strongly-labeled data generates pseudo-labels.
For the AVN phase, we also leverage the weakly-labeled query-to-attribute behavior data to normalize surface form attribute values from queries into canonical forms from products.
arXiv Detail & Related papers (2021-08-19T03:24:23Z) - WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection [75.80075054706079]
We propose a weakly- and semi-supervised object detection framework (WSSOD)
An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images.
The proposed framework demonstrates remarkable performance on PASCAL-VOC and MSCOCO benchmark, achieving a high performance comparable to those obtained in fully-supervised settings.
arXiv Detail & Related papers (2021-05-21T11:58:50Z) - A Sequence-to-Set Network for Nested Named Entity Recognition [38.05786148160635]
We propose a novel sequence-to-set neural network for nested NER.
We use a non-autoregressive decoder to predict the final set of entities in one pass.
Experimental results show that our proposed model achieves state-of-the-art on three nested NER corpora.
arXiv Detail & Related papers (2021-05-19T03:10:04Z) - Locate and Label: A Two-stage Identifier for Nested Named Entity
Recognition [9.809157050048375]
We propose a two-stage entity identifier for named entity recognition.
First, we generate span proposals by filtering and boundary regression on the seed spans to locate the entities, and then label the boundary-adjusted span proposals with the corresponding categories.
Our method effectively utilizes the boundary information of entities and partially matched spans during training.
arXiv Detail & Related papers (2021-05-14T12:52:34Z) - Empirical Analysis of Unlabeled Entity Problem in Named Entity
Recognition [47.273602658066196]
In many scenarios, named entity recognition models severely suffer from unlabeled entity problem.
We propose a general approach, which can almost eliminate the misguidance brought by unlabeled entities.
Our model is robust to unlabeled entity problem and surpasses prior baselines.
arXiv Detail & Related papers (2020-12-10T02:53:59Z) - Named Entity Recognition as Dependency Parsing [16.544333689188246]
We use graph-based dependency parsing to provide our model a global view on the input via a biaffine model.
We show that the model works well for both nested and flat NER through evaluation on 8 corpora and achieving SoTA performance on all of them, with accuracy gains of up to 2.2 percentage points.
arXiv Detail & Related papers (2020-05-14T17:11:41Z) - Bipartite Flat-Graph Network for Nested Named Entity Recognition [94.91507634620133]
Bipartite flat-graph network (BiFlaG) for nested named entity recognition (NER)
We propose a novel bipartite flat-graph network (BiFlaG) for nested named entity recognition (NER)
arXiv Detail & Related papers (2020-05-01T15:14:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.