Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes
- URL: http://arxiv.org/abs/2211.10854v1
- Date: Sun, 20 Nov 2022 02:53:05 GMT
- Title: Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes
- Authors: Jiuding Yang, Jinwen Luo, Weidong Guo, Jerry Chen, Di Niu, Yu Xu
- Abstract summary: We propose Mulco, a novel method that can recognize named entities in nested structures through multiple scopes.
Based on ChiNesE, we propose Mulco, a novel method that can recognize named entities in nested structures through multiple scopes.
- Score: 18.359084133065817
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Nested Named Entity Recognition (NNER) has been a long-term challenge to
researchers as an important sub-area of Named Entity Recognition. NNER is where
one entity may be part of a longer entity, and this may happen on multiple
levels, as the term nested suggests. These nested structures make traditional
sequence labeling methods unable to properly recognize all entities. While
recent researches focus on designing better recognition methods for NNER in a
variety of languages, the Chinese NNER (CNNER) still lacks attention, where a
free-for-access, CNNER-specialized benchmark is absent. In this paper, we aim
to solve CNNER problems by providing a Chinese dataset and a learning-based
model to tackle the issue. To facilitate the research on this task, we release
ChiNesE, a CNNER dataset with 20,000 sentences sampled from online passages of
multiple domains, containing 117,284 entities failing in 10 categories, where
43.8 percent of those entities are nested. Based on ChiNesE, we propose Mulco,
a novel method that can recognize named entities in nested structures through
multiple scopes. Each scope use a designed scope-based sequence labeling
method, which predicts an anchor and the length of a named entity to recognize
it. Experiment results show that Mulco has outperformed several baseline
methods with the different recognizing schemes on ChiNesE. We also conduct
extensive experiments on ACE2005 Chinese corpus, where Mulco has achieved the
best performance compared with the baseline methods.
Related papers
- LLM-DER:A Named Entity Recognition Method Based on Large Language Models for Chinese Coal Chemical Domain [4.639851504108679]
We propose a Large Language Models (LLMs)-based entity recognition framework LLM-DER for the domain-specific entity recognition problem in Chinese.
LLMs-DER generates a list of relationships containing entity types through LLMs, and designs a plausibility and consistency evaluation method to remove misrecognized entities.
The experimental results of this paper on the Resume dataset and the self-constructed coal chemical dataset Coal show that LLM-DER performs outstandingly in domain-specific entity recognition.
arXiv Detail & Related papers (2024-09-16T08:28:05Z) - Composited-Nested-Learning with Data Augmentation for Nested Named Entity Recognition [5.188242370198818]
Nested Named Entity Recognition (NNER) focuses on addressing overlapped entity recognition.
Data augmentation is an effective approach to address the insufficient annotated corpus.
We propose Composited-Nested-Label Classification (CNLC) in which constituents are combined by nested-word and nested-label, to model nested entities.
arXiv Detail & Related papers (2024-06-18T16:46:18Z) - Named Entity Recognition via Machine Reading Comprehension: A Multi-Task
Learning Approach [50.12455129619845]
Named Entity Recognition (NER) aims to extract and classify entity mentions in the text into pre-defined types.
We propose to incorporate the label dependencies among entity types into a multi-task learning framework for better MRC-based NER.
arXiv Detail & Related papers (2023-09-20T03:15:05Z) - PromptNER: A Prompting Method for Few-shot Named Entity Recognition via
k Nearest Neighbor Search [56.81939214465558]
We propose PromptNER: a novel prompting method for few-shot NER via k nearest neighbor search.
We use prompts that contains entity category information to construct label prototypes, which enables our model to fine-tune with only the support set.
Our approach achieves excellent transfer learning ability, and extensive experiments on the Few-NERD and CrossNER datasets demonstrate that our model achieves superior performance over state-of-the-art methods.
arXiv Detail & Related papers (2023-05-20T15:47:59Z) - Gaussian Prior Reinforcement Learning for Nested Named Entity
Recognition [52.46740830977898]
We propose a novel seq2seq model named GPRL, which formulates the nested NER task as an entity triplet sequence generation process.
Experiments on three nested NER datasets demonstrate that GPRL outperforms previous nested NER models.
arXiv Detail & Related papers (2023-05-12T05:55:34Z) - CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual
Labeled Sequence Translation [113.99145386490639]
Cross-lingual NER can transfer knowledge between languages via aligned cross-lingual representations or machine translation results.
We propose a Cross-lingual Entity Projection framework (CROP) to enable zero-shot cross-lingual NER.
We adopt a multilingual labeled sequence translation model to project the tagged sequence back to the target language and label the target raw sentence.
arXiv Detail & Related papers (2022-10-13T13:32:36Z) - An Embarrassingly Easy but Strong Baseline for Nested Named Entity
Recognition [55.080101447586635]
We propose using Conal Neural Network (CNN) to model spatial relations in the score matrix.
Our model surpasses several recently proposed methods with the same pre-trained encoders.
arXiv Detail & Related papers (2022-08-09T04:33:46Z) - Trigger-GNN: A Trigger-Based Graph Neural Network for Nested Named
Entity Recognition [5.9049664765234295]
We propose a trigger-based graph neural network (Trigger-GNN) to leverage the nested NER.
It obtains the complementary annotation embeddings through entity trigger encoding and semantic matching.
It helps the model to learn and generalize more efficiently and cost-effectively.
arXiv Detail & Related papers (2022-04-12T04:15:39Z) - A Sequence-to-Set Network for Nested Named Entity Recognition [38.05786148160635]
We propose a novel sequence-to-set neural network for nested NER.
We use a non-autoregressive decoder to predict the final set of entities in one pass.
Experimental results show that our proposed model achieves state-of-the-art on three nested NER corpora.
arXiv Detail & Related papers (2021-05-19T03:10:04Z) - Locate and Label: A Two-stage Identifier for Nested Named Entity
Recognition [9.809157050048375]
We propose a two-stage entity identifier for named entity recognition.
First, we generate span proposals by filtering and boundary regression on the seed spans to locate the entities, and then label the boundary-adjusted span proposals with the corresponding categories.
Our method effectively utilizes the boundary information of entities and partially matched spans during training.
arXiv Detail & Related papers (2021-05-14T12:52:34Z) - Zero-Resource Cross-Domain Named Entity Recognition [68.83177074227598]
Existing models for cross-domain named entity recognition rely on numerous unlabeled corpus or labeled NER training data in target domains.
We propose a cross-domain NER model that does not use any external resources.
arXiv Detail & Related papers (2020-02-14T09:04:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.