Nested and Balanced Entity Recognition using Multi-Task Learning
- URL: http://arxiv.org/abs/2106.06216v1
- Date: Fri, 11 Jun 2021 07:52:32 GMT
- Title: Nested and Balanced Entity Recognition using Multi-Task Learning
- Authors: Andreas Waldis and Luca Mazzola
- Abstract summary: This paper introduces a partly-layered network architecture that deals with the complexity of overlapping and nested cases.
We train and evaluate this architecture to recognise two kinds of entities - Concepts (CR) and Named Entities (NER)
Our approach achieves state-of-the-art NER performances, while it outperforms previous CR approaches.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Entity Recognition (ER) within a text is a fundamental exercise in Natural
Language Processing, enabling further depending tasks such as Knowledge
Extraction, Text Summarisation, or Keyphrase Extraction. An entity consists of
single words or of a consecutive sequence of terms, constituting the basic
building blocks for communication. Mainstream ER approaches are mainly limited
to flat structures, concentrating on the outermost entities while ignoring the
inner ones. This paper introduces a partly-layered network architecture that
deals with the complexity of overlapping and nested cases. The proposed
architecture consists of two parts: (1) a shared Sequence Layer and (2) a
stacked component with multiple Tagging Layers. The adoption of such an
architecture has the advantage of preventing overfit to a specific word-length,
thus maintaining performance for longer entities despite their lower frequency.
To verify the proposed architecture's effectiveness, we train and evaluate this
architecture to recognise two kinds of entities - Concepts (CR) and Named
Entities (NER). Our approach achieves state-of-the-art NER performances, while
it outperforms previous CR approaches. Considering these promising results, we
see the possibility to evolve the architecture for other cases such as the
extraction of events or the detection of argumentative components.
Related papers
- Task Agnostic Architecture for Algorithm Induction via Implicit Composition [10.627575117586417]
This position paper aims to explore developing such a unified architecture and proposes a theoretical framework of how it could be constructed.
Recent Generative AI, especially Transformer-based models, demonstrate potential as an architecture capable of constructing algorithms for a wide range of domains.
Our exploration delves into current capabilities and limitations of Transformer-based and other methods in efficient and correct algorithm composition.
arXiv Detail & Related papers (2024-04-03T04:31:09Z) - Named Entity Recognition via Machine Reading Comprehension: A Multi-Task
Learning Approach [50.12455129619845]
Named Entity Recognition (NER) aims to extract and classify entity mentions in the text into pre-defined types.
We propose to incorporate the label dependencies among entity types into a multi-task learning framework for better MRC-based NER.
arXiv Detail & Related papers (2023-09-20T03:15:05Z) - Semantics-Aware Dynamic Localization and Refinement for Referring Image
Segmentation [102.25240608024063]
Referring image segments an image from a language expression.
We develop an algorithm that shifts from being localization-centric to segmentation-language.
Compared to its counterparts, our method is more versatile yet effective.
arXiv Detail & Related papers (2023-03-11T08:42:40Z) - Few-Shot Nested Named Entity Recognition [4.8693196802491405]
This paper is the first one dedicated to studying the few-shot nested NER task.
We propose a Biaffine-based Contrastive Learning (BCL) framework to learn contextual dependency to distinguish nested entities.
The BCL outperformed three baseline models on the 1-shot and 5-shot tasks in terms of F1 score.
arXiv Detail & Related papers (2022-12-02T03:42:23Z) - ReSel: N-ary Relation Extraction from Scientific Text and Tables by
Learning to Retrieve and Select [53.071352033539526]
We study the problem of extracting N-ary relations from scientific articles.
Our proposed method ReSel decomposes this task into a two-stage procedure.
Our experiments on three scientific information extraction datasets show that ReSel outperforms state-of-the-art baselines significantly.
arXiv Detail & Related papers (2022-10-26T02:28:02Z) - A Unified Architecture of Semantic Segmentation and Hierarchical
Generative Adversarial Networks for Expression Manipulation [52.911307452212256]
We develop a unified architecture of semantic segmentation and hierarchical GANs.
A unique advantage of our framework is that on forward pass the semantic segmentation network conditions the generative model.
We evaluate our method on two challenging facial expression translation benchmarks, AffectNet and RaFD, and a semantic segmentation benchmark, CelebAMask-HQ.
arXiv Detail & Related papers (2021-12-08T22:06:31Z) - BoningKnife: Joint Entity Mention Detection and Typing for Nested NER
via prior Boundary Knowledge [1.5149438988761574]
We propose a joint entity mention detection and typing model via prior boundary knowledge (BoningKnife) to better handle nested NER extraction and recognition tasks.
BoningKnife consists of two modules, MentionTagger and TypeClassifier.
Experiments over different datasets show that our approach outperforms previous state of the art methods and achieves 86.41, 85.46, and 94.2 F1 scores on ACE2004, ACE2005, and NNE, respectively.
arXiv Detail & Related papers (2021-07-20T11:44:36Z) - Recurrent Neural Networks with Mixed Hierarchical Structures for Natural
Language Processing [13.960152426268767]
Hierarchical structures exist in both linguistics and Natural Language Processing (NLP) tasks.
How to design RNNs to learn hierarchical representations of natural languages remains a long-standing challenge.
In this paper, we define two different types of boundaries referred to as static and dynamic boundaries, respectively, and then use them to construct a multi-layer hierarchical structure for document classification tasks.
arXiv Detail & Related papers (2021-06-04T15:50:42Z) - Boosting Span-based Joint Entity and Relation Extraction via Squence
Tagging Mechanism [10.894755638322]
Span-based joint extraction simultaneously conducts named entity recognition (NER) and relation extraction (RE) in text span form.
Recent studies have shown that token labels can convey crucial task-specific information and enrich token semantics.
We pro-pose Sequence Tagging enhanced Span-based Network (STSN), a span-based joint extrac-tion network that is enhanced by token BIO label information.
arXiv Detail & Related papers (2021-05-21T01:10:03Z) - Probing Linguistic Features of Sentence-Level Representations in Neural
Relation Extraction [80.38130122127882]
We introduce 14 probing tasks targeting linguistic properties relevant to neural relation extraction (RE)
We use them to study representations learned by more than 40 different encoder architecture and linguistic feature combinations trained on two datasets.
We find that the bias induced by the architecture and the inclusion of linguistic features are clearly expressed in the probing task performance.
arXiv Detail & Related papers (2020-04-17T09:17:40Z) - A Dependency Syntactic Knowledge Augmented Interactive Architecture for
End-to-End Aspect-based Sentiment Analysis [73.74885246830611]
We propose a novel dependency syntactic knowledge augmented interactive architecture with multi-task learning for end-to-end ABSA.
This model is capable of fully exploiting the syntactic knowledge (dependency relations and types) by leveraging a well-designed Dependency Relation Embedded Graph Convolutional Network (DreGcn)
Extensive experimental results on three benchmark datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2020-04-04T14:59:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.