SU-NLP at SemEval-2022 Task 11: Complex Named Entity Recognition with
Entity Linking
- URL: http://arxiv.org/abs/2203.11841v1
- Date: Tue, 22 Mar 2022 16:09:34 GMT
- Title: SU-NLP at SemEval-2022 Task 11: Complex Named Entity Recognition with
Entity Linking
- Authors: Buse \c{C}ar{\i}k, Fatih Beyhan and Reyyan Yeniterzi
- Abstract summary: We developed an unsupervised entity linking pipeline that detects potential entity mentions with the help of Wikipedia.
Our results showed that our pipeline improved performance significantly, especially for complex entities in low-context settings.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper describes the system proposed by Sabanc{\i} University Natural
Language Processing Group in the SemEval-2022 MultiCoNER task. We developed an
unsupervised entity linking pipeline that detects potential entity mentions
with the help of Wikipedia and also uses the corresponding Wikipedia context to
help the classifier in finding the named entity type of that mention. Our
results showed that our pipeline improved performance significantly, especially
for complex entities in low-context settings.
Related papers
- PAI at SemEval-2023 Task 2: A Universal System for Named Entity
Recognition with External Entity Information [19.995198769980345]
The MultiCoNER II task aims to detect complex, ambiguous, and fine-grained named entities in low-context situations and noisy scenarios.
Our system retrieves entities with properties from the knowledge base (i.e. Wikipedia) for a given text, then retrieves entity information with the input sentence and feeds it into Transformer-based models.
arXiv Detail & Related papers (2023-05-10T12:40:48Z) - DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System
for Multilingual Named Entity Recognition [94.90258603217008]
The MultiCoNER RNum2 shared task aims to tackle multilingual named entity recognition (NER) in fine-grained and noisy scenarios.
Previous top systems in the MultiCoNER RNum1 either incorporate the knowledge bases or gazetteers.
We propose a unified retrieval-augmented system (U-RaNER) for fine-grained multilingual NER.
arXiv Detail & Related papers (2023-05-05T16:59:26Z) - Disambiguation of Company names via Deep Recurrent Networks [101.90357454833845]
We propose a Siamese LSTM Network approach to extract -- via supervised learning -- an embedding of company name strings.
We analyse how an Active Learning approach to prioritise the samples to be labelled leads to a more efficient overall learning pipeline.
arXiv Detail & Related papers (2023-03-07T15:07:57Z) - DAMO-NLP at NLPCC-2022 Task 2: Knowledge Enhanced Robust NER for Speech
Entity Linking [32.915297772110364]
Speech Entity Linking aims to recognize and disambiguate named entities in spoken languages.
Conventional methods suffer from the unfettered speech styles and the noisy transcripts generated by ASR systems.
We propose Knowledge Enhanced Named Entity Recognition (KENER), which focuses on improving robustness through painlessly incorporating proper knowledge in the entity recognition stage.
Our system achieves 1st place in Track 1 and 2nd place in Track 2 of NLPCC-2022 Shared Task 2.
arXiv Detail & Related papers (2022-09-27T06:43:56Z) - TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation
and ensemble to recognize complex Named Entities in Bangla [11.963792253163247]
We describe our contribution to SemEval 2022 Task 11 on identifying complex Named Entities.
We have leveraged the ensemble of multiple ELECTRA-based models that were exclusively pretrained on the Bangla language.
We will also present the outcomes of our experiments on architectural decisions, dataset augmentations, and post-competition findings.
arXiv Detail & Related papers (2022-04-21T08:40:17Z) - Nested Named Entity Recognition as Holistic Structure Parsing [92.8397338250383]
This work models the full nested NEs in a sentence as a holistic structure, then we propose a holistic structure parsing algorithm to disclose the entire NEs once for all.
Experiments show that our model yields promising results on widely-used benchmarks which approach or even achieve state-of-the-art.
arXiv Detail & Related papers (2022-04-17T12:48:20Z) - DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for
Multilingual Named Entity Recognition [94.1865071914727]
MultiCoNER aims at detecting semantically ambiguous named entities in short and low-context settings for multiple languages.
Our team DAMO-NLP proposes a knowledge-based system, where we build a multilingual knowledge base based on Wikipedia.
Given an input sentence, our system effectively retrieves related contexts from the knowledge base.
Our system wins 10 out of 13 tracks in the MultiCoNER shared task.
arXiv Detail & Related papers (2022-03-01T15:29:35Z) - LMN at SemEval-2022 Task 11: A Transformer-based System for English
Named Entity Recognition [0.0]
We present our participation in the English track of SemEval-2022 Task 11: Multilingual Complex Named Entity Recognition.
Inspired by the recent advances in pretrained Transformer language models, we propose a simple yet effective Transformer-based baseline for the task.
Our proposed approach shows competitive results in the leaderboard as we ranked 12 over 30 teams.
arXiv Detail & Related papers (2022-02-13T05:46:14Z) - Probing Linguistic Features of Sentence-Level Representations in Neural
Relation Extraction [80.38130122127882]
We introduce 14 probing tasks targeting linguistic properties relevant to neural relation extraction (RE)
We use them to study representations learned by more than 40 different encoder architecture and linguistic feature combinations trained on two datasets.
We find that the bias induced by the architecture and the inclusion of linguistic features are clearly expressed in the probing task performance.
arXiv Detail & Related papers (2020-04-17T09:17:40Z) - CASE: Context-Aware Semantic Expansion [68.30244980290742]
This paper defines and studies a new task called Context-Aware Semantic Expansion (CASE)
Given a seed term in a sentential context, we aim to suggest other terms that well fit the context as the seed.
We show that annotations for this task can be harvested at scale from existing corpora, in a fully automatic manner.
arXiv Detail & Related papers (2019-12-31T06:38:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.