Named entity recognition architecture combining contextual and global
features
- URL: http://arxiv.org/abs/2112.08033v1
- Date: Wed, 15 Dec 2021 10:54:36 GMT
- Title: Named entity recognition architecture combining contextual and global
features
- Authors: Tran Thi Hong Hanh, Antoine Doucet, Nicolas Sidere, Jose G. Moreno,
and Senja Pollak
- Abstract summary: Named entity recognition (NER) is an information extraction technique that aims to locate and classify named entities.
We propose the combination of contextual features from XLNet and global features from Graph Convolution Network (GCN) to enhance NER performance.
- Score: 5.92351086183376
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Named entity recognition (NER) is an information extraction technique that
aims to locate and classify named entities (e.g., organizations, locations,...)
within a document into predefined categories. Correctly identifying these
phrases plays a significant role in simplifying information access. However, it
remains a difficult task because named entities (NEs) have multiple forms and
they are context-dependent. While the context can be represented by contextual
features, global relations are often misrepresented by those models. In this
paper, we propose the combination of contextual features from XLNet and global
features from Graph Convolution Network (GCN) to enhance NER performance.
Experiments over a widely-used dataset, CoNLL 2003, show the benefits of our
strategy, with results competitive with the state of the art (SOTA).
Related papers
- Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach [56.55633052479446]
Web-scale visual entity recognition presents significant challenges due to the lack of clean, large-scale training data.
We propose a novel methodology to curate such a dataset, leveraging a multimodal large language model (LLM) for label verification, metadata generation, and rationale explanation.
Experiments demonstrate that models trained on this automatically curated data achieve state-of-the-art performance on web-scale visual entity recognition tasks.
arXiv Detail & Related papers (2024-10-31T06:55:24Z) - Retrieval-Enhanced Named Entity Recognition [1.2187048691454239]
RENER is a technique for named entity recognition using autoregressive language models based on In-Context Learning and information retrieval techniques.
Experimental results show that in the CrossNER collection we achieve state-of-the-art performance with the proposed technique.
arXiv Detail & Related papers (2024-10-17T01:12:48Z) - Bridging Local Details and Global Context in Text-Attributed Graphs [62.522550655068336]
GraphBridge is a framework that bridges local and global perspectives by leveraging contextual textual information.
Our method achieves state-of-theart performance, while our graph-aware token reduction module significantly enhances efficiency and solves scalability issues.
arXiv Detail & Related papers (2024-06-18T13:35:25Z) - SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding [55.48936731641802]
We present the SRFUND, a hierarchically structured multi-task form understanding benchmark.
SRFUND provides refined annotations on top of the original FUNSD and XFUND datasets.
The dataset includes eight languages including English, Chinese, Japanese, German, French, Spanish, Italian, and Portuguese.
arXiv Detail & Related papers (2024-06-13T02:35:55Z) - The Role of Global and Local Context in Named Entity Recognition [3.1638713158723686]
This article explores the impact of global document context, and its relationships with local context.
We find that correctly retrieving global document context has a greater impact on performance than only leveraging local context.
arXiv Detail & Related papers (2023-05-04T20:22:18Z) - Modeling Entities as Semantic Points for Visual Information Extraction
in the Wild [55.91783742370978]
We propose an alternative approach to precisely and robustly extract key information from document images.
We explicitly model entities as semantic points, i.e., center points of entities are enriched with semantic information describing the attributes and relationships of different entities.
The proposed method can achieve significantly enhanced performance on entity labeling and linking, compared with previous state-of-the-art models.
arXiv Detail & Related papers (2023-03-23T08:21:16Z) - Disambiguation of Company names via Deep Recurrent Networks [101.90357454833845]
We propose a Siamese LSTM Network approach to extract -- via supervised learning -- an embedding of company name strings.
We analyse how an Active Learning approach to prioritise the samples to be labelled leads to a more efficient overall learning pipeline.
arXiv Detail & Related papers (2023-03-07T15:07:57Z) - Dynamic Named Entity Recognition [5.9401550252715865]
We introduce a new task: Dynamic Named Entity Recognition (DNER)
DNER provides a framework to better evaluate the ability of algorithms to extract entities by exploiting the context.
We evaluate baseline models and present experiments reflecting issues and research axes related to this novel task.
arXiv Detail & Related papers (2023-02-16T15:50:02Z) - The Fellowship of the Authors: Disambiguating Names from Social Network
Context [2.3605348648054454]
Authority lists with extensive textual descriptions for each entity are lacking and ambiguous named entities.
We combine BERT-based mention representations with a variety of graph induction strategies and experiment with supervised and unsupervised cluster inference methods.
We find that in-domain language model pretraining can significantly improve mention representations, especially for larger corpora.
arXiv Detail & Related papers (2022-08-31T21:51:55Z) - Global Attention for Name Tagging [56.62059996864408]
We present a new framework to improve name tagging by utilizing local, document-level, and corpus-level contextual information.
We propose a model that learns to incorporate document-level and corpus-level contextual information alongside local contextual information via global attentions.
Experiments on benchmark datasets show the effectiveness of our approach.
arXiv Detail & Related papers (2020-10-19T07:27:15Z) - Global-to-Local Neural Networks for Document-Level Relation Extraction [11.900280120655898]
Relation extraction (RE) aims to identify the semantic relations between named entities in text.
Recent years have witnessed it raised to the document level, which requires complex reasoning with entities and mentions throughout an entire document.
We propose a novel model to document-level RE, by encoding the document information in terms of entity global and local representations as well as context relation representations.
arXiv Detail & Related papers (2020-09-22T07:30:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.