Local Hypergraph-based Nested Named Entity Recognition as Query-based
Sequence Labeling
- URL: http://arxiv.org/abs/2204.11467v1
- Date: Mon, 25 Apr 2022 06:56:49 GMT
- Title: Local Hypergraph-based Nested Named Entity Recognition as Query-based
Sequence Labeling
- Authors: Yukun Yan, Sen Song
- Abstract summary: We propose a novel local hypergraph-based method to recognize nested named entities.
Our method is free of the high computation cost of span sampling and the risk of losing long entities.
It achieves a new state-of-the-art F1 score on the ACE 2004 dataset and competitive F1 scores with previous state-of-the-art methods on three other nested NER datasets: ACE 2005, GENIA, and KBP 2017.
- Score: 5.134435281973136
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: There has been a growing academic interest in the recognition of nested named
entities in many domains. We tackle the task with a novel local
hypergraph-based method: We first propose start token candidates and generate
corresponding queries with their surrounding context, then use a query-based
sequence labeling module to form a local hypergraph for each candidate. An end
token estimator is used to correct the hypergraphs and get the final
predictions. Compared to span-based approaches, our method is free of the high
computation cost of span sampling and the risk of losing long entities.
Sequential prediction makes it easier to leverage information in word order
inside nested structures, and richer representations are built with a local
hypergraph. Experiments show that our proposed method outperforms all the
previous hypergraph-based and sequence labeling approaches with large margins
on all four nested datasets. It achieves a new state-of-the-art F1 score on the
ACE 2004 dataset and competitive F1 scores with previous state-of-the-art
methods on three other nested NER datasets: ACE 2005, GENIA, and KBP 2017.
Related papers
- From Hypergraph Energy Functions to Hypergraph Neural Networks [94.88564151540459]
We present an expressive family of parameterized, hypergraph-regularized energy functions.
We then demonstrate how minimizers of these energies effectively serve as node embeddings.
We draw parallels between the proposed bilevel hypergraph optimization, and existing GNN architectures in common use.
arXiv Detail & Related papers (2023-06-16T04:40:59Z) - Optimizing Bi-Encoder for Named Entity Recognition via Contrastive
Learning [80.36076044023581]
We present an efficient bi-encoder framework for named entity recognition (NER)
We frame NER as a metric learning problem that maximizes the similarity between the vector representations of an entity mention and its type.
A major challenge to this bi-encoder formulation for NER lies in separating non-entity spans from entity mentions.
arXiv Detail & Related papers (2022-08-30T23:19:04Z) - Autoregressive Search Engines: Generating Substrings as Document
Identifiers [53.0729058170278]
Autoregressive language models are emerging as the de-facto standard for generating answers.
Previous work has explored ways to partition the search space into hierarchical structures.
In this work we propose an alternative that doesn't force any structure in the search space: using all ngrams in a passage as its possible identifiers.
arXiv Detail & Related papers (2022-04-22T10:45:01Z) - Progressive End-to-End Object Detection in Crowded Scenes [96.92416613336096]
Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be inferred for a single object, typically in crowded scenes; second, the performance saturates as the depth of the decoding stage increases.
We propose a progressive predicting method to address the above issues. Specifically, we first select accepted queries to generate true positive predictions, then refine the rest noisy queries according to the previously accepted predictions.
Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes.
arXiv Detail & Related papers (2022-03-15T06:12:00Z) - Reinforcement Learning Based Query Vertex Ordering Model for Subgraph
Matching [58.39970828272366]
Subgraph matching algorithms enumerate all is embeddings of a query graph in a data graph G.
matching order plays a critical role in time efficiency of these backtracking based subgraph matching algorithms.
In this paper, for the first time we apply the Reinforcement Learning (RL) and Graph Neural Networks (GNNs) techniques to generate the high-quality matching order for subgraph matching algorithms.
arXiv Detail & Related papers (2022-01-25T00:10:03Z) - Bottom-Up Constituency Parsing and Nested Named Entity Recognition with
Pointer Networks [24.337440797369702]
Constituency parsing and nested named entity recognition (NER) are typical textitnested structured prediction tasks.
We propose a novel global pointing mechanism for bottom-up parsing with pointer networks to do both tasks, which needs linear steps to parse.
Our method obtains the state-of-the-art performance on PTB among all BERT-based models (96.01 F1 score) and competitive performance on CTB7 in constituency parsing.
arXiv Detail & Related papers (2021-10-11T17:01:43Z) - Exploiting Global Contextual Information for Document-level Named Entity
Recognition [46.99922251839363]
We propose a model called Global Context enhanced Document-level NER (GCDoc)
At word-level, a document graph is constructed to model a wider range of dependencies between words.
At sentence-level, for appropriately modeling wider context beyond single sentence, we employ a cross-sentence module.
Our model reaches F1 score of 92.22 (93.40 with BERT) on CoNLL 2003 dataset and 88.32 (90.49 with BERT) on Ontonotes 5.0 dataset.
arXiv Detail & Related papers (2021-06-02T01:52:07Z) - Cross-domain Speech Recognition with Unsupervised Character-level
Distribution Matching [60.8427677151492]
We propose CMatch, a Character-level distribution matching method to perform fine-grained adaptation between each character in two domains.
Experiments on the Libri-Adapt dataset show that our proposed approach achieves 14.39% and 16.50% relative Word Error Rate (WER) reduction on both cross-device and cross-environment ASR.
arXiv Detail & Related papers (2021-04-15T14:36:54Z) - Community detection, pattern recognition, and hypergraph-based learning:
approaches using metric geometry and persistent homology [1.3477333339913569]
We introduce a new topological structure to hypergraph data which bears a resemblance to a usual metric space structure.
Using this new topological space structure of hypergraph data, we propose several approaches to study community detection problem.
We then apply our modified nearest neighbors methods to study sign prediction problem in hypegraph data constructed using our method.
arXiv Detail & Related papers (2020-09-29T21:20:12Z) - Hypergraph Random Walks, Laplacians, and Clustering [9.488853155989615]
We propose a flexible framework for clustering hypergraph-structured data based on recently proposed random walks.
We show that the proposed methods produce higher-quality clusters and conclude by highlighting avenues for future work.
arXiv Detail & Related papers (2020-06-29T20:58:15Z) - Minimizing Localized Ratio Cut Objectives in Hypergraphs [32.80813008862995]
We present a framework for local hypergraph clustering based on minimizing localized ratio cut objectives.
Our algorithm is strongly-local, meaning that its runtime depends only on the size of the input set, and does not need to explore the entire hypergraph to find good local clusters.
arXiv Detail & Related papers (2020-02-21T17:42:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.