Related papers: Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

URL: http://arxiv.org/abs/2204.11467v1
Date: Mon, 25 Apr 2022 06:56:49 GMT
Title: Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling
Authors: Yukun Yan, Sen Song
Abstract summary: We propose a novel local hypergraph-based method to recognize nested named entities. Our method is free of the high computation cost of span sampling and the risk of losing long entities. It achieves a new state-of-the-art F1 score on the ACE 2004 dataset and competitive F1 scores with previous state-of-the-art methods on three other nested NER datasets: ACE 2005, GENIA, and KBP 2017.
Score: 5.134435281973136
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: There has been a growing academic interest in the recognition of nested named entities in many domains. We tackle the task with a novel local hypergraph-based method: We first propose start token candidates and generate corresponding queries with their surrounding context, then use a query-based sequence labeling module to form a local hypergraph for each candidate. An end token estimator is used to correct the hypergraphs and get the final predictions. Compared to span-based approaches, our method is free of the high computation cost of span sampling and the risk of losing long entities. Sequential prediction makes it easier to leverage information in word order inside nested structures, and richer representations are built with a local hypergraph. Experiments show that our proposed method outperforms all the previous hypergraph-based and sequence labeling approaches with large margins on all four nested datasets. It achieves a new state-of-the-art F1 score on the ACE 2004 dataset and competitive F1 scores with previous state-of-the-art methods on three other nested NER datasets: ACE 2005, GENIA, and KBP 2017.

Related papers

Divide-Then-Rule: A Cluster-Driven Hierarchical Interpolator for Attribute-Missing Graphs [51.13363550716544]
Deep graph clustering is an unsupervised task aimed at partitioning nodes with incomplete attributes into distinct clusters.<n>Existing imputation methods for attribute-missing graphs often fail to account for the varying amounts of information available across node neighborhoods.<n>We propose Divide-Then-Rule Graph Completion (DTRGC) to address this issue.
arXiv Detail & Related papers (2025-07-12T03:33:19Z)
HyperQuery: Beyond Binary Link Prediction [0.7100520098029438]
We introduce a novel feature extraction technique using node level clustering and show how integrating data from node-level labels can improve system performance. Our self-supervised approach achieves significant improvement over state of the art baselines on several hyperedge prediction and knowledge hypergraph completion benchmarks.
arXiv Detail & Related papers (2025-01-13T22:46:24Z)
From Hypergraph Energy Functions to Hypergraph Neural Networks [94.88564151540459]
We present an expressive family of parameterized, hypergraph-regularized energy functions. We then demonstrate how minimizers of these energies effectively serve as node embeddings. We draw parallels between the proposed bilevel hypergraph optimization, and existing GNN architectures in common use.
arXiv Detail & Related papers (2023-06-16T04:40:59Z)
Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning [80.36076044023581]
We present an efficient bi-encoder framework for named entity recognition (NER) We frame NER as a metric learning problem that maximizes the similarity between the vector representations of an entity mention and its type. A major challenge to this bi-encoder formulation for NER lies in separating non-entity spans from entity mentions.
arXiv Detail & Related papers (2022-08-30T23:19:04Z)
Autoregressive Search Engines: Generating Substrings as Document Identifiers [53.0729058170278]
Autoregressive language models are emerging as the de-facto standard for generating answers. Previous work has explored ways to partition the search space into hierarchical structures. In this work we propose an alternative that doesn't force any structure in the search space: using all ngrams in a passage as its possible identifiers.
arXiv Detail & Related papers (2022-04-22T10:45:01Z)
Progressive End-to-End Object Detection in Crowded Scenes [96.92416613336096]
Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be inferred for a single object, typically in crowded scenes; second, the performance saturates as the depth of the decoding stage increases. We propose a progressive predicting method to address the above issues. Specifically, we first select accepted queries to generate true positive predictions, then refine the rest noisy queries according to the previously accepted predictions. Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes.
arXiv Detail & Related papers (2022-03-15T06:12:00Z)
Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching [58.39970828272366]
Subgraph matching algorithms enumerate all is embeddings of a query graph in a data graph G. matching order plays a critical role in time efficiency of these backtracking based subgraph matching algorithms. In this paper, for the first time we apply the Reinforcement Learning (RL) and Graph Neural Networks (GNNs) techniques to generate the high-quality matching order for subgraph matching algorithms.
arXiv Detail & Related papers (2022-01-25T00:10:03Z)
Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks [24.337440797369702]
Constituency parsing and nested named entity recognition (NER) are typical textitnested structured prediction tasks. We propose a novel global pointing mechanism for bottom-up parsing with pointer networks to do both tasks, which needs linear steps to parse. Our method obtains the state-of-the-art performance on PTB among all BERT-based models (96.01 F1 score) and competitive performance on CTB7 in constituency parsing.
arXiv Detail & Related papers (2021-10-11T17:01:43Z)
Exploiting Global Contextual Information for Document-level Named Entity Recognition [46.99922251839363]
We propose a model called Global Context enhanced Document-level NER (GCDoc) At word-level, a document graph is constructed to model a wider range of dependencies between words. At sentence-level, for appropriately modeling wider context beyond single sentence, we employ a cross-sentence module. Our model reaches F1 score of 92.22 (93.40 with BERT) on CoNLL 2003 dataset and 88.32 (90.49 with BERT) on Ontonotes 5.0 dataset.
arXiv Detail & Related papers (2021-06-02T01:52:07Z)
Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching [60.8427677151492]
We propose CMatch, a Character-level distribution matching method to perform fine-grained adaptation between each character in two domains. Experiments on the Libri-Adapt dataset show that our proposed approach achieves 14.39% and 16.50% relative Word Error Rate (WER) reduction on both cross-device and cross-environment ASR.
arXiv Detail & Related papers (2021-04-15T14:36:54Z)
Community detection, pattern recognition, and hypergraph-based learning: approaches using metric geometry and persistent homology [1.3477333339913569]
We introduce a new topological structure to hypergraph data which bears a resemblance to a usual metric space structure. Using this new topological space structure of hypergraph data, we propose several approaches to study community detection problem. We then apply our modified nearest neighbors methods to study sign prediction problem in hypegraph data constructed using our method.
arXiv Detail & Related papers (2020-09-29T21:20:12Z)
Hypergraph Random Walks, Laplacians, and Clustering [9.488853155989615]
We propose a flexible framework for clustering hypergraph-structured data based on recently proposed random walks. We show that the proposed methods produce higher-quality clusters and conclude by highlighting avenues for future work.
arXiv Detail & Related papers (2020-06-29T20:58:15Z)
Minimizing Localized Ratio Cut Objectives in Hypergraphs [32.80813008862995]
We present a framework for local hypergraph clustering based on minimizing localized ratio cut objectives. Our algorithm is strongly-local, meaning that its runtime depends only on the size of the input set, and does not need to explore the entire hypergraph to find good local clusters.
arXiv Detail & Related papers (2020-02-21T17:42:22Z)

This list is automatically generated from the titles and abstracts of the papers in this site.