Fine-Grained Named Entity Typing over Distantly Supervised Data via
Refinement in Hyperbolic Space
- URL: http://arxiv.org/abs/2101.11212v1
- Date: Wed, 27 Jan 2021 05:39:05 GMT
- Title: Fine-Grained Named Entity Typing over Distantly Supervised Data via
Refinement in Hyperbolic Space
- Authors: Muhammad Asif Ali, Yifang Sun, Bing Li, Wei Wang
- Abstract summary: FGNET-HR is a novel framework that benefits from the hyperbolic geometry in combination with the graph structures to perform entity typing.
Experimentation using different benchmark datasets shows that FGNET-HR improves the performance on FG-NET by up to 3.5% in terms of strict accuracy.
- Score: 17.38820879842774
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Fine-Grained Named Entity Typing (FG-NET) aims at classifying the entity
mentions into a wide range of entity types (usually hundreds) depending upon
the context. While distant supervision is the most common way to acquire
supervised training data, it brings in label noise, as it assigns type labels
to the entity mentions irrespective of mentions' context. In attempts to deal
with the label noise, leading research on the FG-NET assumes that the
fine-grained entity typing data possesses a euclidean nature, which restraints
the ability of the existing models in combating the label noise. Given the fact
that the fine-grained type hierarchy exhibits a hierarchal structure, it makes
hyperbolic space a natural choice to model the FG-NET data. In this research,
we propose FGNET-HR, a novel framework that benefits from the hyperbolic
geometry in combination with the graph structures to perform entity typing in a
performance-enhanced fashion. FGNET-HR initially uses LSTM networks to encode
the mention in relation with its context, later it forms a graph to
distill/refine the mention's encodings in the hyperbolic space. Finally, the
refined mention encoding is used for entity typing. Experimentation using
different benchmark datasets shows that FGNET-HR improves the performance on
FG-NET by up to 3.5% in terms of strict accuracy.
Related papers
- Seed-Guided Fine-Grained Entity Typing in Science and Engineering
Domains [51.02035914828596]
We study the task of seed-guided fine-grained entity typing in science and engineering domains.
We propose SEType which first enriches the weak supervision by finding more entities for each seen type from an unlabeled corpus.
It then matches the enriched entities to unlabeled text to get pseudo-labeled samples and trains a textual entailment model that can make inferences for both seen and unseen types.
arXiv Detail & Related papers (2024-01-23T22:36:03Z) - Contrastive Meta-Learning for Few-shot Node Classification [54.36506013228169]
Few-shot node classification aims to predict labels for nodes on graphs with only limited labeled nodes as references.
We create a novel contrastive meta-learning framework on graphs, named COSMIC, with two key designs.
arXiv Detail & Related papers (2023-06-27T02:22:45Z) - A Fused Gromov-Wasserstein Framework for Unsupervised Knowledge Graph
Entity Alignment [22.526341223786375]
In this paper, we introduce FGWEA, an unsupervised entity alignment framework that leverages the Fused Gromov-Wasserstein (FGW) distance.
We show that FGWEA surpasses 21 competitive baselines, including cutting-edge supervised entity alignment methods.
arXiv Detail & Related papers (2023-05-11T05:17:54Z) - Grounded Keys-to-Text Generation: Towards Factual Open-Ended Generation [92.1582872870226]
We propose a new grounded keys-to-text generation task.
The task is to generate a factual description about an entity given a set of guiding keys, and grounding passages.
Inspired by recent QA-based evaluation measures, we propose an automatic metric, MAFE, for factual correctness of generated descriptions.
arXiv Detail & Related papers (2022-12-04T23:59:41Z) - Modeling Fine-grained Information via Knowledge-aware Hierarchical Graph
for Zero-shot Entity Retrieval [11.533614615010643]
We propose GER to capture more fine-grained information as complementary to sentence embeddings.
We learn the fine-grained information about mention/entity by aggregating information from these knowledge units.
Experimental results on popular benchmarks demonstrate that our proposed GER framework performs better than previous state-of-the-art models.
arXiv Detail & Related papers (2022-11-20T14:37:53Z) - Entity Type Prediction Leveraging Graph Walks and Entity Descriptions [4.147346416230273]
textitGRAND is a novel approach for entity typing leveraging different graph walk strategies in RDF2vec together with textual entity descriptions.
The proposed approach outperforms the baseline approaches on the benchmark datasets DBpedia and FIGER for entity typing in KGs for both fine-grained and coarse-grained classes.
arXiv Detail & Related papers (2022-07-28T13:56:55Z) - A Robust Stacking Framework for Training Deep Graph Models with
Multifaceted Node Features [61.92791503017341]
Graph Neural Networks (GNNs) with numerical node features and graph structure as inputs have demonstrated superior performance on various supervised learning tasks with graph data.
The best models for such data types in most standard supervised learning settings with IID (non-graph) data are not easily incorporated into a GNN.
Here we propose a robust stacking framework that fuses graph-aware propagation with arbitrary models intended for IID data.
arXiv Detail & Related papers (2022-06-16T22:46:33Z) - PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering
Network [54.03560668182197]
We propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time.
With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations.
Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed.
arXiv Detail & Related papers (2021-04-12T13:27:34Z) - Autoregressive Entity Retrieval [55.38027440347138]
Entities are at the center of how we represent and aggregate knowledge.
The ability to retrieve such entities given a query is fundamental for knowledge-intensive tasks such as entity linking and open-domain question answering.
We propose GENRE, the first system that retrieves entities by generating their unique names, left to right, token-by-token in an autoregressive fashion.
arXiv Detail & Related papers (2020-10-02T10:13:31Z) - Fine-Grained Named Entity Typing over Distantly Supervised Data Based on
Refined Representations [16.30478830298353]
Fine-Grained Named Entity Typing (FG-NET) is a key component in Natural Language Processing (NLP)
We propose an edge-weighted attentive graph convolution network that refines the noisy mention representations by attending over corpus-level contextual clues prior to the end classification.
Experimental evaluation shows that the proposed model outperforms the existing research by a relative score of upto 10.2% and 8.3% for macro f1 and micro f1 respectively.
arXiv Detail & Related papers (2020-04-07T17:26:36Z) - MZET: Memory Augmented Zero-Shot Fine-grained Named Entity Typing [11.88688584631821]
Named entity typing (NET) is a classification task of assigning an entity mention in the context with given semantic types.
We propose MZET, a novel memory augmented FNET model, to tackle the unseen types in a zero-shot manner.
MZET incorporates character-level, word-level, and contextural-level information to learn the entity mention representation.
arXiv Detail & Related papers (2020-04-02T21:17:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.