Understand the Dynamic World: An End-to-End Knowledge Informed Framework
for Open Domain Entity State Tracking
- URL: http://arxiv.org/abs/2304.13854v1
- Date: Wed, 26 Apr 2023 22:45:30 GMT
- Title: Understand the Dynamic World: An End-to-End Knowledge Informed Framework
for Open Domain Entity State Tracking
- Authors: Mingchen Li and Lifu Huang
- Abstract summary: Open domain entity state tracking aims to predict reasonable state changes of entities (i.e., [attribute] of [entity] was [before_state] and [after_state] afterwards) given the action descriptions.
It's challenging as the model needs to predict an arbitrary number of entity state changes caused by the action while most of the entities are implicitly relevant to the actions and their attributes as well as states are from open vocabularies.
We propose a novel end-to-end Knowledge Informed framework for open domain Entity State Tracking, namely KIEST, which explicitly retrieves the relevant entities and attributes from
- Score: 15.421012879083463
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Open domain entity state tracking aims to predict reasonable state changes of
entities (i.e., [attribute] of [entity] was [before_state] and [after_state]
afterwards) given the action descriptions. It's important to many reasoning
tasks to support human everyday activities. However, it's challenging as the
model needs to predict an arbitrary number of entity state changes caused by
the action while most of the entities are implicitly relevant to the actions
and their attributes as well as states are from open vocabularies. To tackle
these challenges, we propose a novel end-to-end Knowledge Informed framework
for open domain Entity State Tracking, namely KIEST, which explicitly retrieves
the relevant entities and attributes from external knowledge graph (i.e.,
ConceptNet) and incorporates them to autoregressively generate all the entity
state changes with a novel dynamic knowledge grained encoder-decoder framework.
To enforce the logical coherence among the predicted entities, attributes, and
states, we design a new constraint decoding strategy and employ a coherence
reward to improve the decoding process. Experimental results show that our
proposed KIEST framework significantly outperforms the strong baselines on the
public benchmark dataset OpenPI.
Related papers
- OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting [49.655711022673046]
OneNet is an innovative framework that utilizes the few-shot learning capabilities of Large Language Models (LLMs) without the need for fine-tuning.
OneNet is structured around three key components prompted by LLMs: (1) an entity reduction processor that simplifies inputs by summarizing and filtering out irrelevant entities, (2) a dual-perspective entity linker that combines contextual cues and prior knowledge for precise entity linking, and (3) an entity consensus judger that employs a unique consistency algorithm to alleviate the hallucination in the entity linking reasoning.
arXiv Detail & Related papers (2024-10-10T02:45:23Z) - DERA: Dense Entity Retrieval for Entity Alignment in Knowledge Graphs [3.500936203815729]
We propose a dense entity retrieval framework for Entity Alignment (EA)
We leverage language models to uniformly encode various features of entities and facilitate nearest entity search across Knowledge Graphs (KGs)
Our approach achieves state-of-the-art performance compared to existing EA methods.
arXiv Detail & Related papers (2024-08-02T10:12:42Z) - Coherent Entity Disambiguation via Modeling Topic and Categorical
Dependency [87.16283281290053]
Previous entity disambiguation (ED) methods adopt a discriminative paradigm, where prediction is made based on matching scores between mention context and candidate entities.
We propose CoherentED, an ED system equipped with novel designs aimed at enhancing the coherence of entity predictions.
We achieve new state-of-the-art results on popular ED benchmarks, with an average improvement of 1.3 F1 points.
arXiv Detail & Related papers (2023-11-06T16:40:13Z) - OpenPI-C: A Better Benchmark and Stronger Baseline for Open-Vocabulary
State Tracking [55.62705574507595]
OpenPI is the only dataset annotated for open-vocabulary state tracking.
We categorize 3 types of problems on the procedure level, step level and state change level respectively.
For the evaluation metric, we propose a cluster-based metric to fix the original metric's preference for repetition.
arXiv Detail & Related papers (2023-06-01T16:48:20Z) - OpenPI2.0: An Improved Dataset for Entity Tracking in Texts [36.84433853139042]
An earlier dataset, OpenPI, provided crowdsourced annotations of entity state changes in text.
We present an improved dataset, OpenPI2.0, where entities and attributes are fully canonicalized and additional entity salience annotations are added.
We show that using state changes of salient entities as a chain-of-thought prompt, downstream performance is improved on tasks such as question answering and classical planning.
arXiv Detail & Related papers (2023-05-24T00:57:35Z) - GE-Blender: Graph-Based Knowledge Enhancement for Blender [3.8841367260456487]
Unseen entities can have a large impact on the dialogue generation task.
We construct a graph by extracting entity nodes in them, enhancing the representation of the context.
We add the named entity tag prediction task to apply the problem that the unseen entity does not exist in the graph.
arXiv Detail & Related papers (2023-01-30T13:00:20Z) - Out of Context: A New Clue for Context Modeling of Aspect-based
Sentiment Analysis [54.735400754548635]
ABSA aims to predict the sentiment expressed in a review with respect to a given aspect.
The given aspect should be considered as a new clue out of context in the context modeling process.
We design several aspect-aware context encoders based on different backbones.
arXiv Detail & Related papers (2021-06-21T02:26:03Z) - Neural Production Systems [90.75211413357577]
Visual environments are structured, consisting of distinct objects or entities.
To partition images into entities, deep-learning researchers have proposed structural inductive biases.
We take inspiration from cognitive science and resurrect a classic approach, which consists of a set of rule templates.
This architecture achieves a flexible, dynamic flow of control and serves to factorize entity-specific and rule-based information.
arXiv Detail & Related papers (2021-03-02T18:53:20Z) - Towards Entity Alignment in the Open World: An Unsupervised Approach [29.337157862514204]
It is a pivotal step for integrating knowledge graphs (KGs) to increase knowledge coverage and quality.
State-of-the-art solutions tend to rely on labeled data for model training.
We offer an unsupervised framework that performs entity alignment in the open world.
arXiv Detail & Related papers (2021-01-26T03:10:24Z) - RuREBus: a Case Study of Joint Named Entity Recognition and Relation
Extraction from e-Government Domain [7.6462329126769815]
We show-case an application of information extraction methods, such as named entity recognition (NER) and relation extraction (RE) to a novel corpus, consisting of documents, issued by a state agency.
The main challenges of this corpus are: 1) the annotation scheme differs greatly from the one used for the general domain corpora, and 2) the documents are written in a language other than English.
arXiv Detail & Related papers (2020-10-29T20:56:15Z) - A Dependency Syntactic Knowledge Augmented Interactive Architecture for
End-to-End Aspect-based Sentiment Analysis [73.74885246830611]
We propose a novel dependency syntactic knowledge augmented interactive architecture with multi-task learning for end-to-end ABSA.
This model is capable of fully exploiting the syntactic knowledge (dependency relations and types) by leveraging a well-designed Dependency Relation Embedded Graph Convolutional Network (DreGcn)
Extensive experimental results on three benchmark datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2020-04-04T14:59:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.