Modeling Instance Interactions for Joint Information Extraction with
Neural High-Order Conditional Random Field
- URL: http://arxiv.org/abs/2212.08929v2
- Date: Sun, 28 May 2023 09:48:24 GMT
- Title: Modeling Instance Interactions for Joint Information Extraction with
Neural High-Order Conditional Random Field
- Authors: Zixia Jia, Zhaohui Yan, Wenjuan Han, Zilong Zheng, Kewei Tu
- Abstract summary: We introduce a joint IE framework (CRFIE) that formulates joint IE as a high-order Conditional Random Field.
Specifically, we design binary factors and ternary factors to directly model interactions between not only a pair of instances but also triplets.
We incorporate a high-order neural decoder that is unfolded from a mean-field variational inference method.
- Score: 39.055053720433435
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Prior works on joint Information Extraction (IE) typically model instance
(e.g., event triggers, entities, roles, relations) interactions by
representation enhancement, type dependencies scoring, or global decoding. We
find that the previous models generally consider binary type dependency scoring
of a pair of instances, and leverage local search such as beam search to
approximate global solutions. To better integrate cross-instance interactions,
in this work, we introduce a joint IE framework (CRFIE) that formulates joint
IE as a high-order Conditional Random Field. Specifically, we design binary
factors and ternary factors to directly model interactions between not only a
pair of instances but also triplets. Then, these factors are utilized to
jointly predict labels of all instances. To address the intractability problem
of exact high-order inference, we incorporate a high-order neural decoder that
is unfolded from a mean-field variational inference method, which achieves
consistent learning and inference. The experimental results show that our
approach achieves consistent improvements on three IE tasks compared with our
baseline and prior work.
Related papers
- A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap [50.079224604394]
We present a novel model-agnostic framework called textbfContext-textbfEnhanced textbfFeature textbfAment (CEFA)
CEFA consists of a feature alignment module and a context enhancement module.
Our method can serve as a plug-and-play module to improve the detection performance of HOI models on rare categories.
arXiv Detail & Related papers (2024-07-31T08:42:48Z) - Interaction Event Forecasting in Multi-Relational Recursive HyperGraphs: A Temporal Point Process Approach [12.142292322071299]
This work addresses the problem of forecasting higher-order interaction events in multi-relational recursive hypergraphs.
The proposed model, textitRelational Recursive Hyperedge Temporal Point Process (RRHyperTPP), uses an encoder that learns a dynamic node representation based on the historical interaction patterns.
We have experimentally shown that our models perform better than previous state-of-the-art methods for interaction forecasting.
arXiv Detail & Related papers (2024-04-27T15:46:54Z) - DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation [14.267849773487834]
Multi-person pose estimation (MPPE) presents a formidable yet crucial challenge in computer vision.
This paper introduces a novel CNN-based single-stage method, named Dual-path Hierarchical Relation Network (DHRNet), to extract instance-to-joint and joint-to-instance interactions concurrently.
arXiv Detail & Related papers (2024-04-22T09:41:03Z) - Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection [54.041049052843604]
We present STEMD, a novel end-to-end framework that enhances the DETR-like paradigm for multi-frame 3D object detection.
First, to model the inter-object spatial interaction and complex temporal dependencies, we introduce the spatial-temporal graph attention network.
Finally, it poses a challenge for the network to distinguish between the positive query and other highly similar queries that are not the best match.
arXiv Detail & Related papers (2023-07-01T13:53:14Z) - Generalization Properties of Retrieval-based Models [50.35325326050263]
Retrieval-based machine learning methods have enjoyed success on a wide range of problems.
Despite growing literature showcasing the promise of these models, the theoretical underpinning for such models remains underexplored.
We present a formal treatment of retrieval-based models to characterize their generalization ability.
arXiv Detail & Related papers (2022-10-06T00:33:01Z) - OneRel:Joint Entity and Relation Extraction with One Module in One Step [42.576188878294886]
Joint entity and relation extraction is an essential task in natural language processing and knowledge graph construction.
We propose a novel joint entity and relation extraction model, named OneRel, which casts joint extraction as a fine-grained triple classification problem.
arXiv Detail & Related papers (2022-03-10T15:09:59Z) - Uncovering Main Causalities for Long-tailed Information Extraction [14.39860866665021]
Long-tailed distributions caused by the selection bias of a dataset may lead to incorrect correlations.
This motivates us to propose counterfactual IE (CFIE), a novel framework that aims to uncover the main causalities behind data.
arXiv Detail & Related papers (2021-09-11T08:08:24Z) - A Universal Model for Cross Modality Mapping by Relational Reasoning [29.081989993636338]
Cross modality mapping has attracted growing attention in the computer vision community.
We propose a GCN-based Reasoning Network (RR-Net) in which inter and intra relations are efficiently computed.
Experiments on three example tasks, i.e., image classification, social recommendation and sound recognition, clearly demonstrate the superiority and universality of our proposed model.
arXiv Detail & Related papers (2021-02-26T08:56:24Z) - DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act
Recognition and Sentiment Classification [77.59549450705384]
In dialog system, dialog act recognition and sentiment classification are two correlative tasks.
Most of the existing systems either treat them as separate tasks or just jointly model the two tasks.
We propose a Deep Co-Interactive Relation Network (DCR-Net) to explicitly consider the cross-impact and model the interaction between the two tasks.
arXiv Detail & Related papers (2020-08-16T14:13:32Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.