Modeling Task Interactions in Document-Level Joint Entity and Relation
Extraction
- URL: http://arxiv.org/abs/2205.01909v1
- Date: Wed, 4 May 2022 06:18:28 GMT
- Title: Modeling Task Interactions in Document-Level Joint Entity and Relation
Extraction
- Authors: Liyan Xu, Jinho D. Choi
- Abstract summary: Graph Compatibility (GC) is designed to leverage task characteristics, bridging decisions of two tasks for direct task interference.
GC achieves the best performance by up to 2.3/5.1 F1 improvement over the baseline.
- Score: 20.548299226366193
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We target on the document-level relation extraction in an end-to-end setting,
where the model needs to jointly perform mention extraction, coreference
resolution (COREF) and relation extraction (RE) at once, and gets evaluated in
an entity-centric way. Especially, we address the two-way interaction between
COREF and RE that has not been the focus by previous work, and propose to
introduce explicit interaction namely Graph Compatibility (GC) that is
specifically designed to leverage task characteristics, bridging decisions of
two tasks for direct task interference. Our experiments are conducted on DocRED
and DWIE; in addition to GC, we implement and compare different multi-task
settings commonly adopted in previous work, including pipeline, shared
encoders, graph propagation, to examine the effectiveness of different
interactions. The result shows that GC achieves the best performance by up to
2.3/5.1 F1 improvement over the baseline.
Related papers
- CARE: Co-Attention Network for Joint Entity and Relation Extraction [0.0]
We propose a Co-Attention network for joint entity and relation extraction.
Our approach includes adopting a parallel encoding strategy to learn separate representations for each subtask.
At the core of our approach is the co-attention module that captures two-way interaction between the two subtasks.
arXiv Detail & Related papers (2023-08-24T03:40:54Z) - HIORE: Leveraging High-order Interactions for Unified Entity Relation
Extraction [85.80317530027212]
We propose HIORE, a new method for unified entity relation extraction.
The key insight is to leverage the complex association among word pairs, which contains richer information than the first-order word-by-word interactions.
Experiments show that HIORE achieves the state-of-the-art performance on relation extraction and an improvement of 1.11.8 F1 points over the prior best unified model.
arXiv Detail & Related papers (2023-05-07T14:57:42Z) - A Hierarchical Interactive Network for Joint Span-based Aspect-Sentiment
Analysis [34.1489054082536]
We propose a hierarchical interactive network (HI-ASA) to model two-way interactions between two tasks appropriately.
We use cross-stitch mechanism to combine the different task-specific features selectively as the input to ensure proper two-way interactions.
Experiments on three real-world datasets demonstrate HI-ASA's superiority over baselines.
arXiv Detail & Related papers (2022-08-24T03:03:49Z) - Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product
Retrieval [152.3504607706575]
This research aims to conduct weakly-supervised multi-modal instance-level product retrieval for fine-grained product categories.
We first contribute the Product1M datasets, and define two real practical instance-level retrieval tasks.
We exploit to train a more effective cross-modal model which is adaptively capable of incorporating key concept information from the multi-modal data.
arXiv Detail & Related papers (2022-06-17T15:40:45Z) - Document-level Relation Extraction with Context Guided Mention
Integration and Inter-pair Reasoning [18.374097786748834]
Document-level Relation Extraction (DRE) aims to recognize the relations between two entities.
Few previous studies have investigated the mention integration, which may be problematic.
We propose two novel techniques, Context Guided Mention Integration and Inter-pair Reasoning.
arXiv Detail & Related papers (2022-01-13T08:00:23Z) - Cross-Supervised Joint-Event-Extraction with Heterogeneous Information
Networks [61.950353376870154]
Joint-event-extraction is a sequence-to-sequence labeling task with a tag set composed of tags of triggers and entities.
We propose a Cross-Supervised Mechanism (CSM) to alternately supervise the extraction of triggers or entities.
Our approach outperforms the state-of-the-art methods in both entity and trigger extraction.
arXiv Detail & Related papers (2020-10-13T11:51:17Z) - A Co-Interactive Transformer for Joint Slot Filling and Intent Detection [61.109486326954205]
Intent detection and slot filling are two main tasks for building a spoken language understanding (SLU) system.
Previous studies either model the two tasks separately or only consider the single information flow from intent to slot.
We propose a Co-Interactive Transformer to consider the cross-impact between the two tasks simultaneously.
arXiv Detail & Related papers (2020-10-08T10:16:52Z) - DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act
Recognition and Sentiment Classification [77.59549450705384]
In dialog system, dialog act recognition and sentiment classification are two correlative tasks.
Most of the existing systems either treat them as separate tasks or just jointly model the two tasks.
We propose a Deep Co-Interactive Relation Network (DCR-Net) to explicitly consider the cross-impact and model the interaction between the two tasks.
arXiv Detail & Related papers (2020-08-16T14:13:32Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.