Entity-centered Cross-document Relation Extraction
- URL: http://arxiv.org/abs/2210.16541v1
- Date: Sat, 29 Oct 2022 09:27:15 GMT
- Title: Entity-centered Cross-document Relation Extraction
- Authors: Fengqi Wang, Fei Li, Hao Fei, Jingye Li, Shengqiong Wu, Fangfang Su,
Wenxuan Shi, Donghong Ji, Bo Cai
- Abstract summary: Relation Extraction (RE) is a fundamental task of information extraction, which has attracted a large amount of research attention.
Previous studies focus on extracting the relations within a sentence or document, while currently researchers begin to explore cross-document RE.
In this paper, we aim to address both of these shortages and push the state-of-the-art for cross-document RE.
- Score: 34.38369224008656
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Relation Extraction (RE) is a fundamental task of information extraction,
which has attracted a large amount of research attention. Previous studies
focus on extracting the relations within a sentence or document, while
currently researchers begin to explore cross-document RE. However, current
cross-document RE methods directly utilize text snippets surrounding target
entities in multiple given documents, which brings considerable noisy and
non-relevant sentences. Moreover, they utilize all the text paths in a document
bag in a coarse-grained way, without considering the connections between these
text paths.In this paper, we aim to address both of these shortages and push
the state-of-the-art for cross-document RE. First, we focus on input
construction for our RE model and propose an entity-based document-context
filter to retain useful information in the given documents by using the bridge
entities in the text paths. Second, we propose a cross-document RE model based
on cross-path entity relation attention, which allow the entity relations
across text paths to interact with each other. We compare our cross-document RE
method with the state-of-the-art methods in the dataset CodRED. Our method
outperforms them by at least 10% in F1, thus demonstrating its effectiveness.
Related papers
- Unified Multi-Modal Interleaved Document Representation for Information Retrieval [57.65409208879344]
We produce more comprehensive and nuanced document representations by holistically embedding documents interleaved with different modalities.
Specifically, we achieve this by leveraging the capability of recent vision-language models that enable the processing and integration of text, images, and tables into a unified format and representation.
arXiv Detail & Related papers (2024-10-03T17:49:09Z) - Contextual Document Embeddings [77.22328616983417]
We propose two complementary methods for contextualized document embeddings.
First, an alternative contrastive learning objective that explicitly incorporates the document neighbors into the intra-batch contextual loss.
Second, a new contextual architecture that explicitly encodes neighbor document information into the encoded representation.
arXiv Detail & Related papers (2024-10-03T14:33:34Z) - Reward-based Input Construction for Cross-document Relation Extraction [11.52832308525974]
Cross-document Relation extraction (RE) is a fundamental task in natural language processing.
We propose REward-based Input Construction (REIC), the first learning-based sentence selector for cross-document RE.
REIC extracts sentences based on relational evidence, enabling the RE module to effectively infer relations.
arXiv Detail & Related papers (2024-05-31T07:30:34Z) - Knowledge-Driven Cross-Document Relation Extraction [3.868708275322908]
Relation extraction (RE) is a well-known NLP application often treated as a sentence- or document-level task.
We propose a novel approach, KXDocRE, that embed domain knowledge of entities with input text for cross-document RE.
arXiv Detail & Related papers (2024-05-22T11:30:59Z) - DAPR: A Benchmark on Document-Aware Passage Retrieval [57.45793782107218]
We propose and name this task emphDocument-Aware Passage Retrieval (DAPR)
While analyzing the errors of the State-of-The-Art (SoTA) passage retrievers, we find the major errors (53.5%) are due to missing document context.
Our created benchmark enables future research on developing and comparing retrieval systems for the new task.
arXiv Detail & Related papers (2023-05-23T10:39:57Z) - Document-level Relation Extraction with Cross-sentence Reasoning Graph [14.106582119686635]
Relation extraction (RE) has recently moved from the sentence-level to document-level.
We propose a novel document-level RE model with a GRaph information Aggregation and Cross-sentence Reasoning network (GRACR)
Experimental results show GRACR achieves excellent performance on two public datasets of document-level RE.
arXiv Detail & Related papers (2023-03-07T14:14:12Z) - Multi-hop Evidence Retrieval for Cross-document Relation Extraction [23.98136192661566]
We propose MR.COD (Multi-hop evidence retrieval for Cross-document relation extraction), which is a multi-hop evidence retrieval method based on evidence path mining and ranking.
Experiments on CodRED show that evidence retrieval with MR.COD effectively acquires crossdocument evidence and boosts end-to-end RE performance in both closed and open settings.
arXiv Detail & Related papers (2022-12-21T06:00:22Z) - ReSel: N-ary Relation Extraction from Scientific Text and Tables by
Learning to Retrieve and Select [53.071352033539526]
We study the problem of extracting N-ary relations from scientific articles.
Our proposed method ReSel decomposes this task into a two-stage procedure.
Our experiments on three scientific information extraction datasets show that ReSel outperforms state-of-the-art baselines significantly.
arXiv Detail & Related papers (2022-10-26T02:28:02Z) - Cross-document Event Coreference Search: Task, Dataset and Modeling [26.36068336169796]
We propose an appealing, and often more applicable, complementary set up for the task - Cross-document Coreference Search.
To support research on this task, we create a corresponding dataset, which is derived from Wikipedia.
We present a novel model that integrates a powerful coreference scoring scheme into the DPR architecture, yielding improved performance.
arXiv Detail & Related papers (2022-10-23T08:21:25Z) - Eider: Evidence-enhanced Document-level Relation Extraction [56.71004595444816]
Document-level relation extraction (DocRE) aims at extracting semantic relations among entity pairs in a document.
We propose a three-stage evidence-enhanced DocRE framework consisting of joint relation and evidence extraction, evidence-centered relation extraction (RE), and fusion of extraction results.
arXiv Detail & Related papers (2021-06-16T09:43:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.