Joint Open Knowledge Base Canonicalization and Linking
- URL: http://arxiv.org/abs/2212.01207v1
- Date: Fri, 2 Dec 2022 14:38:58 GMT
- Title: Joint Open Knowledge Base Canonicalization and Linking
- Authors: Yinan Liu and Wei Shen and Yuanfei Wang and Jianyong Wang and Zhenglu
Yang and Xiaojie Yuan
- Abstract summary: noun phrases (NPs) and relation phrases (RPs) in Open Knowledge Bases are not canonicalized.
We propose a novel framework JOCL based on factor graph model to make them reinforce each other.
- Score: 24.160755953937763
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Open Information Extraction (OIE) methods extract a large number of OIE
triples (noun phrase, relation phrase, noun phrase) from text, which compose
large Open Knowledge Bases (OKBs). However, noun phrases (NPs) and relation
phrases (RPs) in OKBs are not canonicalized and often appear in different
paraphrased textual variants, which leads to redundant and ambiguous facts. To
address this problem, there are two related tasks: OKB canonicalization (i.e.,
convert NPs and RPs to canonicalized form) and OKB linking (i.e., link NPs and
RPs with their corresponding entities and relations in a curated Knowledge Base
(e.g., DBPedia). These two tasks are tightly coupled, and one task can benefit
significantly from the other. However, they have been studied in isolation so
far. In this paper, we explore the task of joint OKB canonicalization and
linking for the first time, and propose a novel framework JOCL based on factor
graph model to make them reinforce each other. JOCL is flexible enough to
combine different signals from both tasks, and able to extend to fit any new
signals. A thorough experimental study over two large scale OIE triple data
sets shows that our framework outperforms all the baseline methods for the task
of OKB canonicalization (OKB linking) in terms of average F1 (accuracy).
Related papers
- REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking [11.374031643273941]
REXEL is a highly efficient and accurate model for the joint task of document level cIE (DocIE)
It is on average 11 times faster than competitive existing approaches in a similar setting.
The combination of speed and accuracy makes REXEL an accurate cost-efficient system for extracting structured information at web-scale.
arXiv Detail & Related papers (2024-04-19T11:04:27Z) - Open Knowledge Base Canonicalization with Multi-task Learning [18.053863554106307]
Large open knowledge bases (OKBs) are integral to many knowledge-driven applications on the world wide web such as web search.
noun phrases and relational phrases in OKBs often suffer from redundancy and ambiguity, which calls for the investigation on OKB canonicalization.
Current solutions address OKB canonicalization by devising advanced clustering algorithms and using knowledge graph embedding (KGE) to further facilitate the canonicalization process.
We put forward a multi-task learning framework, namely MulCanon, to tackle OKB canonicalization.
arXiv Detail & Related papers (2024-03-21T08:03:46Z) - Open Knowledge Base Canonicalization with Multi-task Unlearning [19.130159457887]
MulCanon is a multi-task unlearning framework to tackle machine unlearning problem in OKB canonicalization.
A thorough experimental study on popular OKB canonicalization datasets validates that MulCanon achieves advanced machine unlearning effects.
arXiv Detail & Related papers (2023-10-25T07:13:06Z) - Text2NKG: Fine-Grained N-ary Relation Extraction for N-ary relational Knowledge Graph Construction [20.281505340983035]
Text2NKG is a novel fine-grained n-ary relation extraction framework for n-ary relational knowledge graph construction.
We introduce a span-tuple classification approach with hetero-ordered merging and output merging to accomplish fine-grained n-ary relation extraction in different arity.
arXiv Detail & Related papers (2023-10-08T14:47:13Z) - Learnable Pillar-based Re-ranking for Image-Text Retrieval [119.9979224297237]
Image-text retrieval aims to bridge the modality gap and retrieve cross-modal content based on semantic similarities.
Re-ranking, a popular post-processing practice, has revealed the superiority of capturing neighbor relations in single-modality retrieval tasks.
We propose a novel learnable pillar-based re-ranking paradigm for image-text retrieval.
arXiv Detail & Related papers (2023-04-25T04:33:27Z) - Enriching Relation Extraction with OpenIE [70.52564277675056]
Relation extraction (RE) is a sub-discipline of information extraction (IE)
In this work, we explore how recent approaches for open information extraction (OpenIE) may help to improve the task of RE.
Our experiments over two annotated corpora, KnowledgeNet and FewRel, demonstrate the improved accuracy of our enriched models.
arXiv Detail & Related papers (2022-12-19T11:26:23Z) - UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question
Answering Over Knowledge Graph [89.98762327725112]
Multi-hop Question Answering over Knowledge Graph(KGQA) aims to find the answer entities that are multiple hops away from the topic entities mentioned in a natural language question.
We propose UniKGQA, a novel approach for multi-hop KGQA task, by unifying retrieval and reasoning in both model architecture and parameter learning.
arXiv Detail & Related papers (2022-12-02T04:08:09Z) - Multi-View Clustering for Open Knowledge Base Canonicalization [9.976636206355394]
Noun phrases and relation phrases in large open knowledge bases (OKBs) are not canonicalized.
We propose CMVC, a novel unsupervised framework that leverages two views of knowledge jointly for canonicalizing OKBs.
We demonstrate the superiority of our framework through extensive experiments on multiple real-world OKB data sets against state-of-the-art methods.
arXiv Detail & Related papers (2022-06-22T14:23:16Z) - Text-based NP Enrichment [51.403543011094975]
We establish the task of text-based NP enrichment (TNE), that is, enriching each NP with all the preposition-mediated relations that hold between this and the other NPs in the text.
Humans recover such relations seamlessly, while current state-of-the-art models struggle with them due to the implicit nature of the problem.
We build the first large-scale dataset for the problem, provide the formal framing and scope of annotation, analyze the data, and report the result of fine-tuned neural language models on the task.
arXiv Detail & Related papers (2021-09-24T17:23:25Z) - Inferential Text Generation with Multiple Knowledge Sources and
Meta-Learning [117.23425857240679]
We study the problem of generating inferential texts of events for a variety of commonsense like textitif-else relations.
Existing approaches typically use limited evidence from training examples and learn for each relation individually.
In this work, we use multiple knowledge sources as fuels for the model.
arXiv Detail & Related papers (2020-04-07T01:49:18Z) - Graph Structured Network for Image-Text Matching [127.68148793548116]
We present a novel Graph Structured Matching Network to learn fine-grained correspondence.
The GSMN explicitly models object, relation and attribute as a structured phrase.
Experiments show that GSMN outperforms state-of-the-art methods on benchmarks.
arXiv Detail & Related papers (2020-04-01T08:20:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.