mOKB6: A Multilingual Open Knowledge Base Completion Benchmark
- URL: http://arxiv.org/abs/2211.06959v2
- Date: Sun, 28 May 2023 10:18:59 GMT
- Title: mOKB6: A Multilingual Open Knowledge Base Completion Benchmark
- Authors: Shubham Mittal, Keshav Kolluru, Soumen Chakrabarti, Mausam
- Abstract summary: We construct the first multilingual Open KBC dataset, called mOKB6, containing facts from Wikipedia in six languages (including English).
We experiment with several models for the task and observe a consistent benefit of combining languages with the help of a shared embedding space as well as translations of facts.
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Automated completion of open knowledge bases (Open KBs), which are
constructed from triples of the form (subject phrase, relation phrase, object
phrase) obtained via open information extraction (Open IE) systems, is useful
for discovering novel facts that may not be directly present in the text.
However, research in Open KB completion (Open KBC) has so far been limited to
resource-rich languages like English. Using the latest advances in multilingual
Open IE, we construct the first multilingual Open KBC dataset, called mOKB6,
containing facts from Wikipedia in six languages (including English). Improving
the previous Open KB construction pipeline by doing multilingual coreference
resolution and keeping only entity-linked triples, we create a dense Open KB.
We experiment with several models for the task and observe a consistent benefit
of combining languages with the help of a shared embedding space as well as
translations of facts. We also observe that current multilingual models
struggle to remember facts seen in languages of different scripts.
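The triple form described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Triple` type, the `lang` field, the helper names, and the toy facts are hypothetical and are not taken from mOKB6 or from the authors' code. It only shows what an Open KBC query of the form (subject phrase, relation phrase, ?) looks like; a real completion model would rank candidate object phrases instead of doing an exact lookup.

```python
from collections import defaultdict
from typing import NamedTuple


class Triple(NamedTuple):
    """One Open KB fact; all three elements are free-text phrases."""
    subject: str
    relation: str
    obj: str
    lang: str  # language code of the source sentence (hypothetical field)


def build_index(triples):
    """Map (subject, relation) to the set of object phrases stored for it."""
    index = defaultdict(set)
    for t in triples:
        index[(t.subject.lower(), t.relation.lower())].add(t.obj)
    return index


def known_objects(index, subject, relation):
    """Return object phrases already stored for a (subject, relation, ?) query."""
    # index.get avoids creating empty entries for unseen queries.
    return sorted(index.get((subject.lower(), relation.lower()), set()))


# Toy facts, invented for illustration only (not taken from mOKB6).
toy_kb = [
    Triple("Marie Curie", "was born in", "Warsaw", "en"),
    Triple("Marie Curie", "won", "the Nobel Prize in Physics", "en"),
    Triple("Marie Curie", "nació en", "Varsovia", "es"),
]

index = build_index(toy_kb)
print(known_objects(index, "Marie Curie", "was born in"))  # ['Warsaw']
print(known_objects(index, "Marie Curie", "died in"))      # []
```

The second query returns nothing because the fact is absent from the toy KB; filling such gaps is the task that Open KB completion addresses. Note that the English and Spanish facts are stored under different relation phrases, since Open KB phrases are not canonicalized.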
Related papers
- KBLaM: Knowledge Base augmented Language Model [8.247901935078357]
We propose Knowledge Base augmented Language Model (KBLaM) for augmenting Large Language Models with external knowledge.
KBLaM works with a knowledge base constructed from a corpus of documents, transforming each piece of knowledge in the KB into continuous key-value vector pairs.
Experiments demonstrate KBLaM's effectiveness in various tasks, including question-answering and open-ended reasoning.
(2024-10-14T12:45:10Z)
- KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases [55.942342665806656]
KnowledGPT is a comprehensive framework to bridge large language models with various knowledge bases.
The retrieval process employs the program of thought prompting, which generates search language for KBs in code format.
KnowledGPT offers the capability to store knowledge in a personalized KB, catering to individual user demands.
(2023-08-17T13:07:00Z)
- Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation [14.678465723838599]
In particular, open information extraction (OpenIE) is often used to induce structure from a text.
OpenIEs contain an open-ended, non-canonicalized set of relations, making the extracted knowledge's downstream exploitation harder.
We propose approaching the problem by generative translation, i.e., by training a language model to generate fixed-schema assertions from open ones.
(2023-06-22T09:42:54Z)
- Cross-Lingual Question Answering over Knowledge Base as Reading Comprehension [61.079852289005025]
Cross-lingual question answering over knowledge base (xKBQA) aims to answer questions in languages different from that of the provided knowledge base.
One of the major challenges facing xKBQA is the high cost of data annotation.
We propose a novel approach for xKBQA in a reading comprehension paradigm.
(2023-02-26T05:52:52Z)
- Prix-LM: Pretraining for Multilingual Knowledge Base Construction [59.02868906044296]
We propose a unified framework, Prix-LM, for multilingual knowledge construction and completion.
We leverage two types of knowledge, monolingual triples and cross-lingual links, extracted from existing multilingual KBs.
Experiments on standard entity-related tasks, such as link prediction in multiple languages, cross-lingual entity linking and bilingual lexicon induction, demonstrate its effectiveness.
(2021-10-16T02:08:46Z)
- Automatic Construction of Sememe Knowledge Bases via Dictionaries [53.8700954466358]
Sememe knowledge bases (SKBs) enable sememes to be applied to natural language processing.
Most languages have no SKBs, and manual construction of SKBs is time-consuming and labor-intensive.
We propose a simple and fully automatic method of building an SKB via an existing dictionary.
(2021-05-26T14:41:01Z)
- Reasoning Over Virtual Knowledge Bases With Open Predicate Relations [85.19305347984515]
We present the Open Predicate Query Language (OPQL).
OPQL is a method for constructing a virtual Knowledge Base (VKB) trained entirely from text.
We demonstrate that OPQL outperforms prior VKB methods on two different KB reasoning tasks.
(2021-02-14T01:29:54Z)