A Benchmark for Generalizable and Interpretable Temporal Question
Answering over Knowledge Bases
- URL: http://arxiv.org/abs/2201.05793v1
- Date: Sat, 15 Jan 2022 08:49:09 GMT
- Title: A Benchmark for Generalizable and Interpretable Temporal Question
Answering over Knowledge Bases
- Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan
Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee,
Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue,
G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam
Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray,
Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam
- Abstract summary: TempQA-WD is a benchmark dataset for temporal reasoning.
It is based on Wikidata, which is the most frequently curated, openly available knowledge base.
- Score: 67.33560134350427
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning
are emerging as an important research direction. However, most existing KBQA
datasets focus primarily on generic multi-hop reasoning over explicit facts,
largely ignoring other reasoning types such as temporal, spatial, and taxonomic
reasoning. In this paper, we present a benchmark dataset for temporal
reasoning, TempQA-WD, to encourage research in extending the present approaches
to target a more challenging set of complex reasoning tasks. Specifically, our
benchmark is a temporal question answering dataset with the following
advantages: (a) it is based on Wikidata, which is the most frequently curated,
openly available knowledge base, (b) it includes intermediate sparql queries to
facilitate the evaluation of semantic parsing based approaches for KBQA, and
(c) it generalizes to multiple knowledge bases: Freebase and Wikidata. The
TempQA-WD dataset is available at https://github.com/IBM/tempqa-wd.
Related papers
- ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering [24.046966640011124]
ComplexTempQA is a large-scale dataset consisting of over 100 million question-answer pairs.
The dataset covers questions spanning over two decades and offers an unmatched breadth of topics.
arXiv Detail & Related papers (2024-06-07T12:01:59Z) - Multi-hop Question Answering under Temporal Knowledge Editing [9.356343796845662]
Multi-hop question answering (MQA) under knowledge editing (KE) has garnered significant attention in the era of large language models.
Existing models for MQA under KE exhibit poor performance when dealing with questions containing explicit temporal contexts.
We propose TEMPoral knowLEdge augmented Multi-hop Question Answering (TEMPLE-MQA) to address this limitation.
arXiv Detail & Related papers (2024-03-30T23:22:51Z) - Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning [73.51314109184197]
It is crucial for large language models (LLMs) to understand the concept of temporal knowledge.
We propose a complex temporal question-answering dataset Complex-TR that focuses on multi-answer and multi-hop temporal reasoning.
arXiv Detail & Related papers (2023-11-16T11:49:29Z) - Open-Set Knowledge-Based Visual Question Answering with Inference Paths [79.55742631375063]
The purpose of Knowledge-Based Visual Question Answering (KB-VQA) is to provide a correct answer to the question with the aid of external knowledge bases.
We propose a new retriever-ranker paradigm of KB-VQA, Graph pATH rankER (GATHER for brevity)
Specifically, it contains graph constructing, pruning, and path-level ranking, which not only retrieves accurate answers but also provides inference paths that explain the reasoning process.
arXiv Detail & Related papers (2023-10-12T09:12:50Z) - DecAF: Joint Decoding of Answers and Logical Forms for Question
Answering over Knowledge Bases [81.19499764899359]
We propose a novel framework DecAF that jointly generates both logical forms and direct answers.
DecAF achieves new state-of-the-art accuracy on WebQSP, FreebaseQA, and GrailQA benchmarks.
arXiv Detail & Related papers (2022-09-30T19:51:52Z) - Targeted Extraction of Temporal Facts from Textual Resources for
Improved Temporal Question Answering over Knowledge Bases [21.108609901224572]
Knowledge Base Question Answering (KBQA) systems have the goal of answering complex natural language questions by reasoning over relevant facts retrieved from Knowledge Bases (KB)
One of the major challenges faced by these systems is their inability to retrieve all relevant facts due to incomplete KB and entity/relation linking errors.
We propose a novel approach where a targeted temporal fact extraction technique is used to assist KBQA whenever it fails to retrieve temporal facts from the KB.
arXiv Detail & Related papers (2022-03-21T15:26:35Z) - A Chinese Multi-type Complex Questions Answering Dataset over Wikidata [45.31495982252219]
Complex Knowledge Base Question Answering is a popular area of research in the past decade.
Recent public datasets have led to encouraging results in this field, but are mostly limited to English.
Few state-of-the-art KBQA models are trained on Wikidata, one of the most popular real-world knowledge bases.
We propose CLC-QuAD, the first large scale complex Chinese semantic parsing dataset over Wikidata to address these challenges.
arXiv Detail & Related papers (2021-11-11T07:39:16Z) - SYGMA: System for Generalizable Modular Question Answering OverKnowledge
Bases [57.89642289610301]
We present SYGMA, a modular approach facilitating general-izability across multiple knowledge bases and multiple rea-soning types.
We demonstrate effectiveness of our system by evaluating on datasets belonging to two distinct knowledge bases,DBpedia and Wikidata.
arXiv Detail & Related papers (2021-09-28T01:57:56Z) - A Survey on Complex Question Answering over Knowledge Base: Recent
Advances and Challenges [71.4531144086568]
Question Answering (QA) over Knowledge Base (KB) aims to automatically answer natural language questions.
Researchers have shifted their attention from simple questions to complex questions, which require more KB triples and constraint inference.
arXiv Detail & Related papers (2020-07-26T07:13:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.