Crake: Causal-Enhanced Table-Filler for Question Answering over Large
Scale Knowledge Base
- URL: http://arxiv.org/abs/2207.03680v1
- Date: Fri, 8 Jul 2022 04:21:26 GMT
- Title: Crake: Causal-Enhanced Table-Filler for Question Answering over Large
Scale Knowledge Base
- Authors: Minhao Zhang, Ruoyu Zhang, Yanzeng Li, Lei Zou
- Abstract summary: We formalize semantic parsing into two stages.
In the first stage, we propose a causal-enhanced table-filler to overcome the issues in sequence-modelling and to learn the internal causalities.
In the second stage, an efficient beam-search algorithm is presented to scale complex queries on large-scale KBs.
- Score: 11.888045774125787
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Semantic parsing solves knowledge base (KB) question answering (KBQA) by
composing a KB query, which generally involves node extraction (NE) and graph
composition (GC) to detect and connect related nodes in a query. Despite the
strong causal effects between NE and GC, previous works fail to directly model
such causalities in their pipeline, hindering the learning of subtask
correlations. Also, the sequence-generation process for GC in previous works
induces ambiguity and exposure bias, which further harms accuracy. In this
work, we formalize semantic parsing into two stages. In the first stage (graph
structure generation), we propose a causal-enhanced table-filler to overcome
the issues in sequence-modelling and to learn the internal causalities. In the
second stage (relation extraction), an efficient beam-search algorithm is
presented to scale complex queries on large-scale KBs. Experiments on LC-QuAD
1.0 indicate that our method surpasses previous state-of-the-arts by a large
margin (17%) while remaining time and space efficiency. The code and models are
available at https://github.com/AOZMH/Crake.
Related papers
- DeCaFlow: A Deconfounding Causal Generative Model [58.411886466157185]
Causal generative models (CGMs) have recently emerged as capable approaches to simulate the causal mechanisms generating our observations.
We introduce DeCaFlow, a CGM that accounts for hidden confounders in a single amortized training process using only observational data and the causal graph.
For the first time to our knowledge, we show that a confounded counterfactual query is identifiable, and thus solvable by DeCaFlow.
arXiv Detail & Related papers (2025-03-19T11:14:16Z) - SPARKLE: Enhancing SPARQL Generation with Direct KG Integration in Decoding [0.46040036610482665]
We present a novel end-to-end natural language to SPARQL framework, SPARKLE.
SPARKLE leverages the structure of knowledge base directly during the decoding, effectively integrating knowledge into the query generation.
We show that SPARKLE achieves new state-of-the-art results on SimpleQuestions-Wiki and highest F1 score on LCQuAD 1.0.
arXiv Detail & Related papers (2024-06-29T06:43:11Z) - Root Cause Explanation of Outliers under Noisy Mechanisms [50.59446568076628]
Causal processes are often modelled as graphs with entities being nodes and their paths/interconnections as edge.
Existing work only consider the contribution of nodes in the generative process.
We consider both individual edge and node of each mechanism when identifying the root causes.
arXiv Detail & Related papers (2023-12-19T03:24:26Z) - Event Extraction as Question Generation and Answering [72.04433206754489]
Recent work on Event Extraction has reframed the task as Question Answering (QA)
We propose QGA-EE, which enables a Question Generation (QG) model to generate questions that incorporate rich contextual information instead of using fixed templates.
Experiments show that QGA-EE outperforms all prior single-task-based models on the ACE05 English dataset.
arXiv Detail & Related papers (2023-07-10T01:46:15Z) - Constructing and Interpreting Causal Knowledge Graphs from News [3.3071569417370745]
Many financial jobs rely on news to learn about causal events in the past and present, to make informed decisions and predictions about the future.
We propose a methodology to construct causal knowledge graphs (KGs) from news using two steps: (1) Extraction of Causal Relations, and (2) Argument Clustering and Representation into KG.
arXiv Detail & Related papers (2023-05-16T11:33:32Z) - CUTS: Neural Causal Discovery from Irregular Time-Series Data [27.06531262632836]
Causal discovery from time-series data has been a central task in machine learning.
We present CUTS, a neural Granger causal discovery algorithm to jointly impute unobserved data points and build causal graphs.
Our approach constitutes a promising step towards applying causal discovery to real applications with non-ideal observations.
arXiv Detail & Related papers (2023-02-15T04:16:34Z) - Towards Better Out-of-Distribution Generalization of Neural Algorithmic
Reasoning Tasks [51.8723187709964]
We study the OOD generalization of neural algorithmic reasoning tasks.
The goal is to learn an algorithm from input-output pairs using deep neural networks.
arXiv Detail & Related papers (2022-11-01T18:33:20Z) - TIARA: Multi-grained Retrieval for Robust Question Answering over Large
Knowledge Bases [20.751369684593985]
TIARA outperforms previous SOTA, including those using PLMs or oracle entity annotations, by at least 4.1 and 1.1 F1 points on GrailQA and WebQuestionsSP.
arXiv Detail & Related papers (2022-10-24T02:41:10Z) - RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base
Question Answering [57.94658176442027]
We present RnG-KBQA, a Rank-and-Generate approach for KBQA.
We achieve new state-of-the-art results on GrailQA and WebQSP datasets.
arXiv Detail & Related papers (2021-09-17T17:58:28Z) - Efficient Contextualization using Top-k Operators for Question Answering
over Knowledge Graphs [24.520002698010856]
This work presents ECQA, an efficient method that prunes irrelevant parts of the search space using KB-aware signals.
Experiments with two recent QA benchmarks demonstrate the superiority of ECQA over state-of-the-art baselines with respect to answer presence, size of the search space, and runtimes.
arXiv Detail & Related papers (2021-08-19T10:06:14Z) - ClarQ: A large-scale and diverse dataset for Clarification Question
Generation [67.1162903046619]
We devise a novel bootstrapping framework that assists in the creation of a diverse, large-scale dataset of clarification questions based on postcomments extracted from stackexchange.
We quantitatively demonstrate the utility of the newly created dataset by applying it to the downstream task of question-answering.
We release this dataset in order to foster research into the field of clarification question generation with the larger goal of enhancing dialog and question answering systems.
arXiv Detail & Related papers (2020-06-10T17:56:50Z) - Harvesting and Refining Question-Answer Pairs for Unsupervised QA [95.9105154311491]
We introduce two approaches to improve unsupervised Question Answering (QA)
First, we harvest lexically and syntactically divergent questions from Wikipedia to automatically construct a corpus of question-answer pairs (named as RefQA)
Second, we take advantage of the QA model to extract more appropriate answers, which iteratively refines data over RefQA.
arXiv Detail & Related papers (2020-05-06T15:56:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.