Counterfactual Multihop QA: A Cause-Effect Approach for Reducing
Disconnected Reasoning
- URL: http://arxiv.org/abs/2210.07138v1
- Date: Thu, 13 Oct 2022 16:21:53 GMT
- Title: Counterfactual Multihop QA: A Cause-Effect Approach for Reducing
Disconnected Reasoning
- Authors: Wangzhen Guo, Qinkang Gong, Hanjiang Lai
- Abstract summary: Multi-hop QA requires reasoning over multiple supporting facts to answer the question.
We propose a novel counterfactual multihop QA, a causal-effect approach that enables to reduce the disconnected reasoning.
Our method achieves 5.8% higher points of its Supp$_s$ score on HotpotQA through true multihop reasoning.
- Score: 5.343815893782489
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multi-hop QA requires reasoning over multiple supporting facts to answer the
question. However, the existing QA models always rely on shortcuts, e.g.,
providing the true answer by only one fact, rather than multi-hop reasoning,
which is referred as $\textit{disconnected reasoning}$ problem. To alleviate
this issue, we propose a novel counterfactual multihop QA, a causal-effect
approach that enables to reduce the disconnected reasoning. It builds upon
explicitly modeling of causality: 1) the direct causal effects of disconnected
reasoning and 2) the causal effect of true multi-hop reasoning from the total
causal effect. With the causal graph, a counterfactual inference is proposed to
disentangle the disconnected reasoning from the total causal effect, which
provides us a new perspective and technology to learn a QA model that exploits
the true multi-hop reasoning instead of shortcuts. Extensive experiments have
conducted on the benchmark HotpotQA dataset, which demonstrate that the
proposed method can achieve notable improvement on reducing disconnected
reasoning. For example, our method achieves 5.8% higher points of its Supp$_s$
score on HotpotQA through true multihop reasoning. The code is available at
supplementary material.
Related papers
- HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale
Supervision [118.0818807474809]
This work proposes a principled, probabilistic approach for training explainable multi-hop QA systems without rationale supervision.
Our approach performs multi-hop reasoning by explicitly modeling rationales as sets, enabling the model to capture interactions between documents and sentences within a document.
arXiv Detail & Related papers (2023-05-23T16:53:49Z) - Understanding and Improving Zero-shot Multi-hop Reasoning in Generative
Question Answering [85.79940770146557]
We decompose multi-hop questions into multiple corresponding single-hop questions.
We find marked inconsistency in QA models' answers on these pairs of ostensibly identical question chains.
When trained only on single-hop questions, models generalize poorly to multi-hop questions.
arXiv Detail & Related papers (2022-10-09T11:48:07Z) - Prompt-based Conservation Learning for Multi-hop Question Answering [11.516763652013005]
Multi-hop question answering requires reasoning over multiple documents to answer a complex question.
Most existing multi-hop QA methods fail to answer a large fraction of sub-questions.
We propose the Prompt-based Conservation Learning framework for multi-hop QA.
arXiv Detail & Related papers (2022-09-14T20:50:46Z) - Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question
Answering [71.49131159045811]
Multi-hop reasoning requires aggregating multiple documents to answer a complex question.
Existing methods usually decompose the multi-hop question into simpler single-hop questions.
We propose an interpretable stepwise reasoning framework to incorporate both single-hop supporting sentence identification and single-hop question generation.
arXiv Detail & Related papers (2022-08-22T13:24:25Z) - Modeling Multi-hop Question Answering as Single Sequence Prediction [88.72621430714985]
We propose a simple generative approach (PathFid) that extends the task beyond just answer generation.
PathFid explicitly models the reasoning process to resolve the answer for multi-hop questions.
Our experiments demonstrate that PathFid leads to strong performance gains on two multi-hop QA datasets.
arXiv Detail & Related papers (2022-05-18T21:57:59Z) - Reasoning Chain Based Adversarial Attack for Multi-hop Question
Answering [0.0]
Previous adversarial attack works usually edit the whole question sentence.
We propose a multi-hop reasoning chain based adversarial attack method.
Results demonstrate significant performance reduction on both answer and supporting facts prediction.
arXiv Detail & Related papers (2021-12-17T18:03:14Z) - Counterfactual Variable Control for Robust and Interpretable Question
Answering [57.25261576239862]
Deep neural network based question answering (QA) models are neither robust nor explainable in many cases.
In this paper, we inspect such spurious "capability" of QA models using causal inference.
We propose a novel approach called Counterfactual Variable Control (CVC) that explicitly mitigates any shortcut correlation.
arXiv Detail & Related papers (2020-10-12T10:09:05Z) - Is Multihop QA in DiRe Condition? Measuring and Reducing Disconnected
Reasoning [50.114651561111245]
Models often exploit dataset artifacts to produce correct answers, without connecting information across multiple supporting facts.
We formalize such undesirable behavior as disconnected reasoning across subsets of supporting facts.
Experiments suggest that there hasn't been much progress in multi-hop QA in the reading comprehension setting.
arXiv Detail & Related papers (2020-05-02T11:01:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.