MCR-Net: A Multi-Step Co-Interactive Relation Network for Unanswerable
Questions on Machine Reading Comprehension
- URL: http://arxiv.org/abs/2103.04567v1
- Date: Mon, 8 Mar 2021 06:38:14 GMT
- Title: MCR-Net: A Multi-Step Co-Interactive Relation Network for Unanswerable
Questions on Machine Reading Comprehension
- Authors: Wei Peng, Yue Hu, Jing Yu, Luxi Xing, Yuqiang Xie, Zihao Zhu, Yajing
Sun
- Abstract summary: We propose a Multi-Step Co-Interactive Relation Network (MCR-Net) to explicitly model the mutual interaction between the question and passage.
We show that our model achieves a remarkable improvement, outperforming the BERT-style baselines in literature.
- Score: 14.926981547759182
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Question answering systems usually use keyword searches to retrieve potential
passages related to a question, and then extract the answer from passages with
the machine reading comprehension methods. However, many questions tend to be
unanswerable in the real world. In this case, it is significant and challenging
how the model determines when no answer is supported by the passage and
abstains from answering. Most of the existing systems design a simple
classifier to determine answerability implicitly without explicitly modeling
mutual interaction and relation between the question and passage, leading to
the poor performance for determining the unanswerable questions. To tackle this
problem, we propose a Multi-Step Co-Interactive Relation Network (MCR-Net) to
explicitly model the mutual interaction and locate key clues from coarse to
fine by introducing a co-interactive relation module. The co-interactive
relation module contains a stack of interaction and fusion blocks to
continuously integrate and fuse history-guided and current-query-guided clues
in an explicit way. Experiments on the SQuAD 2.0 and DuReader datasets show
that our model achieves a remarkable improvement, outperforming the BERT-style
baselines in literature. Visualization analysis also verifies the importance of
the mutual interaction between the question and passage.
Related papers
- Leveraging Inter-Chunk Interactions for Enhanced Retrieval in Large Language Model-Based Question Answering [12.60063463163226]
IIER captures the internal connections between document chunks by considering three types of interactions: structural, keyword, and semantic.
It identifies multiple seed nodes based on the target question and iteratively searches for relevant chunks to gather supporting evidence.
It refines the context and reasoning chain, aiding the large language model in reasoning and answer generation.
arXiv Detail & Related papers (2024-08-06T02:39:55Z) - PAQA: Toward ProActive Open-Retrieval Question Answering [34.883834970415734]
This work aims to tackle the challenge of generating relevant clarifying questions by taking into account the inherent ambiguities present in both user queries and documents.
We propose PAQA, an extension to the existing AmbiNQ dataset, incorporating clarifying questions.
We then evaluate various models and assess how passage retrieval impacts ambiguity detection and the generation of clarifying questions.
arXiv Detail & Related papers (2024-02-26T14:40:34Z) - Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering [55.295699268654545]
We propose a novel Chain-of-Discussion framework to leverage the synergy among open-source Large Language Models.
Our experiments show that discussions among multiple LLMs play a vital role in enhancing the quality of answers.
arXiv Detail & Related papers (2024-02-26T05:31:34Z) - Successive Prompting for Decomposing Complex Questions [50.00659445976735]
Recent works leverage the capabilities of large language models (LMs) to perform complex question answering in a few-shot setting.
We introduce Successive Prompting'', where we iteratively break down a complex task into a simple task, solve it, and then repeat the process until we get the final solution.
Our best model (with successive prompting) achieves an improvement of 5% absolute F1 on a few-shot version of the DROP dataset.
arXiv Detail & Related papers (2022-12-08T06:03:38Z) - Exploiting Hybrid Semantics of Relation Paths for Multi-hop Question
Answering Over Knowledge Graphs [31.088325888508137]
This paper proposes improving multi-hop KGQA by exploiting relation paths' hybrid semantics.
We integrate explicit textual information and implicit KG structural features of relation paths based on a novel rotate-and-scale entity link prediction framework.
arXiv Detail & Related papers (2022-09-02T08:07:37Z) - Question rewriting? Assessing its importance for conversational question
answering [0.6449761153631166]
This work presents a conversational question answering system designed specifically for the Search-Oriented Conversational AI (SCAI) shared task.
In particular, we considered different variations of the question rewriting module to evaluate the influence on the subsequent components.
Our system achieved the best performance in the shared task and our analysis emphasizes the importance of the conversation context representation for the overall system performance.
arXiv Detail & Related papers (2022-01-22T23:31:25Z) - Context-Aware Interaction Network for Question Matching [51.76812857301819]
We propose a context-aware interaction network (COIN) to align two sequences and infer their semantic relationship.
Specifically, each interaction block includes (1) a context-aware cross-attention mechanism to effectively integrate contextual information, and (2) a gate fusion layer to flexibly interpolate aligned representations.
arXiv Detail & Related papers (2021-04-17T05:03:56Z) - Joint Passage Ranking for Diverse Multi-Answer Retrieval [56.43443577137929]
We study multi-answer retrieval, an under-explored problem that requires retrieving passages to cover multiple distinct answers for a question.
This task requires joint modeling of retrieved passages, as models should not repeatedly retrieve passages containing the same answer at the cost of missing a different valid answer.
In this paper, we introduce JPR, a joint passage retrieval model focusing on reranking. To model the joint probability of the retrieved passages, JPR makes use of an autoregressive reranker that selects a sequence of passages, equipped with novel training and decoding algorithms.
arXiv Detail & Related papers (2021-04-17T04:48:36Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z) - Retrospective Reader for Machine Reading Comprehension [90.6069071495214]
Machine reading comprehension (MRC) is an AI challenge that requires machine to determine the correct answers to questions based on a given passage.
When unanswerable questions are involved in the MRC task, an essential verification module called verifier is especially required in addition to the encoder.
This paper devotes itself to exploring better verifier design for the MRC task with unanswerable questions.
arXiv Detail & Related papers (2020-01-27T11:14:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.