Knowledge Graph Simple Question Answering for Unseen Domains
- URL: http://arxiv.org/abs/2005.12040v1
- Date: Mon, 25 May 2020 11:34:54 GMT
- Title: Knowledge Graph Simple Question Answering for Unseen Domains
- Authors: Georgios Sidiropoulos, Nikos Voskarides and Evangelos Kanoulas
- Abstract summary: We propose a data-centric domain adaptation framework that is applicable to new domains.
We use distant supervision to extract a set of keywords that express each relation of the unseen domain.
Our framework significantly improves over zero-shot baselines and is robust across domains.
- Score: 9.263766921991452
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Knowledge graph simple question answering (KGSQA), in its standard form, does
not take into account that human-curated question answering training data only
cover a small subset of the relations that exist in a Knowledge Graph (KG), or
even worse, that new domains covering unseen and rather different to existing
domains relations are added to the KG. In this work, we study KGSQA in a
previously unstudied setting where new, unseen domains are added during test
time. In this setting, question-answer pairs of the new domain do not appear
during training, thus making the task more challenging. We propose a
data-centric domain adaptation framework that consists of a KGSQA system that
is applicable to new domains, and a sequence to sequence question generation
method that automatically generates question-answer pairs for the new domain.
Since the effectiveness of question generation for KGSQA can be restricted by
the limited lexical variety of the generated questions, we use distant
supervision to extract a set of keywords that express each relation of the
unseen domain and incorporate those in the question generation method.
Experimental results demonstrate that our framework significantly improves over
zero-shot baselines and is robust across domains.
Related papers
- QA Domain Adaptation using Hidden Space Augmentation and Self-Supervised
Contrastive Adaptation [24.39026345750824]
Question answering (QA) has recently shown impressive results for answering questions from customized domains.
Yet, a common challenge is to adapt QA models to an unseen target domain.
We propose a novel self-supervised framework called QADA for QA domain adaptation.
arXiv Detail & Related papers (2022-10-19T19:52:57Z) - Domain Adaptation for Question Answering via Question Classification [8.828396559882954]
We propose a novel framework: Question Classification for Question Answering (QC4QA)
For optimization, inter-domain discrepancy between the source and target domain is reduced via maximum mean discrepancy (MMD) distance.
We demonstrate the effectiveness of the proposed QC4QA with consistent improvements against the state-of-the-art baselines on multiple datasets.
arXiv Detail & Related papers (2022-09-12T03:12:02Z) - Multifaceted Improvements for Conversational Open-Domain Question
Answering [54.913313912927045]
We propose a framework with Multifaceted Improvements for Conversational open-domain Question Answering (MICQA)
Firstly, the proposed KL-divergence based regularization is able to lead to a better question understanding for retrieval and answer reading.
Second, the added post-ranker module can push more relevant passages to the top placements and be selected for reader with a two-aspect constrains.
Third, the well designed curriculum learning strategy effectively narrows the gap between the golden passage settings of training and inference, and encourages the reader to find true answer without the golden passage assistance.
arXiv Detail & Related papers (2022-04-01T07:54:27Z) - Relation-Guided Pre-Training for Open-Domain Question Answering [67.86958978322188]
We propose a Relation-Guided Pre-Training (RGPT-QA) framework to solve complex open-domain questions.
We show that RGPT-QA achieves 2.2%, 2.4%, and 6.3% absolute improvement in Exact Match accuracy on Natural Questions, TriviaQA, and WebQuestions.
arXiv Detail & Related papers (2021-09-21T17:59:31Z) - Contrastive Domain Adaptation for Question Answering using Limited Text
Corpora [20.116147632481983]
We propose a novel framework for domain adaptation called contrastive domain adaptation for QA.
Specifically, CAQA combines techniques from question generation and domain-invariant learning to answer out-of-domain questions in settings with limited text corpora.
arXiv Detail & Related papers (2021-08-31T14:05:55Z) - Cross-Domain Generalization Through Memorization: A Study of Nearest
Neighbors in Neural Duplicate Question Detection [72.01292864036087]
Duplicate question detection (DQD) is important to increase efficiency of community and automatic question answering systems.
We leverage neural representations and study nearest neighbors for cross-domain generalization in DQD.
We observe robust performance of this method in different cross-domain scenarios of StackExchange, Spring and Quora datasets.
arXiv Detail & Related papers (2020-11-22T19:19:33Z) - A Survey on Complex Question Answering over Knowledge Base: Recent
Advances and Challenges [71.4531144086568]
Question Answering (QA) over Knowledge Base (KB) aims to automatically answer natural language questions.
Researchers have shifted their attention from simple questions to complex questions, which require more KB triples and constraint inference.
arXiv Detail & Related papers (2020-07-26T07:13:32Z) - ClarQ: A large-scale and diverse dataset for Clarification Question
Generation [67.1162903046619]
We devise a novel bootstrapping framework that assists in the creation of a diverse, large-scale dataset of clarification questions based on postcomments extracted from stackexchange.
We quantitatively demonstrate the utility of the newly created dataset by applying it to the downstream task of question-answering.
We release this dataset in order to foster research into the field of clarification question generation with the larger goal of enhancing dialog and question answering systems.
arXiv Detail & Related papers (2020-06-10T17:56:50Z) - AmbigQA: Answering Ambiguous Open-domain Questions [99.59747941602684]
We introduce AmbigQA, a new open-domain question answering task which involves finding every plausible answer.
To study this task, we construct AmbigNQ, a dataset covering 14,042 questions from NQ-open.
We find that over half of the questions in NQ-open are ambiguous, with diverse sources of ambiguity such as event and entity references.
arXiv Detail & Related papers (2020-04-22T15:42:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.