An Answer Verbalization Dataset for Conversational Question Answerings
over Knowledge Graphs
- URL: http://arxiv.org/abs/2208.06734v1
- Date: Sat, 13 Aug 2022 21:21:28 GMT
- Title: An Answer Verbalization Dataset for Conversational Question Answerings
over Knowledge Graphs
- Authors: Endri Kacupaj, Kuldeep Singh, Maria Maleshkova, Jens Lehmann
- Abstract summary: This paper contributes to the state-of-the-art by extending an existing ConvQA dataset with verbalized answers.
We perform experiments with five sequence-to-sequence models on generating answer responses while maintaining grammatical correctness.
- Score: 9.979689965471428
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We introduce a new dataset for conversational question answering over
Knowledge Graphs (KGs) with verbalized answers. Question answering over KGs is
currently focused on answer generation for single-turn questions (KGQA) or
multiple-tun conversational question answering (ConvQA). However, in a
real-world scenario (e.g., voice assistants such as Siri, Alexa, and Google
Assistant), users prefer verbalized answers. This paper contributes to the
state-of-the-art by extending an existing ConvQA dataset with multiple
paraphrased verbalized answers. We perform experiments with five
sequence-to-sequence models on generating answer responses while maintaining
grammatical correctness. We additionally perform an error analysis that details
the rates of models' mispredictions in specified categories. Our proposed
dataset extended with answer verbalization is publicly available with detailed
documentation on its usage for wider utility.
Related papers
- PCoQA: Persian Conversational Question Answering Dataset [12.07607688189035]
The PCoQA dataset is a resource comprising information-seeking dialogs encompassing a total of 9,026 contextually-driven questions.
PCoQA is designed to present novel challenges compared to previous question answering datasets.
This paper not only presents the comprehensive PCoQA dataset but also reports the performance of various benchmark models.
arXiv Detail & Related papers (2023-12-07T15:29:34Z) - Semantic Parsing for Conversational Question Answering over Knowledge
Graphs [63.939700311269156]
We develop a dataset where user questions are annotated with Sparql parses and system answers correspond to execution results thereof.
We present two different semantic parsing approaches and highlight the challenges of the task.
Our dataset and models are released at https://github.com/Edinburgh/SPICE.
arXiv Detail & Related papers (2023-01-28T14:45:11Z) - Conversational QA Dataset Generation with Answer Revision [2.5838973036257458]
We introduce a novel framework that extracts question-worthy phrases from a passage and then generates corresponding questions considering previous conversations.
Our framework revises the extracted answers after generating questions so that answers exactly match paired questions.
arXiv Detail & Related papers (2022-09-23T04:05:38Z) - ConditionalQA: A Complex Reading Comprehension Dataset with Conditional
Answers [93.55268936974971]
We describe a Question Answering dataset that contains complex questions with conditional answers.
We call this dataset ConditionalQA.
We show that ConditionalQA is challenging for many of the existing QA models, especially in selecting answer conditions.
arXiv Detail & Related papers (2021-10-13T17:16:46Z) - VOGUE: Answer Verbalization through Multi-Task Learning [4.882444194224553]
We propose a multi-task-based answer verbalization framework: VOGUE (VerbalizationOuGh mUlti-task lEarning)
Our framework can generate results based on using questions and queries as inputs concurrently.
We evaluate our framework on existing datasets for answer verbalization, and it outperforms all current baselines on both BLEU and METEOR scores.
arXiv Detail & Related papers (2021-06-24T21:05:32Z) - QAConv: Question Answering on Informative Conversations [85.2923607672282]
We focus on informative conversations including business emails, panel discussions, and work channels.
In total, we collect 34,204 QA pairs, including span-based, free-form, and unanswerable questions.
arXiv Detail & Related papers (2021-05-14T15:53:05Z) - GooAQ: Open Question Answering with Diverse Answer Types [63.06454855313667]
We present GooAQ, a large-scale dataset with a variety of answer types.
This dataset contains over 5 million questions and 3 million answers collected from Google.
arXiv Detail & Related papers (2021-04-18T05:40:39Z) - ParaQA: A Question Answering Dataset with Paraphrase Responses for
Single-Turn Conversation [5.087932295628364]
ParaQA is a dataset with multiple paraphrased responses for single-turn conversation over knowledge graphs (KG)
The dataset was created using a semi-automated framework for generating diverse paraphrasing of the answers using techniques such as back-translation.
arXiv Detail & Related papers (2021-03-13T18:53:07Z) - Towards Data Distillation for End-to-end Spoken Conversational Question
Answering [65.124088336738]
We propose a new Spoken Conversational Question Answering task (SCQA)
SCQA aims at enabling QA systems to model complex dialogues flow given the speech utterances and text corpora.
Our main objective is to build a QA system to deal with conversational questions both in spoken and text forms.
arXiv Detail & Related papers (2020-10-18T05:53:39Z) - Inquisitive Question Generation for High Level Text Comprehension [60.21497846332531]
We introduce INQUISITIVE, a dataset of 19K questions that are elicited while a person is reading through a document.
We show that readers engage in a series of pragmatic strategies to seek information.
We evaluate question generation models based on GPT-2 and show that our model is able to generate reasonable questions.
arXiv Detail & Related papers (2020-10-04T19:03:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.