DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service
Chatlog
- URL: http://arxiv.org/abs/2212.07112v1
- Date: Wed, 14 Dec 2022 09:05:14 GMT
- Title: DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service
Chatlog
- Authors: Xin Zheng, Tianyu Liu, Haoran Meng, Xu Wang, Yufan Jiang, Mengliang
Rao, Binghuai Lin, Zhifang Sui, Yunbo Cao
- Abstract summary: We propose N-to-N QA extraction task in which the derived questions and corresponding answers might be separated across different utterances.
We introduce a suite of generative/discriminative tagging based methods with end-to-end and two-stage variants that perform well on 5 customer service datasets.
- Score: 34.69426306212259
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Harvesting question-answer (QA) pairs from customer service chatlog in the
wild is an efficient way to enrich the knowledge base for customer service
chatbots in the cold start or continuous integration scenarios. Prior work
attempts to obtain 1-to-1 QA pairs from growing customer service chatlog, which
fails to integrate the incomplete utterances from the dialog context for
composite QA retrieval. In this paper, we propose N-to-N QA extraction task in
which the derived questions and corresponding answers might be separated across
different utterances. We introduce a suite of generative/discriminative tagging
based methods with end-to-end and two-stage variants that perform well on 5
customer service datasets and for the first time setup a benchmark for N-to-N
DialogQAE with utterance and session level evaluation metrics. With a deep dive
into extracted QA pairs, we find that the relations between and inside the QA
pairs can be indicators to analyze the dialogue structure, e.g. information
seeking, clarification, barge-in and elaboration. We also show that the
proposed models can adapt to different domains and languages, and reduce the
labor cost of knowledge accumulation in the real-world product dialogue
platform.
Related papers
- Q-TOD: A Query-driven Task-oriented Dialogue System [33.18698942938547]
We introduce a novel query-driven task-oriented dialogue system, namely Q-TOD.
The essential information from the dialogue context is extracted into a query, which is further employed to retrieve relevant knowledge records for response generation.
To evaluate the effectiveness of the proposed Q-TOD, we collect query annotations for three publicly available task-oriented dialogue datasets.
arXiv Detail & Related papers (2022-10-14T06:38:19Z) - KETOD: Knowledge-Enriched Task-Oriented Dialogue [77.59814785157877]
Existing studies in dialogue system research mostly treat task-oriented dialogue and chit-chat as separate domains.
We investigate how task-oriented dialogue and knowledge-grounded chit-chat can be effectively integrated into a single model.
arXiv Detail & Related papers (2022-05-11T16:01:03Z) - End-to-end Spoken Conversational Question Answering: Task, Dataset and
Model [92.18621726802726]
In spoken question answering, the systems are designed to answer questions from contiguous text spans within the related speech transcripts.
We propose a new Spoken Conversational Question Answering task (SCQA), aiming at enabling the systems to model complex dialogue flows.
Our main objective is to build the system to deal with conversational questions based on the audio recordings, and to explore the plausibility of providing more cues from different modalities with systems in information gathering.
arXiv Detail & Related papers (2022-04-29T17:56:59Z) - QAConv: Question Answering on Informative Conversations [85.2923607672282]
We focus on informative conversations including business emails, panel discussions, and work channels.
In total, we collect 34,204 QA pairs, including span-based, free-form, and unanswerable questions.
arXiv Detail & Related papers (2021-05-14T15:53:05Z) - Towards Data Distillation for End-to-end Spoken Conversational Question
Answering [65.124088336738]
We propose a new Spoken Conversational Question Answering task (SCQA)
SCQA aims at enabling QA systems to model complex dialogues flow given the speech utterances and text corpora.
Our main objective is to build a QA system to deal with conversational questions both in spoken and text forms.
arXiv Detail & Related papers (2020-10-18T05:53:39Z) - Matching Questions and Answers in Dialogues from Online Forums [12.64602629459043]
Matching question-answer relations between two turns in conversations is not only the first step in analyzing dialogue structures, but also valuable for training dialogue systems.
This paper presents a QA matching model considering both distance information and dialogue history by two simultaneous attention mechanisms called mutual attention.
arXiv Detail & Related papers (2020-05-19T08:18:52Z) - Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term
Importance Estimation and Neural Query Rewriting [56.268862325167575]
We tackle conversational passage retrieval (ConvPR) with query reformulation integrated into a multi-stage ad-hoc IR system.
We propose two conversational query reformulation (CQR) methods: (1) term importance estimation and (2) neural query rewriting.
For the former, we expand conversational queries using important terms extracted from the conversational context with frequency-based signals.
For the latter, we reformulate conversational queries into natural, standalone, human-understandable queries with a pretrained sequence-tosequence model.
arXiv Detail & Related papers (2020-05-05T14:30:20Z) - Conversations with Search Engines: SERP-based Conversational Response
Generation [77.1381159789032]
We create a suitable dataset, the Search as a Conversation (SaaC) dataset, for the development of pipelines for conversations with search engines.
We also develop a state-of-the-art pipeline for conversations with search engines, the Conversations with Search Engines (CaSE) using this dataset.
CaSE enhances the state-of-the-art by introducing a supporting token identification module and aprior-aware pointer generator.
arXiv Detail & Related papers (2020-04-29T13:07:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.