Automated Utterance Generation
- URL: http://arxiv.org/abs/2004.03484v2
- Date: Wed, 8 Apr 2020 01:27:09 GMT
- Title: Automated Utterance Generation
- Authors: Soham Parikh, Quaizar Vohra, Mitul Tiwari
- Abstract summary: Using relevant utterances as features in question-answering has shown to improve both the precision and recall for retrieving the right answer by a conversational assistant.
We propose an utterance generation system which 1) uses extractive summarization to extract important sentences from the description, 2) uses multiple paraphrasing techniques to generate a diverse set of paraphrases of the title and summary sentences, and 3) selects good candidate paraphrases with the help of a novel candidate selection algorithm.
- Score: 5.220940151628735
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Conversational AI assistants are becoming popular and question-answering is
an important part of any conversational assistant. Using relevant utterances as
features in question-answering has shown to improve both the precision and
recall for retrieving the right answer by a conversational assistant. Hence,
utterance generation has become an important problem with the goal of
generating relevant utterances (sentences or phrases) from a knowledge base
article that consists of a title and a description. However, generating good
utterances usually requires a lot of manual effort, creating the need for an
automated utterance generation. In this paper, we propose an utterance
generation system which 1) uses extractive summarization to extract important
sentences from the description, 2) uses multiple paraphrasing techniques to
generate a diverse set of paraphrases of the title and summary sentences, and
3) selects good candidate paraphrases with the help of a novel candidate
selection algorithm.
Related papers
- Automated Generation of Multiple-Choice Cloze Questions for Assessing
English Vocabulary Using GPT-turbo 3.5 [5.525336037820985]
We evaluate a new method for automatically generating multiple-choice questions using large language models (LLM)
The VocaTT engine is written in Python and comprises three basic steps: pre-processing target word lists, generating sentences and candidate word options, and finally selecting suitable word options.
Results showed a 75% rate of well-formedness for sentences and 66.85% rate for suitable word options.
arXiv Detail & Related papers (2024-03-04T14:24:47Z) - Improve Retrieval-based Dialogue System via Syntax-Informed Attention [46.79601705850277]
We propose SIA, Syntax-Informed Attention, considering both intra- and inter-sentence syntax information.
We evaluate our method on three widely used benchmarks and experimental results demonstrate the general superiority of our method on dialogue response selection.
arXiv Detail & Related papers (2023-03-12T08:14:16Z) - Question-Interlocutor Scope Realized Graph Modeling over Key Utterances
for Dialogue Reading Comprehension [61.55950233402972]
We propose a new key utterances extracting method for dialogue reading comprehension.
It performs prediction on the unit formed by several contiguous utterances, which can realize more answer-contained utterances.
As a graph constructed on the text of utterances, we then propose Question-Interlocutor Scope Realized Graph (QuISG) modeling.
arXiv Detail & Related papers (2022-10-26T04:00:42Z) - 'John ate 5 apples' != 'John ate some apples': Self-Supervised
Paraphrase Quality Detection for Algebraic Word Problems [5.682665111938764]
This paper introduces the novel task of scoring paraphrases for Algebraic Word Problems (AWP)
We propose ParaQD, a self-supervised paraphrase quality detection method using novel data augmentations.
Our method outperforms existing state-of-the-art self-supervised methods by up to 32% while also demonstrating impressive zero-shot performance.
arXiv Detail & Related papers (2022-06-16T16:01:59Z) - Few-Shot Bot: Prompt-Based Learning for Dialogue Systems [58.27337673451943]
Learning to converse using only a few examples is a great challenge in conversational AI.
The current best conversational models are either good chit-chatters (e.g., BlenderBot) or goal-oriented systems (e.g., MinTL)
We propose prompt-based few-shot learning which does not require gradient-based fine-tuning but instead uses a few examples as the only source of learning.
arXiv Detail & Related papers (2021-10-15T14:36:45Z) - Towards Document-Level Paraphrase Generation with Sentence Rewriting and
Reordering [88.08581016329398]
We propose CoRPG (Coherence Relationship guided Paraphrase Generation) for document-level paraphrase generation.
We use graph GRU to encode the coherence relationship graph and get the coherence-aware representation for each sentence.
Our model can generate document paraphrase with more diversity and semantic preservation.
arXiv Detail & Related papers (2021-09-15T05:53:40Z) - An Automated Multiple-Choice Question Generation Using Natural Language
Processing Techniques [0.913755431537592]
We present an NLP-based system for automatic multiple-choice question generation (MCQG) for Computer-Based Testing Examination (CBTE)
We used NLP technique to extract keywords that are important words in a given lesson material.
To validate that the system is not perverse, five lesson materials were used to check the effectiveness and efficiency of the system.
arXiv Detail & Related papers (2021-03-26T22:39:59Z) - Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and
Context-Aware Auto-Encoders [59.038157066874255]
We propose a novel framework called RankAE to perform chat summarization without employing manually labeled data.
RankAE consists of a topic-oriented ranking strategy that selects topic utterances according to centrality and diversity simultaneously.
A denoising auto-encoder is designed to generate succinct but context-informative summaries based on the selected utterances.
arXiv Detail & Related papers (2020-12-14T07:31:17Z) - Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term
Importance Estimation and Neural Query Rewriting [56.268862325167575]
We tackle conversational passage retrieval (ConvPR) with query reformulation integrated into a multi-stage ad-hoc IR system.
We propose two conversational query reformulation (CQR) methods: (1) term importance estimation and (2) neural query rewriting.
For the former, we expand conversational queries using important terms extracted from the conversational context with frequency-based signals.
For the latter, we reformulate conversational queries into natural, standalone, human-understandable queries with a pretrained sequence-tosequence model.
arXiv Detail & Related papers (2020-05-05T14:30:20Z) - Knowledgeable Dialogue Reading Comprehension on Key Turns [84.1784903043884]
Multi-choice machine reading comprehension (MRC) requires models to choose the correct answer from candidate options given a passage and a question.
Our research focuses dialogue-based MRC, where the passages are multi-turn dialogues.
It suffers from two challenges, the answer selection decision is made without support of latently helpful commonsense, and the multi-turn context may hide considerable irrelevant information.
arXiv Detail & Related papers (2020-04-29T07:04:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.