Controlling Style in Generated Dialogue
- URL: http://arxiv.org/abs/2009.10855v1
- Date: Tue, 22 Sep 2020 23:21:04 GMT
- Title: Controlling Style in Generated Dialogue
- Authors: Eric Michael Smith, Diana Gonzalez-Rico, Emily Dinan, Y-Lan Boureau
- Abstract summary: We adapt three previously proposed controllable generation architectures to open-domain dialogue generation.
We control the style of the generation to match one among about 200 possible styles.
We show how they can be used to provide insights into existing conversational datasets.
- Score: 13.445455480452484
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Open-domain conversation models have become good at generating
natural-sounding dialogue, using very large architectures with billions of
trainable parameters. The vast training data required to train these
architectures aggregates many different styles, tones, and qualities. Using
that data to train a single model makes it difficult to use the model as a
consistent conversational agent, e.g. with a stable set of persona traits and a
typical style of expression. Several architectures affording control mechanisms
over generation architectures have been proposed, each with different
trade-offs. However, it remains unclear whether their use in dialogue is
viable, and what the trade-offs look like with the most recent state-of-the-art
conversational architectures. In this work, we adapt three previously proposed
controllable generation architectures to open-domain dialogue generation,
controlling the style of the generation to match one among about 200 possible
styles. We compare their respective performance and tradeoffs, and show how
they can be used to provide insights into existing conversational datasets, and
generate a varied set of styled conversation replies.
Related papers
- Manual-Guided Dialogue for Flexible Conversational Agents [84.46598430403886]
How to build and use dialogue data efficiently, and how to deploy models in different domains at scale can be critical issues in building a task-oriented dialogue system.
We propose a novel manual-guided dialogue scheme, where the agent learns the tasks from both dialogue and manuals.
Our proposed scheme reduces the dependence of dialogue models on fine-grained domain ontology, and makes them more flexible to adapt to various domains.
arXiv Detail & Related papers (2022-08-16T08:21:12Z) - Building a Role Specified Open-Domain Dialogue System Leveraging
Large-Scale Language Models [15.062014096238803]
We study the challenge of imposing roles on open-domain dialogue systems.
We propose an efficient data collection framework for building role-satisfying dialogue dataset from scratch.
Our models return few out-of-bounds utterances, keeping competitive performance on general metrics.
arXiv Detail & Related papers (2022-04-30T06:23:06Z) - Stylistic Retrieval-based Dialogue System with Unparallel Training Data [19.777894827625275]
We propose a flexible framework that adapts a generic retrieval-based dialogue system to mimic the language style of a specified persona without any parallel data.
Our approach is based on automatic generation of stylized data by learning the usage of jargon, and then rewriting the generic conversations to a stylized one by incorporating the jargon.
arXiv Detail & Related papers (2021-09-12T09:56:24Z) - The Adapter-Bot: All-In-One Controllable Conversational Model [66.48164003532484]
We propose a dialogue model that uses a fixed backbone model such as DialGPT and triggers on-demand dialogue skills via different adapters.
Depending on the skills, the model is able to process multiple knowledge types, such as text, tables, and emphatic responses.
We evaluate our model using automatic evaluation by comparing it with existing state-of-the-art conversational models.
arXiv Detail & Related papers (2020-08-28T10:59:31Z) - Ranking Enhanced Dialogue Generation [77.8321855074999]
How to effectively utilize the dialogue history is a crucial problem in multi-turn dialogue generation.
Previous works usually employ various neural network architectures to model the history.
This paper proposes a Ranking Enhanced Dialogue generation framework.
arXiv Detail & Related papers (2020-08-13T01:49:56Z) - Paraphrase Augmented Task-Oriented Dialog Generation [68.1790912977053]
We propose a paraphrase augmented response generation (PARG) framework that jointly trains a paraphrase model and a response generation model.
We also design a method to automatically construct paraphrase training data set based on dialog state and dialog act labels.
arXiv Detail & Related papers (2020-04-16T05:12:36Z) - EmpTransfo: A Multi-head Transformer Architecture for Creating
Empathetic Dialog Systems [4.41738804598711]
This paper presents EmpTransfo, a multi-head Transformer architecture for creating an empathetic dialog system.
We show that utilizing the history of emotions and other metadata can improve the quality of generated conversations.
arXiv Detail & Related papers (2020-03-05T23:09:24Z) - Variational Hierarchical Dialog Autoencoder for Dialog State Tracking
Data Augmentation [59.174903564894954]
In this work, we extend this approach to the task of dialog state tracking for goal-oriented dialogs.
We propose the Variational Hierarchical Dialog Autoencoder (VHDA) for modeling the complete aspects of goal-oriented dialogs.
Experiments on various dialog datasets show that our model improves the downstream dialog trackers' robustness via generative data augmentation.
arXiv Detail & Related papers (2020-01-23T15:34:56Z) - Modality-Balanced Models for Visual Dialogue [102.35406085738325]
The Visual Dialog task requires a model to exploit both image and conversational context information to generate the next response to the dialogue.
We show that previous joint-modality (history and image) models over-rely on and are more prone to memorizing the dialogue history.
We present methods for this integration of the two models, via ensemble and consensus dropout fusion with shared parameters.
arXiv Detail & Related papers (2020-01-17T14:57:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.