DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
- URL: http://arxiv.org/abs/2212.10557v2
- Date: Sun, 21 May 2023 19:38:01 GMT
- Title: DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
- Authors: Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella,
Sijia Liu, Patrick Lange, Julia Hirschberg, Dilek Hakkani-Tur
- Abstract summary: We introduce DialGuide, a framework for controlling dialogue model behavior using natural language rules.
Our dataset contains 10,737 positive and 15,467 negative dialogue context-response-guideline triplets across two domains - chit-chat and safety.
- Score: 48.780256371992515
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Dialogue models are able to generate coherent and fluent responses, but they
can still be challenging to control and may produce non-engaging, unsafe
results. This unpredictability diminishes user trust and can hinder the use of
the models in the real world. To address this, we introduce DialGuide, a novel
framework for controlling dialogue model behavior using natural language rules,
or guidelines. These guidelines provide information about the context they are
applicable to and what should be included in the response, allowing the models
to generate responses that are more closely aligned with the developer's
expectations and intent. We evaluate DialGuide on three tasks in open-domain
dialogue response generation: guideline selection, response generation, and
response entailment verification. Our dataset contains 10,737 positive and
15,467 negative dialogue context-response-guideline triplets across two domains
- chit-chat and safety. We provide baseline models for the tasks and benchmark
their performance. We also demonstrate that DialGuide is effective in the
dialogue safety domain, producing safe and engaging responses that follow
developer guidelines.
Related papers
- DiactTOD: Learning Generalizable Latent Dialogue Acts for Controllable
Task-Oriented Dialogue Systems [15.087619144902776]
We present a novel end-to-end latent dialogue act model (DiactTOD) that represents dialogue acts in a latent space.
When pre-trained on a large corpus, DiactTOD is able to predict and control dialogue acts to generate controllable responses.
arXiv Detail & Related papers (2023-08-01T23:29:16Z) - Controllable Mixed-Initiative Dialogue Generation through Prompting [50.03458333265885]
Mixed-initiative dialogue tasks involve repeated exchanges of information and conversational control.
Agents gain control by generating responses that follow particular dialogue intents or strategies, prescribed by a policy planner.
Standard approach has been fine-tuning pre-trained language models to perform generation conditioned on these intents.
We instead prompt large language models as a drop-in replacement to fine-tuning on conditional generation.
arXiv Detail & Related papers (2023-05-06T23:11:25Z) - Manual-Guided Dialogue for Flexible Conversational Agents [84.46598430403886]
How to build and use dialogue data efficiently, and how to deploy models in different domains at scale can be critical issues in building a task-oriented dialogue system.
We propose a novel manual-guided dialogue scheme, where the agent learns the tasks from both dialogue and manuals.
Our proposed scheme reduces the dependence of dialogue models on fine-grained domain ontology, and makes them more flexible to adapt to various domains.
arXiv Detail & Related papers (2022-08-16T08:21:12Z) - WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation
for Multi-turn Dialogue [17.663449579168297]
We simulate a dialogue between an agent and a user (modelled similar to an agent with supervised learning objective) to interact with each other.
The agent uses dynamic blocking to generate ranked diverse responses and exploration-exploitation to select among the Top-K responses.
Empirical studies with two benchmarks indicate that our model can significantly out-perform the response quality and lead to a successful conversation.
arXiv Detail & Related papers (2021-08-01T08:00:45Z) - The Adapter-Bot: All-In-One Controllable Conversational Model [66.48164003532484]
We propose a dialogue model that uses a fixed backbone model such as DialGPT and triggers on-demand dialogue skills via different adapters.
Depending on the skills, the model is able to process multiple knowledge types, such as text, tables, and emphatic responses.
We evaluate our model using automatic evaluation by comparing it with existing state-of-the-art conversational models.
arXiv Detail & Related papers (2020-08-28T10:59:31Z) - Controlling Dialogue Generation with Semantic Exemplars [55.460082747572734]
We present an Exemplar-based Dialogue Generation model, EDGE, that uses the semantic frames present in exemplar responses to guide generation.
We show that controlling dialogue generation based on the semantic frames of exemplars, rather than words in the exemplar itself, improves the coherence of generated responses.
arXiv Detail & Related papers (2020-08-20T17:02:37Z) - Policy-Driven Neural Response Generation for Knowledge-Grounded Dialogue
Systems [18.375851346138155]
Seq2seq neural response generation approaches do not have explicit mechanisms to control the content or style of the generated response.
We propose using a dialogue policy to plan the content and style of target responses in the form of an action plan.
We demonstrate that a basic dialogue policy that operates at the sentence level generates better responses in comparison to turn level generation.
arXiv Detail & Related papers (2020-05-26T06:09:57Z) - A Controllable Model of Grounded Response Generation [122.7121624884747]
Current end-to-end neural conversation models inherently lack the flexibility to impose semantic control in the response generation process.
We propose a framework that we call controllable grounded response generation (CGRG)
We show that using this framework, a transformer based model with a novel inductive attention mechanism, trained on a conversation-like Reddit dataset, outperforms strong generation baselines.
arXiv Detail & Related papers (2020-05-01T21:22:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.