Meta Dialogue Policy Learning
- URL: http://arxiv.org/abs/2006.02588v1
- Date: Wed, 3 Jun 2020 23:53:06 GMT
- Title: Meta Dialogue Policy Learning
- Authors: Yumo Xu, Chenguang Zhu, Baolin Peng and Michael Zeng
- Abstract summary: We propose Deep Transferable Q-Network (DTQN) to utilize shareable low-level signals between domains.
We decompose the state and action representation space into feature subspaces corresponding to these low-level components.
In experiments, our model outperforms baseline models in terms of both success rate and dialogue efficiency.
- Score: 58.045067703675095
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dialog policy determines the next-step actions for agents and hence is
central to a dialogue system. However, when migrated to novel domains with
little data, a policy model can fail to adapt due to insufficient interactions
with the new environment. We propose Deep Transferable Q-Network (DTQN) to
utilize shareable low-level signals between domains, such as dialogue acts and
slots. We decompose the state and action representation space into feature
subspaces corresponding to these low-level components to facilitate
cross-domain knowledge transfer. Furthermore, we embed DTQN in a meta-learning
framework and introduce Meta-DTQN with a dual-replay mechanism to enable
effective off-policy training and adaptation. In experiments, our model
outperforms baseline models in terms of both success rate and dialogue
efficiency on the multi-domain dialogue dataset MultiWOZ 2.0.
Related papers
- Plug-and-Play Policy Planner for Large Language Model Powered Dialogue
Agents [121.46051697742608]
We introduce a new dialogue policy planning paradigm to strategize dialogue problems with a tunable language model plug-in named PPDPP.
Specifically, we develop a novel training framework to facilitate supervised fine-tuning over available human-annotated data.
PPDPP consistently and substantially outperforms existing approaches on three different proactive dialogue applications.
arXiv Detail & Related papers (2023-11-01T03:20:16Z) - JoTR: A Joint Transformer and Reinforcement Learning Framework for
Dialog Policy Learning [53.83063435640911]
Dialogue policy learning (DPL) is a crucial component of dialogue modelling.
We introduce a novel framework, JoTR, to generate flexible dialogue actions.
Unlike traditional methods, JoTR formulates a word-level policy that allows for a more dynamic and adaptable dialogue action generation.
arXiv Detail & Related papers (2023-09-01T03:19:53Z) - Graph Neural Network Policies and Imitation Learning for Multi-Domain
Task-Oriented Dialogues [0.716879432974126]
Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans.
In practice, they may have to handle simultaneously several domains and tasks.
We show that structured policies based on graph neural networks combined with different degrees of imitation learning can effectively handle multi-domain dialogues.
arXiv Detail & Related papers (2022-10-11T08:29:10Z) - Manual-Guided Dialogue for Flexible Conversational Agents [84.46598430403886]
How to build and use dialogue data efficiently, and how to deploy models in different domains at scale can be critical issues in building a task-oriented dialogue system.
We propose a novel manual-guided dialogue scheme, where the agent learns the tasks from both dialogue and manuals.
Our proposed scheme reduces the dependence of dialogue models on fine-grained domain ontology, and makes them more flexible to adapt to various domains.
arXiv Detail & Related papers (2022-08-16T08:21:12Z) - Improving Multi-Party Dialogue Discourse Parsing via Domain Integration [25.805553277418813]
Multi-party conversations are implicitly organized by semantic level correlations across the interactive turns.
dialogue discourse analysis can be applied to predict the dependency structure and relations between the elementary discourse units.
Existing corpora with dialogue discourse annotation are collected from specific domains with limited sample sizes.
arXiv Detail & Related papers (2021-10-09T09:36:22Z) - Transferable Dialogue Systems and User Simulators [17.106518400787156]
One of the difficulties in training dialogue systems is the lack of training data.
We explore the possibility of creating dialogue data through the interaction between a dialogue system and a user simulator.
We develop a modelling framework that can incorporate new dialogue scenarios through self-play between the two agents.
arXiv Detail & Related papers (2021-07-25T22:59:09Z) - A Simple But Effective Approach to n-shot Task-Oriented Dialogue
Augmentation [32.43362825854633]
We introduce a framework that creates synthetic task-oriented dialogues in a fully automatic manner.
Our framework uses the simple idea that each turn-pair in a task-oriented dialogue has a certain function.
We observe significant improvements in the fine-tuning scenarios in several domains.
arXiv Detail & Related papers (2021-02-27T18:55:12Z) - Modeling Long Context for Task-Oriented Dialogue State Generation [51.044300192906995]
We propose a multi-task learning model with a simple yet effective utterance tagging technique and a bidirectional language model.
Our approaches attempt to solve the problem that the performance of the baseline significantly drops when the input dialogue context sequence is long.
In our experiments, our proposed model achieves a 7.03% relative improvement over the baseline, establishing a new state-of-the-art joint goal accuracy of 52.04% on the MultiWOZ 2.0 dataset.
arXiv Detail & Related papers (2020-04-29T11:02:25Z) - Non-Autoregressive Dialog State Tracking [122.2328875457225]
We propose a novel framework of Non-Autoregressive Dialog State Tracking (NADST)
NADST can factor in potential dependencies among domains and slots to optimize the models towards better prediction of dialogue states as a complete set rather than separate slots.
Our results show that our model achieves the state-of-the-art joint accuracy across all domains on the MultiWOZ 2.1 corpus.
arXiv Detail & Related papers (2020-02-19T06:39:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.