Show Us the Way: Learning to Manage Dialog from Demonstrations
- URL: http://arxiv.org/abs/2004.08114v1
- Date: Fri, 17 Apr 2020 08:41:54 GMT
- Title: Show Us the Way: Learning to Manage Dialog from Demonstrations
- Authors: Gabriel Gordon-Hall, Philip John Gorinski, Gerasimos Lampouras,
Ignacio Iacobacci
- Abstract summary: We present our submission to the End-to-End Multi-Domain Dialog Challenge Track of the Eighth Dialog System Technology Challenge.
Our proposed dialog system adopts a pipeline architecture, with distinct components for Natural Language Understanding, Dialog State Tracking, Dialog Management and Natural Language Generation.
At the core of our system is a reinforcement learning algorithm which uses Deep Q-learning from Demonstrations to learn a dialog policy with the help of expert examples.
- Score: 20.770386771370347
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present our submission to the End-to-End Multi-Domain Dialog Challenge
Track of the Eighth Dialog System Technology Challenge. Our proposed dialog
system adopts a pipeline architecture, with distinct components for Natural
Language Understanding, Dialog State Tracking, Dialog Management and Natural
Language Generation. At the core of our system is a reinforcement learning
algorithm which uses Deep Q-learning from Demonstrations to learn a dialog
policy with the help of expert examples. We find that demonstrations are
essential to training an accurate dialog policy where both state and action
spaces are large. Evaluation of our Dialog Management component shows that our
approach is effective - beating supervised and reinforcement learning
baselines.
Related papers
- OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue
System [43.92593448255296]
We propose an Omnipotent Dialogue pre-training model ( OmniDialog)
It unifies three dialogue tasks into a monolithic framework by multi-task learning, fostering inter-task communication.
We evaluate its performance across four tasks: dialogue summarization, end-to-end dialogue modeling, dialogue state tracking, and intent classification.
arXiv Detail & Related papers (2023-12-28T07:20:49Z) - Dialog-to-Actions: Building Task-Oriented Dialogue System via
Action-Level Generation [7.110201160927713]
We propose a task-oriented dialogue system via action-level generation.
Specifically, we first construct dialogue actions from large-scale dialogues and represent each natural language (NL) response as a sequence of dialogue actions.
We train a Sequence-to-Sequence model which takes the dialogue history as input and outputs sequence of dialogue actions.
arXiv Detail & Related papers (2023-04-03T11:09:20Z) - PK-Chat: Pointer Network Guided Knowledge Driven Generative Dialogue
Model [79.64376762489164]
PK-Chat is a Pointer network guided generative dialogue model, incorporating a unified pretrained language model and a pointer network over knowledge graphs.
The words generated by PK-Chat in the dialogue are derived from the prediction of word lists and the direct prediction of the external knowledge graph knowledge.
Based on the PK-Chat, a dialogue system is built for academic scenarios in the case of geosciences.
arXiv Detail & Related papers (2023-04-02T18:23:13Z) - Structure Extraction in Task-Oriented Dialogues with Slot Clustering [94.27806592467537]
In task-oriented dialogues, dialogue structure has often been considered as transition graphs among dialogue states.
We propose a simple yet effective approach for structure extraction in task-oriented dialogues.
arXiv Detail & Related papers (2022-02-28T20:18:12Z) - DSBERT:Unsupervised Dialogue Structure learning with BERT [4.171523157658394]
We propose a Bert-based unsupervised dialogue structure learning algorithm DSBERT (Dialogue Structure BERT)
Different from the previous SOTA models VRNN and SVRNN, we combine BERT and AutoEncoder, which can effectively combine context information.
Experimental results show that DSBERT can generate a dialogue structure closer to the real structure, can distinguish sentences with different semantics and map them to different hidden states.
arXiv Detail & Related papers (2021-11-09T03:31:18Z) - UniDS: A Unified Dialogue System for Chit-Chat and Task-oriented
Dialogues [59.499965460525694]
We propose a unified dialogue system (UniDS) with the two aforementioned skills.
We design a unified dialogue data schema, compatible for both chit-chat and task-oriented dialogues.
We train UniDS with mixed dialogue data from a pretrained chit-chat dialogue model.
arXiv Detail & Related papers (2021-10-15T11:56:47Z) - Integrating Pre-trained Model into Rule-based Dialogue Management [32.90885176553305]
Rule-based dialogue management is still the most popular solution for industrial task-oriented dialogue systems.
Data-driven dialogue systems, usually with end-to-end structures, are popular in academic research.
We propose a method to leverage the strength of both rule-based and data-driven dialogue managers.
arXiv Detail & Related papers (2021-02-17T03:44:22Z) - Distributed Structured Actor-Critic Reinforcement Learning for Universal
Dialogue Management [29.57382819573169]
We focus on devising a policy that chooses which dialogue action to respond to the user.
The sequential system decision-making process can be abstracted into a partially observable Markov decision process.
In the past few years, there are many deep reinforcement learning (DRL) algorithms, which use neural networks (NN) as function approximators.
arXiv Detail & Related papers (2020-09-22T05:39:31Z) - Rethinking Supervised Learning and Reinforcement Learning in
Task-Oriented Dialogue Systems [58.724629408229205]
We demonstrate how traditional supervised learning and a simulator-free adversarial learning method can be used to achieve performance comparable to state-of-the-art RL-based methods.
Our main goal is not to beat reinforcement learning with supervised learning, but to demonstrate the value of rethinking the role of reinforcement learning and supervised learning in optimizing task-oriented dialogue systems.
arXiv Detail & Related papers (2020-09-21T12:04:18Z) - Conversation Learner -- A Machine Teaching Tool for Building Dialog
Managers for Task-Oriented Dialog Systems [57.082447660944965]
Conversation Learner is a machine teaching tool for building dialog managers.
It enables dialog authors to create a dialog flow using familiar tools, converting the dialog flow into a parametric model.
It allows dialog authors to improve the dialog manager over time by leveraging user-system dialog logs as training data.
arXiv Detail & Related papers (2020-04-09T00:10:54Z) - Recent Advances and Challenges in Task-oriented Dialog System [63.82055978899631]
Task-oriented dialog systems are attracting more and more attention in academic and industrial communities.
We discuss three critical topics for task-oriented dialog systems: (1) improving data efficiency to facilitate dialog modeling in low-resource settings, (2) modeling multi-turn dynamics for dialog policy learning, and (3) integrating domain knowledge into the dialog model.
arXiv Detail & Related papers (2020-03-17T01:34:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.