From Machine Reading Comprehension to Dialogue State Tracking: Bridging
the Gap
- URL: http://arxiv.org/abs/2004.05827v1
- Date: Mon, 13 Apr 2020 09:00:03 GMT
- Title: From Machine Reading Comprehension to Dialogue State Tracking: Bridging
the Gap
- Authors: Shuyang Gao, Sanchit Agarwal, Tagyoung Chung, Di Jin, Dilek
Hakkani-Tur
- Abstract summary: We propose using machine reading comprehension (RC) in state tracking from two perspectives: model architectures and datasets.
Our method achieves near the current state-of-the-art in joint goal accuracy on MultiWOZ 2.1 given full training data.
- Score: 41.577548543163196
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dialogue state tracking (DST) is at the heart of task-oriented dialogue
systems. However, the scarcity of labeled data is an obstacle to building
accurate and robust state tracking systems that work across a variety of
domains. Existing approaches generally require some dialogue data with state
information and their ability to generalize to unknown domains is limited. In
this paper, we propose using machine reading comprehension (RC) in state
tracking from two perspectives: model architectures and datasets. We divide the
slot types in dialogue state into categorical or extractive to borrow the
advantages from both multiple-choice and span-based reading comprehension
models. Our method achieves near the current state-of-the-art in joint goal
accuracy on MultiWOZ 2.1 given full training data. More importantly, by
leveraging machine reading comprehension datasets, our method outperforms the
existing approaches by many a large margin in few-shot scenarios when the
availability of in-domain data is limited. Lastly, even without any state
tracking data, i.e., zero-shot scenario, our proposed approach achieves greater
than 90% average slot accuracy in 12 out of 30 slots in MultiWOZ 2.1.
Related papers
- Diverse Retrieval-Augmented In-Context Learning for Dialogue State
Tracking [3.8073142980733]
We propose RefPyDST, which advances the state of the art with three advancements to in-context learning for dialogue state tracking.
First, we formulate DST as a Python programming task, explicitly modeling language coreference as variable reference in Python.
Second, since in-context learning depends highly on the context examples, we propose a method to retrieve a diverse set of relevant examples to improve performance.
arXiv Detail & Related papers (2023-07-04T03:15:52Z) - Which One Are You Referring To? Multimodal Object Identification in
Situated Dialogue [50.279206765971125]
We explore three methods to tackle the problem of interpreting multimodal inputs from conversational and situational contexts.
Our best method, scene-dialogue alignment, improves the performance by 20% F1-score compared to the SIMMC 2.1 baselines.
arXiv Detail & Related papers (2023-02-28T15:45:20Z) - Weakly Supervised Data Augmentation Through Prompting for Dialogue
Understanding [103.94325597273316]
We present a novel approach that iterates on augmentation quality by applying weakly-supervised filters.
We evaluate our methods on the emotion and act classification tasks in DailyDialog and the intent classification task in Facebook Multilingual Task-Oriented Dialogue.
For DailyDialog specifically, using 10% of the ground truth data we outperform the current state-of-the-art model which uses 100% of the data.
arXiv Detail & Related papers (2022-10-25T17:01:30Z) - Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation [70.81596088969378]
Cross-lingual Outline-based Dialogue dataset (termed COD) enables natural language understanding.
COD enables dialogue state tracking, and end-to-end dialogue modelling and evaluation in 4 diverse languages.
arXiv Detail & Related papers (2022-01-31T18:11:21Z) - Slot Self-Attentive Dialogue State Tracking [22.187581131353948]
We propose a slot self-attention mechanism that can learn the slot correlations automatically.
We conduct comprehensive experiments on two multi-domain task-oriented dialogue datasets.
arXiv Detail & Related papers (2021-01-22T22:48:51Z) - CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking [44.38388988238695]
A dialogue state tracker aims to accurately find a compact representation of the current dialogue status.
We employ a structured state representation and cast dialogue state tracking as a sequence generation problem.
Experiments demonstrate our tracker achieves encouraging joint goal accuracy for the five domains in MultiWOZ 2.0 and MultiWOZ 2.1 datasets.
arXiv Detail & Related papers (2020-09-22T10:27:18Z) - A Contextual Hierarchical Attention Network with Adaptive Objective for
Dialogue State Tracking [63.94927237189888]
We propose to enhance the dialogue state tracking (DST) through employing a contextual hierarchical attention network.
We also propose an adaptive objective to alleviate the slot imbalance problem by dynamically adjusting weights of different slots during training.
Experimental results show that our approach reaches 52.68% and 58.55% joint accuracy on MultiWOZ 2.0 and MultiWOZ 2.1 datasets.
arXiv Detail & Related papers (2020-06-02T12:25:44Z) - TripPy: A Triple Copy Strategy for Value Independent Neural Dialog State
Tracking [2.78632567955797]
Task-oriented dialog systems rely on dialog state tracking (DST) to monitor the user's goal during an interaction.
We present a new approach to DST which makes use of various copy mechanisms to fill slots with values.
arXiv Detail & Related papers (2020-05-06T14:52:48Z) - Non-Autoregressive Dialog State Tracking [122.2328875457225]
We propose a novel framework of Non-Autoregressive Dialog State Tracking (NADST)
NADST can factor in potential dependencies among domains and slots to optimize the models towards better prediction of dialogue states as a complete set rather than separate slots.
Our results show that our model achieves the state-of-the-art joint accuracy across all domains on the MultiWOZ 2.1 corpus.
arXiv Detail & Related papers (2020-02-19T06:39:26Z) - MA-DST: Multi-Attention Based Scalable Dialog State Tracking [13.358314140896937]
Dialog State Tracking dialog agents provide a natural language interface for users to complete their goal.
To enable accurate multi-domain DST, the model needs to encode dependencies between past utterances and slot semantics.
We introduce a novel architecture for this task to encode the conversation history and slot semantics.
arXiv Detail & Related papers (2020-02-07T05:34:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.