XQA-DST: Multi-Domain and Multi-Lingual Dialogue State Tracking
- URL: http://arxiv.org/abs/2204.05895v1
- Date: Tue, 12 Apr 2022 15:45:32 GMT
- Title: XQA-DST: Multi-Domain and Multi-Lingual Dialogue State Tracking
- Authors: Han Zhou, Ignacio Iacobacci, Pasquale Minervini
- Abstract summary: We propose a multi-domain and multi-lingual dialogue state tracker in a neural reading comprehension approach.
Our approach fills the slot values using span prediction, where the values are extracted from the dialogue itself.
We show its competitive transferability by zero-shot domain-adaptation experiments on MultiWOZ 2.1 with an average JGA of 31.6% for five domains.
- Score: 23.945407948894967
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In a task-oriented dialogue system, Dialogue State Tracking (DST) keeps track
of all important information by filling slots with values given through the
conversation. Existing methods generally rely on a predefined set of values and
struggle to generalise to previously unseen slots in new domains. In this
paper, we propose a multi-domain and multi-lingual dialogue state tracker in a
neural reading comprehension approach. Our approach fills the slot values using
span prediction, where the values are extracted from the dialogue itself. With
a novel training strategy and an independent domain classifier, empirical
results demonstrate that our model is a domain-scalable and open-vocabulary
model that achieves 53.2% Joint Goal Accuracy (JGA) on MultiWOZ 2.1. We show
its competitive transferability by zero-shot domain-adaptation experiments on
MultiWOZ 2.1 with an average JGA of 31.6% for five domains. In addition, it
achieves cross-lingual transfer with state-of-the-art zero-shot results, 64.9%
JGA from English to German and 68.6% JGA from English to Italian on WOZ 2.0.
Related papers
- PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment [58.46761798403072]
A model-based automatic dialogue evaluation metric (ADEM) is expected to perform well across multiple domains.
Despite significant progress, an ADEM that works well in one domain does not necessarily generalize to another.
We propose a Panel of Experts (PoE) network that consists of a shared transformer encoder and a collection of lightweight adapters.
arXiv Detail & Related papers (2022-12-18T02:26:50Z) - Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining
for Task-Oriented Dialog [67.20796950016735]
Multi2WOZ dataset spans four typologically diverse languages: Chinese, German, Arabic, and Russian.
We introduce a new framework for multilingual conversational specialization of pretrained language models (PrLMs) that aims to facilitate cross-lingual transfer for arbitrary downstream TOD tasks.
Our experiments show that, in most setups, the best performance entails the combination of (I) conversational specialization in the target language and (ii) few-shot transfer for the concrete TOD task.
arXiv Detail & Related papers (2022-05-20T18:35:38Z) - Zero-shot Generalization in Dialog State Tracking through Generative
Question Answering [10.81203437307028]
We introduce a novel framework that supports natural language queries for unseen constraints and slots in task-oriented dialogs.
Our approach is based on generative question-answering using a conditional domain model pre-trained on substantive English sentences.
arXiv Detail & Related papers (2021-01-20T21:47:20Z) - CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking [44.38388988238695]
A dialogue state tracker aims to accurately find a compact representation of the current dialogue status.
We employ a structured state representation and cast dialogue state tracking as a sequence generation problem.
Experiments demonstrate our tracker achieves encouraging joint goal accuracy for the five domains in MultiWOZ 2.0 and MultiWOZ 2.1 datasets.
arXiv Detail & Related papers (2020-09-22T10:27:18Z) - A Contextual Hierarchical Attention Network with Adaptive Objective for
Dialogue State Tracking [63.94927237189888]
We propose to enhance the dialogue state tracking (DST) through employing a contextual hierarchical attention network.
We also propose an adaptive objective to alleviate the slot imbalance problem by dynamically adjusting weights of different slots during training.
Experimental results show that our approach reaches 52.68% and 58.55% joint accuracy on MultiWOZ 2.0 and MultiWOZ 2.1 datasets.
arXiv Detail & Related papers (2020-06-02T12:25:44Z) - From Machine Reading Comprehension to Dialogue State Tracking: Bridging
the Gap [41.577548543163196]
We propose using machine reading comprehension (RC) in state tracking from two perspectives: model architectures and datasets.
Our method achieves near the current state-of-the-art in joint goal accuracy on MultiWOZ 2.1 given full training data.
arXiv Detail & Related papers (2020-04-13T09:00:03Z) - Few-shot Natural Language Generation for Task-Oriented Dialog [113.07438787659859]
We present FewShotWoz, the first NLG benchmark to simulate the few-shot learning setting in task-oriented dialog systems.
We develop the SC-GPT model, which is pre-trained on a large set of annotated NLG corpus to acquire the controllable generation ability.
Experiments on FewShotWoz and the large Multi-Domain-WOZ datasets show that the proposed SC-GPT significantly outperforms existing methods.
arXiv Detail & Related papers (2020-02-27T18:48:33Z) - CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue
Dataset [58.910961297314415]
CrossWOZ is the first large-scale Chinese Cross-Domain Wizard-of-Oz task-oriented dataset.
It contains 6K dialogue sessions and 102K utterances for 5 domains, including hotel, restaurant, attraction, metro, and taxi.
arXiv Detail & Related papers (2020-02-27T03:06:35Z) - Non-Autoregressive Dialog State Tracking [122.2328875457225]
We propose a novel framework of Non-Autoregressive Dialog State Tracking (NADST)
NADST can factor in potential dependencies among domains and slots to optimize the models towards better prediction of dialogue states as a complete set rather than separate slots.
Our results show that our model achieves the state-of-the-art joint accuracy across all domains on the MultiWOZ 2.1 corpus.
arXiv Detail & Related papers (2020-02-19T06:39:26Z) - MA-DST: Multi-Attention Based Scalable Dialog State Tracking [13.358314140896937]
Dialog State Tracking dialog agents provide a natural language interface for users to complete their goal.
To enable accurate multi-domain DST, the model needs to encode dependencies between past utterances and slot semantics.
We introduce a novel architecture for this task to encode the conversation history and slot semantics.
arXiv Detail & Related papers (2020-02-07T05:34:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.