A Student-Teacher Architecture for Dialog Domain Adaptation under the
Meta-Learning Setting
- URL: http://arxiv.org/abs/2104.02689v1
- Date: Tue, 6 Apr 2021 17:31:28 GMT
- Title: A Student-Teacher Architecture for Dialog Domain Adaptation under the
Meta-Learning Setting
- Authors: Kun Qian, Wei Wei, Zhou Yu
- Abstract summary: It is essential to develop algorithms that can adapt to different domains efficiently when building data-driven dialog models.
We propose an efficient domain adaptive task-oriented dialog system model, which incorporates a meta-teacher model.
We evaluate our model on two multi-domain datasets, MultiWOZ and Google-Guided Dialogue, and achieve state-of-the-art performance.
- Score: 42.80034363734555
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Numerous new dialog domains are being created every day while collecting data
for these domains is extremely costly since it involves human interactions.
Therefore, it is essential to develop algorithms that can adapt to different
domains efficiently when building data-driven dialog models. The most recent
researches on domain adaption focus on giving the model a better
initialization, rather than optimizing the adaptation process. We propose an
efficient domain adaptive task-oriented dialog system model, which incorporates
a meta-teacher model to emphasize the different impacts between generated
tokens with respect to the context. We first train our base dialog model and
meta-teacher model adversarially in a meta-learning setting on rich-resource
domains. The meta-teacher learns to quantify the importance of tokens under
different contexts across different domains. During adaptation, the
meta-teacher guides the dialog model to focus on important tokens in order to
achieve better adaptation efficiency. We evaluate our model on two multi-domain
datasets, MultiWOZ and Google Schema-Guided Dialogue, and achieve
state-of-the-art performance.
Related papers
- A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation [52.0964459842176]
Current state-of-the-art dialogue systems heavily rely on extensive training datasets.
We propose a novel data textbfAugmentation framework for textbfMulti-textbfDomain textbfDialogue textbfGeneration, referred to as textbfAMD$2$G.
The AMD$2$G framework consists of a data augmentation process and a two-stage training approach: domain-agnostic training and domain adaptation training.
arXiv Detail & Related papers (2024-06-14T09:52:27Z) - Schema Graph-Guided Prompt for Multi-Domain Dialogue State Tracking [16.955887768832046]
We propose a graph-based framework that learns domain-specific prompts by incorporating the dialogue schema.
Specifically, we embed domain-specific schema encoded by a graph neural network into the pre-trained language model.
Our experiments demonstrate that the proposed graph-based method outperforms other multi-domain DST approaches.
arXiv Detail & Related papers (2023-11-10T19:00:02Z) - Manual-Guided Dialogue for Flexible Conversational Agents [84.46598430403886]
How to build and use dialogue data efficiently, and how to deploy models in different domains at scale can be critical issues in building a task-oriented dialogue system.
We propose a novel manual-guided dialogue scheme, where the agent learns the tasks from both dialogue and manuals.
Our proposed scheme reduces the dependence of dialogue models on fine-grained domain ontology, and makes them more flexible to adapt to various domains.
arXiv Detail & Related papers (2022-08-16T08:21:12Z) - Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning [12.64591916699374]
Current generative-based dialogue systems fail to adapt to new unseen domains when only a small amount of target data is available.
We propose a method that adapts to unseen domains by combining both transfer and meta-learning.
arXiv Detail & Related papers (2021-02-22T16:16:57Z) - Meta-Context Transformers for Domain-Specific Response Generation [4.377737808397113]
We present DSRNet, a transformer-based model for dialogue response generation by reinforcing domain-specific attributes.
We study the use of DSRNet in a multi-turn multi-interlocutor environment for domain-specific response generation.
Our model shows significant improvement over the state-of-the-art for multi-turn dialogue systems supported by better BLEU and semantic similarity (BertScore) scores.
arXiv Detail & Related papers (2020-10-12T09:49:27Z) - Meta Dialogue Policy Learning [58.045067703675095]
We propose Deep Transferable Q-Network (DTQN) to utilize shareable low-level signals between domains.
We decompose the state and action representation space into feature subspaces corresponding to these low-level components.
In experiments, our model outperforms baseline models in terms of both success rate and dialogue efficiency.
arXiv Detail & Related papers (2020-06-03T23:53:06Z) - Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog [70.79442700890843]
We propose a novel Dynamic Fusion Network (DF-Net) which automatically exploit the relevance between the target domain and each domain.
With little training data, we show its transferability by outperforming prior best model by 13.9% on average.
arXiv Detail & Related papers (2020-04-23T08:17:22Z) - Variational Hierarchical Dialog Autoencoder for Dialog State Tracking
Data Augmentation [59.174903564894954]
In this work, we extend this approach to the task of dialog state tracking for goal-oriented dialogs.
We propose the Variational Hierarchical Dialog Autoencoder (VHDA) for modeling the complete aspects of goal-oriented dialogs.
Experiments on various dialog datasets show that our model improves the downstream dialog trackers' robustness via generative data augmentation.
arXiv Detail & Related papers (2020-01-23T15:34:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.