Slot Self-Attentive Dialogue State Tracking
- URL: http://arxiv.org/abs/2101.09374v1
- Date: Fri, 22 Jan 2021 22:48:51 GMT
- Title: Slot Self-Attentive Dialogue State Tracking
- Authors: Fanghua Ye, Jarana Manotumruksa, Qiang Zhang, Shenghui Li, Emine
Yilmaz
- Abstract summary: We propose a slot self-attention mechanism that can learn the slot correlations automatically.
We conduct comprehensive experiments on two multi-domain task-oriented dialogue datasets.
- Score: 22.187581131353948
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: An indispensable component in task-oriented dialogue systems is the dialogue
state tracker, which keeps track of users' intentions in the course of
conversation. The typical approach towards this goal is to fill in multiple
pre-defined slots that are essential to complete the task. Although various
dialogue state tracking methods have been proposed in recent years, most of
them predict the value of each slot separately and fail to consider the
correlations among slots. In this paper, we propose a slot self-attention
mechanism that can learn the slot correlations automatically. Specifically, a
slot-token attention is first utilized to obtain slot-specific features from
the dialogue context. Then a stacked slot self-attention is applied on these
features to learn the correlations among slots. We conduct comprehensive
experiments on two multi-domain task-oriented dialogue datasets, including
MultiWOZ 2.0 and MultiWOZ 2.1. The experimental results demonstrate that our
approach achieves state-of-the-art performance on both datasets, verifying the
necessity and effectiveness of taking slot correlations into consideration.
Related papers
- Beyond the Granularity: Multi-Perspective Dialogue Collaborative
Selection for Dialogue State Tracking [18.172993687706708]
In dialogue state tracking, dialogue history is a crucial material, and its utilization varies between different models.
We propose DiCoS-DST to dynamically select the relevant dialogue contents corresponding to each slot for state updating.
Our approach achieves new state-of-the-art performance on MultiWOZ 2.1 and MultiWOZ 2.2, and achieves superior performance on multiple mainstream benchmark datasets.
arXiv Detail & Related papers (2022-05-20T10:08:45Z) - LUNA: Learning Slot-Turn Alignment for Dialogue State Tracking [21.80577241399013]
Dialogue state tracking (DST) aims to predict the current dialogue state given the dialogue history.
Existing methods generally exploit the utterances of all dialogue turns to assign value for each slot.
We propose LUNA, a sLot-tUrN Alignment enhanced approach.
It first explicitly aligns each slot with its most relevant utterance, then further predicts the corresponding value based on this aligned utterance instead of all dialogue utterances.
arXiv Detail & Related papers (2022-05-05T10:18:23Z) - A Universality-Individuality Integration Model for Dialog Act
Classification [0.0]
Dialog Act (DA) reveals the general intent of the speaker utterance in a conversation.
This paper suggests that word cues, part-of-speech cues and statistical cues can complement each other to improve the basis for recognition.
We propose a novel model based on universality and individuality strategies, called Universality-Individuality Integration Model (UIIM)
arXiv Detail & Related papers (2022-04-13T06:05:34Z) - Ontology-Enhanced Slot Filling [3.024095356755364]
In multi-domain task-oriented dialog system, user utterances and system responses may mention multiple named entities and attributes values.
A system needs to select those that are confirmed by the user and fill them into destined slots.
One difficulty is that since a dialogue session contains multiple system-user turns, feeding in all the tokens into a deep model such as BERT can be challenging due to limited capacity of input word tokens and GPU memory.
arXiv Detail & Related papers (2021-08-25T14:54:47Z) - Interactive Fusion of Multi-level Features for Compositional Activity
Recognition [100.75045558068874]
We present a novel framework that accomplishes this goal by interactive fusion.
We implement the framework in three steps, namely, positional-to-appearance feature extraction, semantic feature interaction, and semantic-to-positional prediction.
We evaluate our approach on two action recognition datasets, Something-Something and Charades.
arXiv Detail & Related papers (2020-12-10T14:17:18Z) - DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act
Recognition and Sentiment Classification [77.59549450705384]
In dialog system, dialog act recognition and sentiment classification are two correlative tasks.
Most of the existing systems either treat them as separate tasks or just jointly model the two tasks.
We propose a Deep Co-Interactive Relation Network (DCR-Net) to explicitly consider the cross-impact and model the interaction between the two tasks.
arXiv Detail & Related papers (2020-08-16T14:13:32Z) - Point or Generate Dialogue State Tracker [0.0]
We propose the Point-Or-Generate Dialogue State Tracker (POGD)
POGD points out explicitly expressed slot values from the user's utterance, and generates implicitly expressed ones based on slot-specific contexts.
Experiments show that POGD not only obtains state-of-the-art results on both WoZ 2.0 and MultiWoZ 2.0 datasets but also has good generalization on unseen values and new slots.
arXiv Detail & Related papers (2020-08-08T02:15:25Z) - A Contextual Hierarchical Attention Network with Adaptive Objective for
Dialogue State Tracking [63.94927237189888]
We propose to enhance the dialogue state tracking (DST) through employing a contextual hierarchical attention network.
We also propose an adaptive objective to alleviate the slot imbalance problem by dynamically adjusting weights of different slots during training.
Experimental results show that our approach reaches 52.68% and 58.55% joint accuracy on MultiWOZ 2.0 and MultiWOZ 2.1 datasets.
arXiv Detail & Related papers (2020-06-02T12:25:44Z) - Rethinking Dialogue State Tracking with Reasoning [76.0991910623001]
This paper proposes to track dialogue states gradually with reasoning over dialogue turns with the help of the back-end data.
Empirical results demonstrate that our method significantly outperforms the state-of-the-art methods by 38.6% in terms of joint belief accuracy for MultiWOZ 2.1.
arXiv Detail & Related papers (2020-05-27T02:05:33Z) - AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent
Detection and Slot Filling [69.59096090788125]
In this paper, we propose an Adaptive Graph-Interactive Framework (AGIF) for joint multiple intent detection and slot filling.
We introduce an intent-slot graph interaction layer to model the strong correlation between the slot and intents.
Such an interaction layer is applied to each token adaptively, which has the advantage to automatically extract the relevant intents information.
arXiv Detail & Related papers (2020-04-21T15:07:34Z) - Non-Autoregressive Dialog State Tracking [122.2328875457225]
We propose a novel framework of Non-Autoregressive Dialog State Tracking (NADST)
NADST can factor in potential dependencies among domains and slots to optimize the models towards better prediction of dialogue states as a complete set rather than separate slots.
Our results show that our model achieves the state-of-the-art joint accuracy across all domains on the MultiWOZ 2.1 corpus.
arXiv Detail & Related papers (2020-02-19T06:39:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.