Hierarchical Modeling for Out-of-Scope Domain and Intent Classification
- URL: http://arxiv.org/abs/2104.14781v1
- Date: Fri, 30 Apr 2021 06:38:23 GMT
- Title: Hierarchical Modeling for Out-of-Scope Domain and Intent Classification
- Authors: Pengfei Liu, Kun Li and Helen Meng
- Abstract summary: This paper focuses on out-of-scope intent classification in dialog systems.
We propose a hierarchical multi-task learning approach based on a joint model to classify domain and intent simultaneously.
Experiments show that the model outperforms existing methods in terms of accuracy, out-of-scope recall and F1.
- Score: 55.23920796595698
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: User queries for a real-world dialog system may sometimes fall outside the
scope of the system's capabilities, but appropriate system responses will
enable smooth processing throughout the human-computer interaction. This paper
is concerned with the user's intent, and focuses on out-of-scope intent
classification in dialog systems. Although user intents are highly correlated
with the application domain, few studies have exploited such correlations for
intent classification. Rather than developing a two-stage approach that first
classifies the domain and then the intent, we propose a hierarchical multi-task
learning approach based on a joint model to classify domain and intent
simultaneously. Novelties in the proposed approach include: (1) sharing
supervised out-of-scope signals in joint modeling of domain and intent
classification to replace a two-stage pipeline; and (2) introducing a
hierarchical model that learns the intent and domain representations in the
higher and lower layers respectively. Experiments show that the model
outperforms existing methods in terms of accuracy, out-of-scope recall and F1.
Additionally, threshold-based post-processing further improves performance by
balancing precision and recall in intent classification.
Related papers
- The Overlooked Classifier in Human-Object Interaction Recognition [82.20671129356037]
We encode the semantic correlation among classes into the classification head by initializing the weights with language embeddings of HOIs.
We propose a new loss named LSE-Sign to enhance multi-label learning on a long-tailed dataset.
Our simple yet effective method enables detection-free HOI classification, outperforming the state-of-the-arts that require object detection and human pose by a clear margin.
arXiv Detail & Related papers (2022-03-10T23:35:00Z) - GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot
Action Recognition [33.23662792742078]
We propose a two-stage deep neural network for zero-shot action recognition.
In the sampling stage, we utilize a generative adversarial networks (GAN) trained by action features and word vectors of seen classes.
In the classification stage, we construct a knowledge graph based on the relationship between word vectors of action classes and related objects.
arXiv Detail & Related papers (2021-05-25T09:34:42Z) - Automatic Intent-Slot Induction for Dialogue Systems [5.6195418981579435]
We propose a new task of em automatic intent-slot induction and propose a novel domain-independent tool.
That is, we design a coarse-to-fine three-step procedure including role-labeling, Concept-mining, And Pattern-mining.
We show that our RCAP can generate satisfactory SLU schema and outperforms the state-of-the-art supervised learning method.
arXiv Detail & Related papers (2021-03-16T07:21:31Z) - Example-Driven Intent Prediction with Observers [15.615065041164629]
We focus on the intent classification problem which aims to identify user intents given utterances addressed to the dialog system.
We propose two approaches for improving the generalizability of utterance classification models: (1) observers and (2) example-driven training.
arXiv Detail & Related papers (2020-10-17T01:03:06Z) - DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act
Recognition and Sentiment Classification [77.59549450705384]
In dialog system, dialog act recognition and sentiment classification are two correlative tasks.
Most of the existing systems either treat them as separate tasks or just jointly model the two tasks.
We propose a Deep Co-Interactive Relation Network (DCR-Net) to explicitly consider the cross-impact and model the interaction between the two tasks.
arXiv Detail & Related papers (2020-08-16T14:13:32Z) - BriNet: Towards Bridging the Intra-class and Inter-class Gaps in
One-Shot Segmentation [84.2925550033094]
Few-shot segmentation focuses on the generalization of models to segment unseen object instances with limited training samples.
We propose a framework, BriNet, to bridge the gaps between the extracted features of the query and support images.
The effectiveness of our framework is demonstrated by experimental results, which outperforms other competitive methods.
arXiv Detail & Related papers (2020-08-14T07:45:50Z) - Learning a Domain Classifier Bank for Unsupervised Adaptive Object
Detection [48.19258721979389]
In this paper, we propose a fine-grained domain alignment approach for object detectors based on deep networks.
We develop a bare object detector with the proposed fine-grained domain alignment mechanism as the adaptive detector.
Experiments on three popular transferring benchmarks demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2020-07-06T09:12:46Z) - Cross-domain Detection via Graph-induced Prototype Alignment [114.8952035552862]
We propose a Graph-induced Prototype Alignment (GPA) framework to seek for category-level domain alignment.
In addition, in order to alleviate the negative effect of class-imbalance on domain adaptation, we design a Class-reweighted Contrastive Loss.
Our approach outperforms existing methods with a remarkable margin.
arXiv Detail & Related papers (2020-03-28T17:46:55Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.