Generalized Intent Discovery: Learning from Open World Dialogue System
- URL: http://arxiv.org/abs/2209.06030v1
- Date: Tue, 13 Sep 2022 14:31:53 GMT
- Title: Generalized Intent Discovery: Learning from Open World Dialogue System
- Authors: Yutao Mou, Keqing He, Yanan Wu, Pei Wang, Jingang Wang, Wei Wu, Yi
Huang, Junlan Feng, Weiran Xu
- Abstract summary: Generalized Intent Discovery (GID) aims to extend an IND intent classifier to an open-world intent set including IND and OOD intents.
We construct three public datasets for different application scenarios and propose two kinds of frameworks.
- Score: 34.39483579171543
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Traditional intent classification models are based on a pre-defined intent
set and only recognize limited in-domain (IND) intent classes. But users may
input out-of-domain (OOD) queries in a practical dialogue system. Such OOD
queries can provide directions for future improvement. In this paper, we define
a new task, Generalized Intent Discovery (GID), which aims to extend an IND
intent classifier to an open-world intent set including IND and OOD intents. We
hope to simultaneously classify a set of labeled IND intent classes while
discovering and recognizing new unlabeled OOD types incrementally. We construct
three public datasets for different application scenarios and propose two kinds
of frameworks, pipeline-based and end-to-end for future work. Further, we
conduct exhaustive experiments and qualitative analysis to comprehend key
challenges and provide new guidance for future GID research.
Related papers
- Domain Adaptation in Intent Classification Systems: A Review [24.783606027583474]
Dialogue agents, which perform specific tasks, are part of the long-term goal of NLP researchers to build intelligent agents that communicate with humans in natural language.
We conduct a thorough technical review of the datasets, domains, tasks, and methods needed to train the intent classification part of dialogue systems.
arXiv Detail & Related papers (2024-03-26T15:59:05Z) - Continual Generalized Intent Discovery: Marching Towards Dynamic and
Open-world Intent Recognition [25.811639218862958]
Generalized Intent Discovery (GID) only considers one stage of OOD learning, and needs to utilize the data in all previous stages for joint training.
Continual Generalized Intent Discovery (CGID) aims to continuously and automatically discover OOD intents from dynamic OOD data streams.
PLRD bootstraps new intent discovery through class prototypes and balances new and old intents through data replay and feature distillation.
arXiv Detail & Related papers (2023-10-16T08:48:07Z) - Prompt Learning With Knowledge Memorizing Prototypes For Generalized
Few-Shot Intent Detection [22.653220906899612]
Generalized Few-Shot Intent Detection (GFSID) is challenging and realistic because it needs to categorize both seen and novel intents simultaneously.
Previous GFSID methods rely on the episodic learning paradigm.
We propose to convert the GFSID task into the class incremental learning paradigm.
arXiv Detail & Related papers (2023-09-10T09:16:38Z) - Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts [91.43701971416213]
We introduce a context-aware OOD intent detection (Caro) framework to model multi-turn contexts in OOD intent detection tasks.
Caro establishes state-of-the-art performances on multi-turn OOD detection tasks by improving the F1-OOD score of over $29%$ compared to the previous best method.
arXiv Detail & Related papers (2023-05-05T01:39:21Z) - A Hybrid Architecture for Out of Domain Intent Detection and Intent
Discovery [0.0]
Out of Scope (OOS) and Out of Domain (OOD) inputs may run task-oriented systems into a problem.
A labeled dataset is needed to train a model for Intent Detection in task-oriented dialogue systems.
The creation of a labeled dataset is time-consuming and needs human resources.
Our results show that the proposed model for both OOD/OOS Intent Detection and Intent Discovery achieves great results.
arXiv Detail & Related papers (2023-03-07T18:49:13Z) - Discovering New Intents Using Latent Variables [51.50374666602328]
We propose a probabilistic framework for discovering intents where intent assignments are treated as latent variables.
In E-step, we conduct discovering intents and explore the intrinsic structure of unlabeled data by the posterior of intent assignments.
In M-step, we alleviate the forgetting of prior knowledge transferred from known intents by optimizing the discrimination of labeled data.
arXiv Detail & Related papers (2022-10-21T08:29:45Z) - Learning Open-World Object Proposals without Learning to Classify [110.30191531975804]
We propose a classification-free Object Localization Network (OLN) which estimates the objectness of each region purely by how well the location and shape of a region overlaps with any ground-truth object.
This simple strategy learns generalizable objectness and outperforms existing proposals on cross-category generalization.
arXiv Detail & Related papers (2021-08-15T14:36:02Z) - Hierarchical Modeling for Out-of-Scope Domain and Intent Classification [55.23920796595698]
This paper focuses on out-of-scope intent classification in dialog systems.
We propose a hierarchical multi-task learning approach based on a joint model to classify domain and intent simultaneously.
Experiments show that the model outperforms existing methods in terms of accuracy, out-of-scope recall and F1.
arXiv Detail & Related papers (2021-04-30T06:38:23Z) - Automatic Discovery of Novel Intents & Domains from Text Utterances [18.39942131996558]
We propose a novel framework, ADVIN, to automatically discover novel domains and intents from large volumes of unlabeled data.
ADVIN significantly outperforms baselines on three benchmark datasets, and real user utterances from a commercial voice-powered agent.
arXiv Detail & Related papers (2020-05-22T00:47:10Z) - AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent
Detection and Slot Filling [69.59096090788125]
In this paper, we propose an Adaptive Graph-Interactive Framework (AGIF) for joint multiple intent detection and slot filling.
We introduce an intent-slot graph interaction layer to model the strong correlation between the slot and intents.
Such an interaction layer is applied to each token adaptively, which has the advantage to automatically extract the relevant intents information.
arXiv Detail & Related papers (2020-04-21T15:07:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.