Related papers: Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition

Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition

URL: http://arxiv.org/abs/2310.10184v1
Date: Mon, 16 Oct 2023 08:48:07 GMT
Title: Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition
Authors: Xiaoshuai Song, Yutao Mou, Keqing He, Yueyan Qiu, Pei Wang, Weiran Xu
Abstract summary: Generalized Intent Discovery (GID) only considers one stage of OOD learning, and needs to utilize the data in all previous stages for joint training. Continual Generalized Intent Discovery (CGID) aims to continuously and automatically discover OOD intents from dynamic OOD data streams. PLRD bootstraps new intent discovery through class prototypes and balances new and old intents through data replay and feature distillation.
Score: 25.811639218862958
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In a practical dialogue system, users may input out-of-domain (OOD) queries. The Generalized Intent Discovery (GID) task aims to discover OOD intents from OOD queries and extend them to the in-domain (IND) classifier. However, GID only considers one stage of OOD learning, and needs to utilize the data in all previous stages for joint training, which limits its wide application in reality. In this paper, we introduce a new task, Continual Generalized Intent Discovery (CGID), which aims to continuously and automatically discover OOD intents from dynamic OOD data streams and then incrementally add them to the classifier with almost no previous data, thus moving towards dynamic intent recognition in an open world. Next, we propose a method called Prototype-guided Learning with Replay and Distillation (PLRD) for CGID, which bootstraps new intent discovery through class prototypes and balances new and old intents through data replay and feature distillation. Finally, we conduct detailed experiments and analysis to verify the effectiveness of PLRD and understand the key challenges of CGID for future research.

Related papers

Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework [49.60947755616314]
Generalized Intent Discovery (GID) addresses this by leveraging unlabeled OOD data to discover new intents without additional annotation.<n>We propose a consistency-driven prototype-prompting framework for GID from the perspective of integrating old and new knowledge.<n>Our method significantly outperforms all baseline methods, achieving state-of-the-art results.
arXiv Detail & Related papers (2025-06-10T06:30:17Z)
IntentGPT: Few-shot Intent Discovery with Large Language Models [9.245106106117317]
We develop a model capable of identifying new intents as they emerge. IntentGPT is a training-free method that effectively prompts Large Language Models (LLMs) to discover new intents with minimal labeled data. Our experiments show that IntentGPT outperforms previous methods that require extensive domain-specific data and fine-tuning.
arXiv Detail & Related papers (2024-11-16T02:16:59Z)
Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery [27.18799732585361]
We propose a Pseudo-Label enhanced Prototypical Contrastive Learning (PLPCL) model for uniformed intent discovery. We iteratively utilize pseudo-labels to explore potential positive/negative samples for contrastive learning and bridge the gap between representation and clustering. Our method has been proven effective in two different settings of discovering new intents.
arXiv Detail & Related papers (2024-10-26T16:22:45Z)
COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking [32.47215340215641]
This paper extends the scope of continual learning research to class-incremental learning for multiple object tracking (MOT) Previous solutions for continual learning of object detectors do not address the data association stage of appearance-based trackers. We introduce COOLer, a COntrastive- and cOntinual-Learning-based tracker, which incrementally learns to track new categories while preserving past knowledge.
arXiv Detail & Related papers (2023-10-04T17:49:48Z)
Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts [91.43701971416213]
We introduce a context-aware OOD intent detection (Caro) framework to model multi-turn contexts in OOD intent detection tasks. Caro establishes state-of-the-art performances on multi-turn OOD detection tasks by improving the F1-OOD score of over $29%$ compared to the previous best method.
arXiv Detail & Related papers (2023-05-05T01:39:21Z)
A Hybrid Architecture for Out of Domain Intent Detection and Intent Discovery [0.0]
Out of Scope (OOS) and Out of Domain (OOD) inputs may run task-oriented systems into a problem. A labeled dataset is needed to train a model for Intent Detection in task-oriented dialogue systems. The creation of a labeled dataset is time-consuming and needs human resources. Our results show that the proposed model for both OOD/OOS Intent Detection and Intent Discovery achieves great results.
arXiv Detail & Related papers (2023-03-07T18:49:13Z)
Discovering New Intents Using Latent Variables [51.50374666602328]
We propose a probabilistic framework for discovering intents where intent assignments are treated as latent variables. In E-step, we conduct discovering intents and explore the intrinsic structure of unlabeled data by the posterior of intent assignments. In M-step, we alleviate the forgetting of prior knowledge transferred from known intents by optimizing the discrimination of labeled data.
arXiv Detail & Related papers (2022-10-21T08:29:45Z)
Generalized Intent Discovery: Learning from Open World Dialogue System [34.39483579171543]
Generalized Intent Discovery (GID) aims to extend an IND intent classifier to an open-world intent set including IND and OOD intents. We construct three public datasets for different application scenarios and propose two kinds of frameworks.
arXiv Detail & Related papers (2022-09-13T14:31:53Z)
Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation [76.2621758731288]
We tackle the detection of out-of-distribution (OOD) objects in semantic segmentation. Our main contribution is a new OOD detection architecture called ObsNet associated with a dedicated training scheme based on Local Adversarial Attacks (LAA) We show it obtains top performances both in speed and accuracy when compared to ten recent methods of the literature on three different datasets.
arXiv Detail & Related papers (2021-08-03T17:09:56Z)
Enhancing the Generalization for Intent Classification and Out-of-Domain Detection in SLU [70.44344060176952]
Intent classification is a major task in spoken language understanding (SLU) Recent works have shown that using extra data and labels can improve the OOD detection performance. This paper proposes to train a model with only IND data while supporting both IND intent classification and OOD detection.
arXiv Detail & Related papers (2021-06-28T08:27:38Z)
Privileged Knowledge Distillation for Online Action Detection [114.5213840651675]
Online Action Detection (OAD) in videos is proposed as a per-frame labeling task to address the real-time prediction tasks. This paper presents a novel learning-with-privileged based framework for online action detection where the future frames only observable at the training stages are considered as a form of privileged information.
arXiv Detail & Related papers (2020-11-18T08:52:15Z)
Continual Learning for Natural Language Generation in Task-oriented Dialog Systems [72.92029584113676]
Natural language generation (NLG) is an essential component of task-oriented dialog systems. We study NLG in a "continual learning" setting to expand its knowledge to new domains or functionalities incrementally. The major challenge towards this goal is catastrophic forgetting, meaning that a continually trained model tends to forget the knowledge it has learned before.
arXiv Detail & Related papers (2020-10-02T10:32:29Z)

This list is automatically generated from the titles and abstracts of the papers in this site.