Related papers: Exploring the Limits of Natural Language Inference Based Setup for Few-Shot Intent Detection

Exploring the Limits of Natural Language Inference Based Setup for Few-Shot Intent Detection

URL: http://arxiv.org/abs/2112.07434v2
Date: Tue, 26 Dec 2023 06:59:28 GMT
Title: Exploring the Limits of Natural Language Inference Based Setup for Few-Shot Intent Detection
Authors: Ayush Kumar, Vijit Malik, Jithendra Vepa
Abstract summary: Generalized Few-shot intent detection is more realistic but challenging setup. We employ a simple and effective method based on Natural Language Inference. Our method achieves state-of-the-art results on 1-shot and 5-shot intent detection task.
Score: 13.971616443394474
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Intent Detection is one of the core tasks of dialog systems. Few-shot Intent Detection is challenging due to limited number of annotated utterances for novel classes. Generalized Few-shot intent detection is more realistic but challenging setup which aims to discriminate the joint label space of both novel intents which have few examples each and existing intents consisting of enough labeled data. Large label spaces and fewer number of shots increase the complexity of the task. In this work, we employ a simple and effective method based on Natural Language Inference that leverages the semantics in the class-label names to learn and predict the novel classes. Our method achieves state-of-the-art results on 1-shot and 5-shot intent detection task with gains ranging from 2-8\% points in F1 score on four benchmark datasets. Our method also outperforms existing approaches on a more practical setting of generalized few-shot intent detection with gains up to 20% F1 score. We show that the suggested approach performs well across single and multi domain datasets with the number of class labels from as few as 7 to as high as 150.

Related papers

Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification [10.850826520563967]
We propose a novel approach to few-shot dialogue intent classification through in-context learning. Our method retrieves relevant examples for a test input from the training set. We leverage a large language model to dynamically refine intent labels based on semantic understanding.
arXiv Detail & Related papers (2024-12-20T06:53:57Z)
All Labels Together: Low-shot Intent Detection with an Efficient Label Semantic Encoding Paradigm [48.02790193676742]
In intent detection tasks, leveraging meaningful semantic information from intent labels can be particularly beneficial for few-shot scenarios. We present an end-to-end One-to-All system that enables the comparison of an input utterance with all label candidates. Experiments on three few-shot intent detection tasks demonstrate that One-to-All is especially effective when the training resource is extremely scarce.
arXiv Detail & Related papers (2023-09-07T08:50:45Z)
New Intent Discovery with Pre-training and Contrastive Learning [21.25371293641141]
New intent discovery aims to uncover novel intent categories from user utterances to expand the set of supported intent classes. Existing approaches typically rely on a large amount of labeled utterances. We propose a new contrastive loss to exploit self-supervisory signals in unlabeled data for clustering.
arXiv Detail & Related papers (2022-05-25T17:07:25Z)
Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning [27.154414939086426]
We present a simple yet effective few-shot intent detection schema via contrastive pre-training and fine-tuning. We first conduct self-supervised contrastive pre-training on collected intent datasets, which implicitly learns to discriminate semantically similar utterances. We then perform few-shot intent detection together with supervised contrastive learning, which explicitly pulls utterances from the same intent closer.
arXiv Detail & Related papers (2021-09-13T22:28:58Z)
Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference [150.07326223077405]
Few-shot learning is attracting much attention to mitigate data scarcity. We present a discriminative nearest neighbor classification with deep self-attention. We propose to boost the discriminative ability by transferring a natural language inference (NLI) model.
arXiv Detail & Related papers (2020-10-25T00:39:32Z)
Few-shot Learning for Multi-label Intent Detection [59.66787898744991]
State-of-the-art work estimates label-instance relevance scores and uses a threshold to select multiple associated intent labels. Experiments on two datasets show that the proposed model significantly outperforms strong baselines in both one-shot and five-shot settings.
arXiv Detail & Related papers (2020-10-11T14:42:18Z)
Dynamic Semantic Matching and Aggregation Network for Few-shot Intent Detection [69.2370349274216]
Few-shot Intent Detection is challenging due to the scarcity of available annotated utterances. Semantic components are distilled from utterances via multi-head self-attention. Our method provides a comprehensive matching measure to enhance representations of both labeled and unlabeled instances.
arXiv Detail & Related papers (2020-10-06T05:16:38Z)
Any-Shot Object Detection [81.88153407655334]
'Any-shot detection' is where totally unseen and few-shot categories can simultaneously co-occur during inference. We propose a unified any-shot detection model, that can concurrently learn to detect both zero-shot and few-shot object classes. Our framework can also be used solely for Zero-shot detection and Few-shot detection tasks.
arXiv Detail & Related papers (2020-03-16T03:43:15Z)
Efficient Intent Detection with Dual Sentence Encoders [53.16532285820849]
We introduce intent detection methods backed by pretrained dual sentence encoders such as USE and ConveRT. We demonstrate the usefulness and wide applicability of the proposed intent detectors, showing that they outperform intent detectors based on fine-tuning the full BERT-Large model. We release our code, as well as a new challenging single-domain intent detection dataset.
arXiv Detail & Related papers (2020-03-10T15:33:54Z)

This list is automatically generated from the titles and abstracts of the papers in this site.