X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot
Learning Simultaneously in Classification
- URL: http://arxiv.org/abs/2403.03863v1
- Date: Wed, 6 Mar 2024 17:13:24 GMT
- Title: X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot
Learning Simultaneously in Classification
- Authors: Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin
- Abstract summary: We introduce a novel classification challenge: X-shot, reflecting a real-world context where freq-shot, few-shot, and zero-shot labels co-occur without predefined limits.
X-shot centers on open-domain generalization and devising a system versatile enough to manage various label scenarios.
To our knowledge, this is the first work addressing X-shot learning, where X remains variable.
- Score: 51.07629536521054
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, few-shot and zero-shot learning, which learn to predict
labels with limited annotated instances, have garnered significant attention.
Traditional approaches often treat frequent-shot (freq-shot; labels with
abundant instances), few-shot, and zero-shot learning as distinct challenges,
optimizing systems for just one of these scenarios. Yet, in real-world
settings, label occurrences vary greatly. Some of them might appear thousands
of times, while others might only appear sporadically or not at all. For
practical deployment, it is crucial that a system can adapt to any label
occurrence. We introduce a novel classification challenge: X-shot, reflecting a
real-world context where freq-shot, few-shot, and zero-shot labels co-occur
without predefined limits. Here, X can span from 0 to positive infinity. The
crux of X-shot centers on open-domain generalization and devising a system
versatile enough to manage various label scenarios. To solve X-shot, we propose
BinBin (Binary INference Based on INstruction following) that leverages the
Indirect Supervision from a large collection of NLP tasks via instruction
following, bolstered by Weak Supervision provided by large language models.
BinBin surpasses previous state-of-the-art techniques on three benchmark
datasets across multiple domains. To our knowledge, this is the first work
addressing X-shot learning, where X remains variable.
Related papers
- ICXML: An In-Context Learning Framework for Zero-Shot Extreme Multi-Label Classification [22.825115483590285]
This paper focuses on the task of Extreme Multi-Label Classification (XMC) whose goal is to predict multiple labels for each instance from an extremely large label space.
We introduce In-Context Extreme Multilabel Learning (ICXML), a two-stage framework that cuts down the search space by generating a set of candidate labels through incontext learning and then reranks them.
arXiv Detail & Related papers (2023-11-16T08:01:17Z) - Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation [63.13635858586001]
Referring Image (RIS) is the problem of identifying objects in images through natural language sentences.
We propose a novel weakly-supervised framework that tackles RIS by decomposing it into three steps.
Using only the first two steps (zero-shot segment and select) outperforms other zero-shot baselines by as much as 16.5%.
arXiv Detail & Related papers (2023-10-20T13:20:17Z) - Disambiguation of One-Shot Visual Classification Tasks: A Simplex-Based
Approach [8.436437583394998]
We present a strategy which aims at detecting the presence of multiple objects in a given shot.
This strategy is based on identifying the corners of a simplex in a high dimensional space.
We show the ability of the proposed method to slightly, yet statistically significantly, improve accuracy in extreme settings.
arXiv Detail & Related papers (2023-01-16T11:37:05Z) - Vocabulary-informed Zero-shot and Open-set Learning [128.83517181045815]
We propose vocabulary-informed learning to address problems of supervised, zero-shot, generalized zero-shot and open set recognition.
Specifically, we propose a weighted maximum margin framework for semantic manifold-based recognition that incorporates distance constraints from (both supervised and unsupervised) vocabulary atoms.
We illustrate that resulting model shows improvements in supervised, zero-shot, generalized zero-shot, and large open set recognition, with up to 310K class vocabulary on Animal with Attributes and ImageNet datasets.
arXiv Detail & Related papers (2023-01-03T08:19:22Z) - SuS-X: Training-Free Name-Only Transfer of Vision-Language Models [28.06403983530132]
Contrastive Language-Image Pre-training (CLIP) has emerged as a simple yet effective way to train large-scale vision-language models.
Fine-tuning the entire CLIP model can be resource-intensive and unstable.
We propose a novel method, SuS-X, that requires neither intensive fine-tuning nor costly labelled data.
arXiv Detail & Related papers (2022-11-28T16:48:41Z) - Few-shot Weakly-Supervised Object Detection via Directional Statistics [55.97230224399744]
We propose a probabilistic multiple instance learning approach for few-shot Common Object Localization (COL) and few-shot Weakly Supervised Object Detection (WSOD)
Our model simultaneously learns the distribution of the novel objects and localizes them via expectation-maximization steps.
Our experiments show that the proposed method, despite being simple, outperforms strong baselines in few-shot COL and WSOD, as well as large-scale WSOD tasks.
arXiv Detail & Related papers (2021-03-25T22:34:16Z) - Generative Multi-Label Zero-Shot Learning [136.17594611722285]
Multi-label zero-shot learning strives to classify images into multiple unseen categories for which no data is available during training.
Our work is the first to tackle the problem of multi-label feature in the (generalized) zero-shot setting.
Our cross-level fusion-based generative approach outperforms the state-of-the-art on all three datasets.
arXiv Detail & Related papers (2021-01-27T18:56:46Z) - Any-Shot Object Detection [81.88153407655334]
'Any-shot detection' is where totally unseen and few-shot categories can simultaneously co-occur during inference.
We propose a unified any-shot detection model, that can concurrently learn to detect both zero-shot and few-shot object classes.
Our framework can also be used solely for Zero-shot detection and Few-shot detection tasks.
arXiv Detail & Related papers (2020-03-16T03:43:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.