Active Generalized Category Discovery
- URL: http://arxiv.org/abs/2403.04272v1
- Date: Thu, 7 Mar 2024 07:12:24 GMT
- Title: Active Generalized Category Discovery
- Authors: Shijie Ma, Fei Zhu, Zhun Zhong, Xu-Yao Zhang, Cheng-Lin Liu
- Abstract summary: Generalized Category Discovery (GCD) endeavors to cluster unlabeled samples from both novel and old classes.
We take the spirit of active learning and propose a new setting called Active Generalized Category Discovery (AGCD)
Our method achieves state-of-the-art performance on both generic and fine-grained datasets.
- Score: 60.69060965936214
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Generalized Category Discovery (GCD) is a pragmatic and challenging
open-world task, which endeavors to cluster unlabeled samples from both novel
and old classes, leveraging some labeled data of old classes. Given that
knowledge learned from old classes is not fully transferable to new classes,
and that novel categories are fully unlabeled, GCD inherently faces intractable
problems, including imbalanced classification performance and inconsistent
confidence between old and new classes, especially in the low-labeling regime.
Hence, some annotations of new classes are deemed necessary. However, labeling
new classes is extremely costly. To address this issue, we take the spirit of
active learning and propose a new setting called Active Generalized Category
Discovery (AGCD). The goal is to improve the performance of GCD by actively
selecting a limited amount of valuable samples for labeling from the oracle. To
solve this problem, we devise an adaptive sampling strategy, which jointly
considers novelty, informativeness and diversity to adaptively select novel
samples with proper uncertainty. However, owing to the varied orderings of
label indices caused by the clustering of novel classes, the queried labels are
not directly applicable to subsequent training. To overcome this issue, we
further propose a stable label mapping algorithm that transforms ground truth
labels to the label space of the classifier, thereby ensuring consistent
training across different active selection stages. Our method achieves
state-of-the-art performance on both generic and fine-grained datasets. Our
code is available at https://github.com/mashijie1028/ActiveGCD
Related papers
- Happy: A Debiased Learning Framework for Continual Generalized Category Discovery [54.54153155039062]
This paper explores the underexplored task of Continual Generalized Category Discovery (C-GCD)
C-GCD aims to incrementally discover new classes from unlabeled data while maintaining the ability to recognize previously learned classes.
We introduce a debiased learning framework, namely Happy, characterized by Hardness-aware prototype sampling and soft entropy regularization.
arXiv Detail & Related papers (2024-10-09T04:18:51Z) - Generalized Category Discovery with Clustering Assignment Consistency [56.92546133591019]
Generalized category discovery (GCD) is a recently proposed open-world task.
We propose a co-training-based framework that encourages clustering consistency.
Our method achieves state-of-the-art performance on three generic benchmarks and three fine-grained visual recognition datasets.
arXiv Detail & Related papers (2023-10-30T00:32:47Z) - Novel Class Discovery for Long-tailed Recognition [17.464885951359047]
We consider a more realistic setting for novel class discovery where the distributions of novel and known classes are long-tailed.
To tackle this problem, we propose an adaptive self-labeling strategy based on an equiangular prototype representation of classes.
Our method infers high-quality pseudo-labels for the novel classes by solving a relaxed optimal transport problem.
arXiv Detail & Related papers (2023-08-06T02:15:19Z) - Dynamic Conceptional Contrastive Learning for Generalized Category
Discovery [76.82327473338734]
Generalized category discovery (GCD) aims to automatically cluster partially labeled data.
Unlabeled data contain instances that are not only from known categories of the labeled data but also from novel categories.
One effective way for GCD is applying self-supervised learning to learn discriminate representation for unlabeled data.
We propose a Dynamic Conceptional Contrastive Learning framework, which can effectively improve clustering accuracy.
arXiv Detail & Related papers (2023-03-30T14:04:39Z) - TagRec++: Hierarchical Label Aware Attention Network for Question
Categorization [0.3683202928838613]
Online learning systems organize the content according to a well defined taxonomy of hierarchical nature.
The task of categorizing inputs to the hierarchical labels is usually cast as a flat multi-class classification problem.
We formulate the task as a dense retrieval problem to retrieve the appropriate hierarchical labels for each content.
arXiv Detail & Related papers (2022-08-10T05:08:37Z) - A Unified Objective for Novel Class Discovery [48.1003877511578]
We introduce a UNified Objective function (UNO) for discovering novel classes.
UNO favors synergy between supervised and unsupervised learning.
Despite its simplicity, UNO outperforms the state of the art on several benchmarks.
arXiv Detail & Related papers (2021-08-19T07:22:29Z) - A Closer Look at Self-training for Zero-Label Semantic Segmentation [53.4488444382874]
Being able to segment unseen classes not observed during training is an important technical challenge in deep learning.
Prior zero-label semantic segmentation works approach this task by learning visual-semantic embeddings or generative models.
We propose a consistency regularizer to filter out noisy pseudo-labels by taking the intersections of the pseudo-labels generated from different augmentations of the same image.
arXiv Detail & Related papers (2021-04-21T14:34:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.