Generalized Category Discovery with Decoupled Prototypical Network
- URL: http://arxiv.org/abs/2211.15115v1
- Date: Mon, 28 Nov 2022 08:05:45 GMT
- Title: Generalized Category Discovery with Decoupled Prototypical Network
- Authors: Wenbin An, Feng Tian, Qinghua Zheng, Wei Ding, QianYing Wang, Ping
Chen
- Abstract summary: We present a novel model called Decoupled Prototypical Network (DPN)
By formulating a bipartite matching problem for category prototypes, DPN can achieve different training targets effectively.
DPN can learn more discriminative features for both known and novel categories through our proposed Semantic-aware Prototypical Learning (SPL)
- Score: 27.1635162759448
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Generalized Category Discovery (GCD) aims to recognize both known and novel
categories from a set of unlabeled data, based on another dataset labeled with
only known categories. Without considering differences between known and novel
categories, current methods learn about them in a coupled manner, which can
hurt model's generalization and discriminative ability. Furthermore, the
coupled training approach prevents these models transferring category-specific
knowledge explicitly from labeled data to unlabeled data, which can lose
high-level semantic information and impair model performance. To mitigate above
limitations, we present a novel model called Decoupled Prototypical Network
(DPN). By formulating a bipartite matching problem for category prototypes, DPN
can not only decouple known and novel categories to achieve different training
targets effectively, but also align known categories in labeled and unlabeled
data to transfer category-specific knowledge explicitly and capture high-level
semantics. Furthermore, DPN can learn more discriminative features for both
known and novel categories through our proposed Semantic-aware Prototypical
Learning (SPL). Besides capturing meaningful semantic information, SPL can also
alleviate the noise of hard pseudo labels through semantic-weighted soft
assignment. Extensive experiments show that DPN outperforms state-of-the-art
models by a large margin on all evaluation metrics across multiple benchmark
datasets. Code and data are available at https://github.com/Lackel/DPN.
Related papers
- Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery [65.16724941038052]
Category-aware Prototype Generation (CPG) and Discrimi Category 5.3% (DCE) are proposed.
CPG enables the model to fully capture the intra-category diversity by representing each category with multiple prototypes.
DCE boosts the discrimination ability of hash code with the guidance of the generated category prototypes.
arXiv Detail & Related papers (2024-10-24T23:51:40Z) - Transfer and Alignment Network for Generalized Category Discovery [11.37283148391564]
Generalized Category Discovery is a crucial real-world task.
Despite the improved performance on known categories, current methods perform poorly on novel categories.
We attribute the poor performance to two reasons: biased knowledge transfer between labeled and unlabeled data and noisy representation learning on the unlabeled data.
We propose a Transfer and Alignment Network (TAN), which incorporates two knowledge transfer mechanisms to calibrate the biased knowledge and two feature alignment mechanisms to learn discriminative features.
arXiv Detail & Related papers (2023-12-27T08:35:47Z) - Transductive Learning for Textual Few-Shot Classification in API-based
Embedding Models [46.79078308022975]
Few-shot classification involves training a model to perform a new classification task with a handful of labeled data.
We introduce a scenario where the embedding of a pre-trained model is served through a gated API with compute-cost and data-privacy constraints.
We propose a transductive inference, a learning paradigm that has been overlooked by the NLP community.
arXiv Detail & Related papers (2023-10-21T12:47:10Z) - Dynamic Conceptional Contrastive Learning for Generalized Category
Discovery [76.82327473338734]
Generalized category discovery (GCD) aims to automatically cluster partially labeled data.
Unlabeled data contain instances that are not only from known categories of the labeled data but also from novel categories.
One effective way for GCD is applying self-supervised learning to learn discriminate representation for unlabeled data.
We propose a Dynamic Conceptional Contrastive Learning framework, which can effectively improve clustering accuracy.
arXiv Detail & Related papers (2023-03-30T14:04:39Z) - Fine-grained Category Discovery under Coarse-grained supervision with
Hierarchical Weighted Self-contrastive Learning [37.6512548064269]
We investigate a new practical scenario called Fine-grained Category Discovery under Coarse-grained supervision (FCDC)
FCDC aims at discovering fine-grained categories with only coarse-grained labeled data, which can adapt models to categories of different granularity from known ones and reduce significant labeling cost.
We propose a hierarchical weighted self-contrastive network by building a novel weighted self-contrastive module and combining it with supervised learning in a hierarchical manner.
arXiv Detail & Related papers (2022-10-14T12:06:23Z) - Novel Class Discovery without Forgetting [72.52222295216062]
We identify and formulate a new, pragmatic problem setting of NCDwF: Novel Class Discovery without Forgetting.
We propose a machine learning model to incrementally discover novel categories of instances from unlabeled data.
We introduce experimental protocols based on CIFAR-10, CIFAR-100 and ImageNet-1000 to measure the trade-off between knowledge retention and novel class discovery.
arXiv Detail & Related papers (2022-07-21T17:54:36Z) - Class-incremental Novel Class Discovery [76.35226130521758]
We study the new task of class-incremental Novel Class Discovery (class-iNCD)
We propose a novel approach for class-iNCD which prevents forgetting of past information about the base classes.
Our experiments, conducted on three common benchmarks, demonstrate that our method significantly outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2022-07-18T13:49:27Z) - Label-enhanced Prototypical Network with Contrastive Learning for
Multi-label Few-shot Aspect Category Detection [17.228616743739412]
Multi-label aspect category detection allows a given review sentence to contain multiple aspect categories.
We propose a novel label-enhanced network (LPN) for multi-label few-shot aspect category detection.
arXiv Detail & Related papers (2022-06-14T02:37:44Z) - Dual Progressive Prototype Network for Generalized Zero-Shot Learning [77.0715029826957]
Generalized Zero-Shot Learning (GZSL) aims to recognize new categories with auxiliary semantic information,e.g., category attributes.
Our approach, named Dual Progressive Prototype Network (DPPN), constructs two types of prototypes that record visual patterns for attributes and categories.
Experiments on four benchmarks prove that DPPN effectively alleviates the domain shift problem in GZSL.
arXiv Detail & Related papers (2021-11-03T08:43:29Z) - Learning Adaptive Embedding Considering Incremental Class [55.21855842960139]
Class-Incremental Learning (CIL) aims to train a reliable model with the streaming data, which emerges unknown classes sequentially.
Different from traditional closed set learning, CIL has two main challenges: 1) Novel class detection.
After the novel classes are detected, the model needs to be updated without re-training using entire previous data.
arXiv Detail & Related papers (2020-08-31T04:11:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.