Comparison Knowledge Translation for Generalizable Image Classification
- URL: http://arxiv.org/abs/2205.03633v1
- Date: Sat, 7 May 2022 11:05:18 GMT
- Title: Comparison Knowledge Translation for Generalizable Image Classification
- Authors: Zunlei Feng, Tian Qiu, Sai Wu, Xiaotuan Jin, Zengliang He, Mingli
Song, Huiqiong Wang
- Abstract summary: We build a generalizable framework that emulates the human recognition mechanism in the image classification task.
We put forward a Comparison Classification Translation Network (CCT-Net), which comprises a comparison classifier and a matching discriminator.
CCT-Net achieves surprising generalization ability on unseen categories and SOTA performance on target categories.
- Score: 31.530232003512957
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning has recently achieved remarkable performance in image
classification tasks, which depends heavily on massive annotation. However, the
classification mechanism of existing deep learning models seems to contrast
with the human recognition mechanism. With only a glance at an image of an
object, even of an unknown type, humans can quickly and precisely find other
objects of the same category among massive images, a skill that benefits from
the daily recognition of various objects. In this paper, we attempt to build a
generalizable framework that emulates the human recognition mechanism in the
image classification task, hoping to improve the classification performance on
unseen categories with the support of annotations of other categories.
Specifically, we investigate a new task termed Comparison Knowledge Translation
(CKT). Given a set of fully labeled categories, CKT aims to translate the
comparison knowledge learned from the labeled categories to a set of novel
categories. To this end, we put forward a Comparison Classification Translation
Network (CCT-Net), which comprises a comparison classifier and a matching
discriminator. The comparison classifier is devised to classify whether two
images belong to the same category or not, while the matching discriminator
works with it in an adversarial manner to verify whether the classified
results match the ground truth. Extensive experiments show that CCT-Net
achieves surprising generalization ability on unseen categories and SOTA
performance on target categories.
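The abstract gives enough detail to outline how the two modules interact. The following PyTorch sketch is a minimal, assumed interpretation of that description: `PairEncoder`, `ComparisonClassifier`, and `MatchingDiscriminator`, along with every architectural choice inside them, are hypothetical stand-ins and are not taken from the authors' released CCT-Net code.

```python
# Hedged sketch of a pair-comparison classifier plus matching discriminator,
# assuming a PyTorch implementation. All module names and sizes are illustrative.
import torch
import torch.nn as nn

class PairEncoder(nn.Module):
    """Shared backbone that embeds each image of a pair (placeholder CNN)."""
    def __init__(self, dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, dim),
        )

    def forward(self, x):
        return self.net(x)

class ComparisonClassifier(nn.Module):
    """Predicts whether two images belong to the same category."""
    def __init__(self, dim=128):
        super().__init__()
        self.encoder = PairEncoder(dim)
        self.head = nn.Sequential(nn.Linear(2 * dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, img_a, img_b):
        feats = torch.cat([self.encoder(img_a), self.encoder(img_b)], dim=1)
        return torch.sigmoid(self.head(feats))  # P(same category)

class MatchingDiscriminator(nn.Module):
    """Judges whether a predicted same/different score is consistent with the pair."""
    def __init__(self, dim=128):
        super().__init__()
        self.encoder = PairEncoder(dim)
        self.head = nn.Sequential(nn.Linear(2 * dim + 1, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, img_a, img_b, score):
        feats = torch.cat([self.encoder(img_a), self.encoder(img_b), score], dim=1)
        return torch.sigmoid(self.head(feats))  # P(score matches the pair)

# Toy forward pass on random pairs to show how the two parts interact.
clf, disc = ComparisonClassifier(), MatchingDiscriminator()
a, b = torch.randn(4, 3, 64, 64), torch.randn(4, 3, 64, 64)
same_prob = clf(a, b)                      # comparison classifier output
consistency = disc(a, b, same_prob)        # discriminator critiques that output
print(same_prob.shape, consistency.shape)  # torch.Size([4, 1]) torch.Size([4, 1])
```

In an actual training loop, the classifier would be optimized to produce comparison scores the discriminator accepts as truthful, while the discriminator learns to flag mismatched predictions, mirroring the adversarial setup the abstract describes.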
Related papers
- Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions [35.20091752343433]
This work introduces two hierarchical contexts, namely perceptual context and spurious context, to carefully describe the precise category boundary.
The two contexts hierarchically construct a precise description for each category, which first roughly classifies a sample into the predicted category.
These precise category descriptions within the vision-language framework enable a novel application: CATegory-EXtensible OOD detection (CATEX)
arXiv Detail & Related papers (2024-07-23T12:53:38Z) - ChatGPT-Powered Hierarchical Comparisons for Image Classification [12.126353699873281]
We present a novel framework for image classification based on large language models (LLMs)
We group classes into hierarchies and classify images by comparing image-text embeddings at each hierarchy level, resulting in an intuitive, effective, and explainable approach.
arXiv Detail & Related papers (2023-11-01T00:26:40Z) - Category Query Learning for Human-Object Interaction Classification [25.979131884959923]
Unlike most previous HOI methods, we propose a novel and complementary approach called category query learning.
This idea is motivated by an earlier multi-label image classification method, but is applied to the challenging human-object interaction classification task for the first time.
Our method is simple, general and effective. It is validated on three representative HOI baselines and achieves new state-of-the-art results on two benchmarks.
arXiv Detail & Related papers (2023-03-24T13:59:58Z) - Stable Attribute Group Editing for Reliable Few-shot Image Generation [88.59350889410794]
We present an "editing-based" framework, Attribute Group Editing (AGE), for reliable few-shot image generation.
We find that class inconsistency is a common problem in GAN-generated images for downstream classification.
We propose to boost the downstream classification performance of SAGE by enhancing the pixel and frequency components.
arXiv Detail & Related papers (2023-02-01T01:51:47Z) - Semantic Representation and Dependency Learning for Multi-Label Image
Recognition [76.52120002993728]
We propose a novel and effective semantic representation and dependency learning (SRDL) framework to learn category-specific semantic representation for each category.
Specifically, we design a category-specific attentional regions (CAR) module to generate channel/spatial-wise attention matrices to guide the model.
We also design an object erasing (OE) module to implicitly learn semantic dependency among categories by erasing semantic-aware regions.
arXiv Detail & Related papers (2022-04-08T00:55:15Z) - Multi-Label Image Classification with Contrastive Learning [57.47567461616912]
We show that a direct application of contrastive learning can hardly improve performance in multi-label cases.
We propose a novel framework for multi-label classification with contrastive learning in a fully supervised setting.
arXiv Detail & Related papers (2021-07-24T15:00:47Z) - Boosting few-shot classification with view-learnable contrastive
learning [19.801016732390064]
We introduce contrastive loss into few-shot classification for learning latent fine-grained structure in the embedding space.
We develop a learning-to-learn algorithm to automatically generate different views of the same image.
arXiv Detail & Related papers (2021-07-20T03:13:33Z) - Category Contrast for Unsupervised Domain Adaptation in Visual Tasks [92.9990560760593]
We propose a novel Category Contrast technique (CaCo) that introduces semantic priors on top of instance discrimination for visual UDA tasks.
CaCo is complementary to existing UDA methods and generalizable to other learning setups such as semi-supervised learning, unsupervised model adaptation, etc.
arXiv Detail & Related papers (2021-06-05T12:51:35Z) - Learning and Evaluating Representations for Deep One-class
Classification [59.095144932794646]
We present a two-stage framework for deep one-class classification.
We first learn self-supervised representations from one-class data, and then build one-class classifiers on learned representations.
In experiments, we demonstrate state-of-the-art performance on visual domain one-class classification benchmarks (a minimal sketch of this two-stage recipe follows the list below).
arXiv Detail & Related papers (2020-11-04T23:33:41Z) - Zero-Shot Recognition through Image-Guided Semantic Classification [9.291055558504588]
We present a new embedding-based framework for zero-shot learning (ZSL)
Motivated by the binary relevance method for multi-label classification, we propose to inversely learn the mapping between an image and a semantic classifier.
IGSC is conceptually simple and can be realized by a slight enhancement of an existing deep architecture for classification.
arXiv Detail & Related papers (2020-07-23T06:22:40Z) - I Am Going MAD: Maximum Discrepancy Competition for Comparing
Classifiers Adaptively [135.7695909882746]
We introduce the MAximum Discrepancy (MAD) competition.
We adaptively sample a small test set from an arbitrarily large corpus of unlabeled images.
Human labeling on the resulting model-dependent image sets reveals the relative performance of the competing classifiers.
arXiv Detail & Related papers (2020-02-25T03:32:29Z)
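As a concrete illustration of the two-stage one-class recipe summarized in the list above (self-supervised representations first, then a one-class classifier on top), here is a minimal sketch assuming NumPy and scikit-learn; the feature extractor is a random-projection stub standing in for a learned self-supervised encoder.

```python
# Hedged sketch of two-stage one-class classification: extract features, then
# fit a one-class classifier on them. The encoder below is a placeholder.
import numpy as np
from sklearn.svm import OneClassSVM

def extract_features(images: np.ndarray) -> np.ndarray:
    """Stage 1 stand-in: map images to embeddings.

    In the referenced setting this would be a self-supervised encoder trained
    on one-class data; a fixed random projection keeps the example runnable.
    """
    flat = images.reshape(len(images), -1)
    rng = np.random.default_rng(0)                      # fixed seed -> same projection each call
    projection = rng.standard_normal((flat.shape[1], 64))
    return flat @ projection

# Stage 2: fit a one-class classifier on the extracted representations.
train_images = np.random.rand(100, 32, 32, 3)           # in-distribution (one-class) data
test_images = np.random.rand(10, 32, 32, 3)

ocsvm = OneClassSVM(nu=0.1, kernel="rbf", gamma="scale")
ocsvm.fit(extract_features(train_images))
scores = ocsvm.decision_function(extract_features(test_images))  # higher = more normal
print(scores.shape)  # (10,)
```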