Class-Specific Variational Auto-Encoder for Content-Based Image
Retrieval
- URL: http://arxiv.org/abs/2304.11734v1
- Date: Sun, 23 Apr 2023 19:51:25 GMT
- Title: Class-Specific Variational Auto-Encoder for Content-Based Image
Retrieval
- Authors: Mehdi Rafiei and Alexandros Iosifidis
- Abstract summary: We propose a regularized loss for Variational Auto-Encoders (VAEs) forcing the model to focus on a given class of interest.
As a result, the model learns to discriminate the data belonging to the class of interest from any other possibility.
Experimental results show that the proposed method outperforms its competition in both in-domain and out-of-domain retrieval problems.
- Score: 95.42181254494287
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Using a discriminative representation obtained by supervised deep learning
methods showed promising results on diverse Content-Based Image Retrieval
(CBIR) problems. However, existing methods exploiting labels during training
try to discriminate all available classes, which is not ideal in cases where
the retrieval problem focuses on a class of interest. In this paper, we propose
a regularized loss for Variational Auto-Encoders (VAEs) forcing the model to
focus on a given class of interest. As a result, the model learns to
discriminate the data belonging to the class of interest from any other
possibility, making the learnt latent space of the VAE suitable for
class-specific retrieval tasks. The proposed Class-Specific Variational
Auto-Encoder (CS-VAE) is evaluated on three public and one custom datasets, and
its performance is compared with that of three related VAE-based methods.
Experimental results show that the proposed method outperforms its competition
in both in-domain and out-of-domain retrieval problems.
Related papers
- Accurate Explanation Model for Image Classifiers using Class Association Embedding [5.378105759529487]
We propose a generative explanation model that combines the advantages of global and local knowledge.
Class association embedding (CAE) encodes each sample into a pair of separated class-associated and individual codes.
Building-block coherency feature extraction algorithm is proposed that efficiently separates class-associated features from individual ones.
arXiv Detail & Related papers (2024-06-12T07:41:00Z) - Evaluating the Fairness of Discriminative Foundation Models in Computer
Vision [51.176061115977774]
We propose a novel taxonomy for bias evaluation of discriminative foundation models, such as Contrastive Language-Pretraining (CLIP)
We then systematically evaluate existing methods for mitigating bias in these models with respect to our taxonomy.
Specifically, we evaluate OpenAI's CLIP and OpenCLIP models for key applications, such as zero-shot classification, image retrieval and image captioning.
arXiv Detail & Related papers (2023-10-18T10:32:39Z) - Convolutional autoencoder-based multimodal one-class classification [80.52334952912808]
One-class classification refers to approaches of learning using data from a single class only.
We propose a deep learning one-class classification method suitable for multimodal data.
arXiv Detail & Related papers (2023-09-25T12:31:18Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Open World Classification with Adaptive Negative Samples [89.2422451410507]
Open world classification is a task in natural language processing with key practical relevance and impact.
We propose an approach based on underlineadaptive underlinesamples (ANS) designed to generate effective synthetic open category samples in the training stage.
ANS achieves significant improvements over state-of-the-art methods.
arXiv Detail & Related papers (2023-03-09T21:12:46Z) - R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction [21.11038841356125]
Fine-grained visual categorization (FGVC) aims to discriminate similar subcategories, whose main challenge is the large intraclass diversities and subtle inter-class differences.
We present a novel approach for FGVC, which can simultaneously make use of partial yet sufficient discriminative information in environmental cues and also compress the redundant information in class-token with respect to the target.
arXiv Detail & Related papers (2022-04-21T13:35:38Z) - Universal Domain Adaptation in Ordinal Regression [11.703377306384695]
We address the problem of universal domain adaptation (UDA) in ordinal regression (OR)
We show that the UDA techniques developed for classification and based on the clustering assumption, under-perform in OR settings.
We propose a method that complements the OR classifier with an auxiliary task of order learning, which plays the double role of discriminating between common and private instances, and expanding class labels to the private target images via ranking.
arXiv Detail & Related papers (2021-06-22T07:23:39Z) - Zero-sample surface defect detection and classification based on
semantic feedback neural network [13.796631421521765]
We propose an Ensemble Co-training algorithm, which adaptively reduces the prediction error in image tag embedding from multiple angles.
Various experiments conducted on the zero-shot dataset and the cylinder liner dataset in the industrial field provide competitive results.
arXiv Detail & Related papers (2021-06-15T08:26:36Z) - Channel DropBlock: An Improved Regularization Method for Fine-Grained
Visual Classification [58.07257910065007]
Existing approaches mainly tackle this problem by introducing attention mechanisms to locate the discriminative parts or feature encoding approaches to extract the highly parameterized features in a weakly-supervised fashion.
In this work, we propose a lightweight yet effective regularization method named Channel DropBlock (CDB) in combination with two alternative correlation metrics, to address this problem.
arXiv Detail & Related papers (2021-06-07T09:03:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.