Enhancing Fine-Grained Classification for Low Resolution Images
- URL: http://arxiv.org/abs/2105.00241v1
- Date: Sat, 1 May 2021 13:19:02 GMT
- Title: Enhancing Fine-Grained Classification for Low Resolution Images
- Authors: Maneet Singh, Shruti Nagpal, Mayank Vatsa, Richa Singh
- Abstract summary: Low resolution images suffer from the inherent challenge of limited information content and the absence of fine details useful for sub-category classification.
This research proposes a novel attribute-assisted loss, which utilizes ancillary information to learn discriminative features for classification.
The proposed loss function enables a model to learn class-specific discriminative features, while incorporating attribute-level separability.
- Score: 97.82441158440527
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Low resolution fine-grained classification has widespread applicability for
applications where data is captured at a distance such as surveillance and
mobile photography. While fine-grained classification with high resolution
images has received significant attention, limited attention has been given to
low resolution images. These images suffer from the inherent challenge of
limited information content and the absence of fine details useful for
sub-category classification. This results in low inter-class variations across
samples of visually similar classes. In order to address these challenges, this
research proposes a novel attribute-assisted loss, which utilizes ancillary
information to learn discriminative features for classification. The proposed
loss function enables a model to learn class-specific discriminative features,
while incorporating attribute-level separability. Evaluation is performed on
multiple datasets with different models, for four resolutions varying from
32x32 to 224x224. Different experiments demonstrate the efficacy of the
proposed attributeassisted loss for low resolution fine-grained classification.
Related papers
- High-Discriminative Attribute Feature Learning for Generalized Zero-Shot Learning [54.86882315023791]
We propose an innovative approach called High-Discriminative Attribute Feature Learning for Generalized Zero-Shot Learning (HDAFL)
HDAFL utilizes multiple convolutional kernels to automatically learn discriminative regions highly correlated with attributes in images.
We also introduce a Transformer-based attribute discrimination encoder to enhance the discriminative capability among attributes.
arXiv Detail & Related papers (2024-04-07T13:17:47Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - A Compositional Feature Embedding and Similarity Metric for
Ultra-Fine-Grained Visual Categorization [16.843126268445726]
Fine-grained visual categorization (FGVC) aims at classifying objects with small inter-class variances.
This paper proposes a novel compositional feature embedding and similarity metric ( CECS) for ultra-fine-grained visual categorization.
Experimental results on two ultra-FGVC datasets and one FGVC dataset with recent benchmark methods consistently demonstrate that the proposed CECS method achieves the state-the-art performance.
arXiv Detail & Related papers (2021-09-25T15:05:25Z) - Region-level Active Learning for Cluttered Scenes [60.93811392293329]
We introduce a new strategy that subsumes previous Image-level and Object-level approaches into a generalized, Region-level approach.
We show that this approach significantly decreases labeling effort and improves rare object search on realistic data with inherent class-imbalance and cluttered scenes.
arXiv Detail & Related papers (2021-08-20T14:02:38Z) - Mutual Information-based Disentangled Neural Networks for Classifying
Unseen Categories in Different Domains: Application to Fetal Ultrasound
Imaging [10.504733425082335]
Deep neural networks exhibit limited generalizability across images with different entangled domain features and categorical features.
We propose Mutual Information-based Disentangled Neural Networks (MIDNet), which extract generalizable categorical features to transfer knowledge to unseen categories in a target domain.
We extensively evaluate the proposed method on fetal ultrasound datasets for two different image classification tasks.
arXiv Detail & Related papers (2020-10-30T17:32:18Z) - Attention Model Enhanced Network for Classification of Breast Cancer
Image [54.83246945407568]
AMEN is formulated in a multi-branch fashion with pixel-wised attention model and classification submodular.
To focus more on subtle detail information, the sample image is enhanced by the pixel-wised attention map generated from former branch.
Experiments conducted on three benchmark datasets demonstrate the superiority of the proposed method under various scenarios.
arXiv Detail & Related papers (2020-10-07T08:44:21Z) - Discriminative Residual Analysis for Image Set Classification with
Posture and Age Variations [27.751472312581228]
Discriminant Residual Analysis (DRA) is proposed to improve the classification performance.
DRA attempts to obtain a powerful projection which casts the residual representations into a discriminant subspace.
Two regularization approaches are used to deal with the probable small sample size problem.
arXiv Detail & Related papers (2020-08-23T08:53:06Z) - Contraction Mapping of Feature Norms for Classifier Learning on the Data
with Different Quality [5.47982638565422]
We propose a contraction mapping function to compress the range of feature norms of training images according to their quality.
Experiments on various classification applications, including handwritten digit recognition, lung nodule classification, face verification and face recognition, demonstrate that the proposed approach is promising.
arXiv Detail & Related papers (2020-07-27T09:53:55Z) - Capturing scattered discriminative information using a deep architecture
in acoustic scene classification [49.86640645460706]
In this study, we investigate various methods to capture discriminative information and simultaneously mitigate the overfitting problem.
We adopt a max feature map method to replace conventional non-linear activations in a deep neural network.
Two data augment methods and two deep architecture modules are further explored to reduce overfitting and sustain the system's discriminative power.
arXiv Detail & Related papers (2020-07-09T08:32:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.