An Integrated Attribute Guided Dense Attention Model for Fine-Grained
Generalized Zero-Shot Learning
- URL: http://arxiv.org/abs/2101.02141v2
- Date: Fri, 5 Feb 2021 03:24:02 GMT
- Title: An Integrated Attribute Guided Dense Attention Model for Fine-Grained
Generalized Zero-Shot Learning
- Authors: Tasfia Shermin, Shyh Wei Teng, Ferdous Sohel, Manzur Murshed, Guojun
Lu
- Abstract summary: Embedding learning (EL) and feature synthesizing (FS) are two of the popular categories of fine-grained GZSL methods.
We propose to explore global and direct attribute-supervised local visual features for both EL and FS categories.
We demonstrate that our proposed method outperforms contemporary methods on benchmark datasets.
- Score: 7.22073260315824
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Embedding learning (EL) and feature synthesizing (FS) are two of the popular
categories of fine-grained GZSL methods. The global feature exploring EL or FS
methods do not explore fine distinction as they ignore local details. And, the
local detail exploring EL or FS methods either neglect direct attribute
guidance or global information. Consequently, neither method performs well. In
this paper, we propose to explore global and direct attribute-supervised local
visual features for both EL and FS categories in an integrated manner for
fine-grained GZSL. The proposed integrated network has an EL sub-network and a
FS sub-network. Consequently, the proposed integrated network can be tested in
two ways. We propose a novel two-step dense attention mechanism to discover
attribute-guided local visual features. We introduce new mutual learning
between the sub-networks to exploit mutually beneficial information for
optimization. Moreover, to reduce bias towards the source domain during
testing, we propose to compute source-target class similarity based on mutual
information and transfer-learn the target classes. We demonstrate that our
proposed method outperforms contemporary methods on benchmark datasets.
Related papers
- GLC++: Source-Free Universal Domain Adaptation through Global-Local Clustering and Contrastive Affinity Learning [84.54244771470012]
Source-Free Universal Domain Adaptation (SF-UniDA) aims to accurately classify "known" data belonging to common categories.
We propose a novel Global and Local Clustering (GLC) technique, which comprises an adaptive one-vs-all global clustering algorithm.
We evolve GLC to GLC++, integrating a contrastive affinity learning strategy.
arXiv Detail & Related papers (2024-03-21T13:57:45Z) - Learning to Discover and Detect Objects [43.52208526783969]
We tackle the problem of novel class discovery, detection, and localization (NCDL)
In this setting, we assume a source dataset with labels for objects of commonly observed classes.
By training our detection network with this objective in an end-to-end manner, it learns to classify all region proposals for a large variety of classes.
arXiv Detail & Related papers (2022-10-19T17:59:55Z) - L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly
Supervised Semantic Segmentation [67.26984058377435]
We present L2G, a simple online local-to-global knowledge transfer framework for high-quality object attention mining.
Our framework conducts the global network to learn the captured rich object detail knowledge from a global view.
Experiments show that our method attains 72.1% and 44.2% mIoU scores on the validation set of PASCAL VOC 2012 and MS COCO 2014.
arXiv Detail & Related papers (2022-04-07T04:31:32Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot
Learning [77.72330187258498]
We propose a novel Entropy-guided Reinforced Partial Convolutional Network (ERPCNet)
ERPCNet extracts and aggregates localities based on semantic relevance and visual correlations without human-annotated regions.
It not only discovers global-cooperative localities dynamically but also converges faster for policy gradient optimization.
arXiv Detail & Related papers (2021-11-03T11:13:13Z) - Semi-supervised Domain Adaptation for Semantic Segmentation [3.946367634483361]
We propose a novel two-step semi-supervised dual-domain adaptation (SSDDA) approach to address both cross- and intra-domain gaps in semantic segmentation.
We demonstrate that the proposed approach outperforms state-of-the-art methods on two common synthetic-to-real semantic segmentation benchmarks.
arXiv Detail & Related papers (2021-10-20T16:13:00Z) - Learning to Affiliate: Mutual Centralized Learning for Few-shot
Classification [33.19451499073551]
Few-shot learning aims to learn a classifier that can be easily adapted to accommodate new tasks not seen during training.
Recent methods tend to collectively use a set of local features to densely represent an image instead of using a mixed global feature.
arXiv Detail & Related papers (2021-06-10T06:16:00Z) - Unveiling the Potential of Structure-Preserving for Weakly Supervised
Object Localization [71.79436685992128]
We propose a two-stage approach, termed structure-preserving activation (SPA), towards fully leveraging the structure information incorporated in convolutional features for WSOL.
In the first stage, a restricted activation module (RAM) is designed to alleviate the structure-missing issue caused by the classification network.
In the second stage, we propose a post-process approach, termed self-correlation map generating (SCG) module to obtain structure-preserving localization maps.
arXiv Detail & Related papers (2021-03-08T03:04:14Z) - Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning [7.22073260315824]
Bidirectional mapping-based generalized zero-shot learning (GZSL) methods rely on the quality of synthesized features to recognize seen and unseen data.
We learn a joint distribution of seen-unseen domains and preserving domain distinction is crucial for these methods.
In this work, we utilize the available unseen class semantics alongside seen class semantics and learn joint distribution through a strong visual-semantic coupling.
arXiv Detail & Related papers (2020-12-30T06:11:29Z) - Global Context-Aware Progressive Aggregation Network for Salient Object
Detection [117.943116761278]
We propose a novel network named GCPANet to integrate low-level appearance features, high-level semantic features, and global context features.
We show that the proposed approach outperforms the state-of-the-art methods both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-03-02T04:26:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.