Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning
- URL: http://arxiv.org/abs/2012.15054v2
- Date: Fri, 19 Feb 2021 08:25:09 GMT
- Title: Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning
- Authors: Tasfia Shermin, Shyh Wei Teng, Ferdous Sohel, Manzur Murshed, Guojun
Lu
- Abstract summary: Bidirectional mapping-based generalized zero-shot learning (GZSL) methods rely on the quality of synthesized features to recognize seen and unseen data.
We learn a joint distribution of seen-unseen domains and preserving domain distinction is crucial for these methods.
In this work, we utilize the available unseen class semantics alongside seen class semantics and learn joint distribution through a strong visual-semantic coupling.
- Score: 7.22073260315824
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Bidirectional mapping-based generalized zero-shot learning (GZSL) methods
rely on the quality of synthesized features to recognize seen and unseen data.
Therefore, learning a joint distribution of seen-unseen domains and preserving
domain distinction is crucial for these methods. However, existing methods only
learn the underlying distribution of seen data, although unseen class semantics
are available in the GZSL problem setting. Most methods neglect retaining
domain distinction and use the learned distribution to recognize seen and
unseen data. Consequently, they do not perform well. In this work, we utilize
the available unseen class semantics alongside seen class semantics and learn
joint distribution through a strong visual-semantic coupling. We propose a
bidirectional mapping coupled generative adversarial network (BMCoGAN) by
extending the coupled generative adversarial network into a dual-domain
learning bidirectional mapping model. We further integrate a Wasserstein
generative adversarial optimization to supervise the joint distribution
learning. We design a loss optimization for retaining domain distinctive
information in the synthesized features and reducing bias towards seen classes,
which pushes synthesized seen features towards real seen features and pulls
synthesized unseen features away from real seen features. We evaluate BMCoGAN
on benchmark datasets and demonstrate its superior performance against
contemporary methods.
Related papers
- Dual Relation Mining Network for Zero-Shot Learning [48.89161627050706]
We propose a Dual Relation Mining Network (DRMN) to enable effective visual-semantic interactions and learn semantic relationship among attributes for knowledge transfer.
Specifically, we introduce a Dual Attention Block (DAB) for visual-semantic relationship mining, which enriches visual information by multi-level feature fusion.
For semantic relationship modeling, we utilize a Semantic Interaction Transformer (SIT) to enhance the generalization of attribute representations among images.
arXiv Detail & Related papers (2024-05-06T16:31:19Z) - GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot
Learning [55.79997930181418]
Generalized Zero-Shot Learning aims to recognize images from both the seen and unseen classes by transferring semantic knowledge from seen to unseen classes.
It is a promising solution to take the advantage of generative models to hallucinate realistic unseen samples based on the knowledge learned from the seen classes.
We propose a novel flow-based generative framework that consists of multiple conditional affine coupling layers for learning unseen data generation.
arXiv Detail & Related papers (2022-07-05T04:04:37Z) - Interpolation-based Correlation Reduction Network for Semi-Supervised
Graph Learning [49.94816548023729]
We propose a novel graph contrastive learning method, termed Interpolation-based Correlation Reduction Network (ICRN)
In our method, we improve the discriminative capability of the latent feature by enlarging the margin of decision boundaries.
By combining the two settings, we extract rich supervision information from both the abundant unlabeled nodes and the rare yet valuable labeled nodes for discnative representation learning.
arXiv Detail & Related papers (2022-06-06T14:26:34Z) - Voxel-wise Adversarial Semi-supervised Learning for Medical Image
Segmentation [4.489713477369384]
We introduce a novel adversarial learning-based semi-supervised segmentation method for medical image segmentation.
Our method embeds both local and global features from multiple hidden layers and learns context relations between multiple classes.
Our method outperforms current best-performing state-of-the-art semi-supervised learning approaches on the image segmentation of the left atrium (single class) and multiorgan datasets (multiclass)
arXiv Detail & Related papers (2022-05-14T06:57:19Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning [74.76431541169342]
Zero-shot learning (ZSL) tackles the unseen class recognition problem, transferring semantic knowledge from seen classes to unseen ones.
We propose a novel hierarchical semantic-visual adaptation (HSVA) framework to align semantic and visual domains.
Experiments on four benchmark datasets demonstrate HSVA achieves superior performance on both conventional and generalized ZSL.
arXiv Detail & Related papers (2021-09-30T14:27:50Z) - An Integrated Attribute Guided Dense Attention Model for Fine-Grained
Generalized Zero-Shot Learning [7.22073260315824]
Embedding learning (EL) and feature synthesizing (FS) are two of the popular categories of fine-grained GZSL methods.
We propose to explore global and direct attribute-supervised local visual features for both EL and FS categories.
We demonstrate that our proposed method outperforms contemporary methods on benchmark datasets.
arXiv Detail & Related papers (2020-12-31T21:38:46Z) - Adversarial Bipartite Graph Learning for Video Domain Adaptation [50.68420708387015]
Domain adaptation techniques, which focus on adapting models between distributionally different domains, are rarely explored in the video recognition area.
Recent works on visual domain adaptation which leverage adversarial learning to unify the source and target video representations are not highly effective on the videos.
This paper proposes an Adversarial Bipartite Graph (ABG) learning framework which directly models the source-target interactions.
arXiv Detail & Related papers (2020-07-31T03:48:41Z) - Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot
Learning [21.89909688056478]
We propose a new two-level joint idea to augment the generative network with an inference network during training.
This provides strong cross-modal interaction for effective transfer of knowledge between visual and semantic domains.
We evaluate our approach on four benchmark datasets against several state-of-the-art methods, and show its performance.
arXiv Detail & Related papers (2020-07-15T15:34:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.