Unsupervised Domain Attention Adaptation Network for Caricature
Attribute Recognition
- URL: http://arxiv.org/abs/2007.09344v1
- Date: Sat, 18 Jul 2020 06:38:45 GMT
- Title: Unsupervised Domain Attention Adaptation Network for Caricature
Attribute Recognition
- Authors: Wen Ji, Kelei He, Jing Huo, Zheng Gu, Yang Gao
- Abstract summary: Caricature attributes provide distinctive facial features to help research in Psychology and Neuroscience.
Unlike the facial photo attribute datasets that have a quantity of annotated images, the annotations of caricature attributes are rare.
We propose a caricature attribute dataset, namely WebCariA, to facility the research in attribute learning of caricatures.
- Score: 23.95731281719786
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Caricature attributes provide distinctive facial features to help research in
Psychology and Neuroscience. However, unlike the facial photo attribute
datasets that have a quantity of annotated images, the annotations of
caricature attributes are rare. To facility the research in attribute learning
of caricatures, we propose a caricature attribute dataset, namely WebCariA.
Moreover, to utilize models that trained by face attributes, we propose a novel
unsupervised domain adaptation framework for cross-modality (i.e., photos to
caricatures) attribute recognition, with an integrated inter- and intra-domain
consistency learning scheme. Specifically, the inter-domain consistency
learning scheme consisting an image-to-image translator to first fill the
domain gap between photos and caricatures by generating intermediate image
samples, and a label consistency learning module to align their semantic
information. The intra-domain consistency learning scheme integrates the common
feature consistency learning module with a novel attribute-aware
attention-consistency learning module for a more efficient alignment. We did an
extensive ablation study to show the effectiveness of the proposed method. And
the proposed method also outperforms the state-of-the-art methods by a margin.
The implementation of the proposed method is available at
https://github.com/KeleiHe/DAAN.
Related papers
- AKGNet: Attribute Knowledge-Guided Unsupervised Lung-Infected Area Segmentation [25.874281336821685]
Lung-infected area segmentation is crucial for assessing the severity of lung diseases.
We propose a novel attribute knowledge-guided framework for unsupervised lung-infected area segmentation.
AKGNet facilitates text attribute knowledge learning, attribute-image cross-attention fusion, and high-confidence-based pseudo-label exploration.
arXiv Detail & Related papers (2024-04-17T02:36:02Z) - Improving Generalization of Image Captioning with Unsupervised Prompt
Learning [63.26197177542422]
Generalization of Image Captioning (GeneIC) learns a domain-specific prompt vector for the target domain without requiring annotated data.
GeneIC aligns visual and language modalities with a pre-trained Contrastive Language-Image Pre-Training (CLIP) model.
arXiv Detail & Related papers (2023-08-05T12:27:01Z) - Improving Human-Object Interaction Detection via Virtual Image Learning [68.56682347374422]
Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects.
In this paper, we propose to alleviate the impact of such an unbalanced distribution via Virtual Image Leaning (VIL)
A novel label-to-image approach, Multiple Steps Image Creation (MUSIC), is proposed to create a high-quality dataset that has a consistent distribution with real images.
arXiv Detail & Related papers (2023-08-04T10:28:48Z) - EAML: Ensemble Self-Attention-based Mutual Learning Network for Document
Image Classification [1.1470070927586016]
We design a self-attention-based fusion module that serves as a block in our ensemble trainable network.
It allows to simultaneously learn the discriminant features of image and text modalities throughout the training stage.
This is the first time to leverage a mutual learning approach along with a self-attention-based fusion module to perform document image classification.
arXiv Detail & Related papers (2023-05-11T16:05:03Z) - Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval [85.39613457282107]
Cross-domain nature of sketch-based image retrieval is challenging.
We present an effective Adapt and Align'' approach to address the key challenges.
Inspired by recent advances in image-text foundation models (e.g., CLIP) on zero-shot scenarios, we explicitly align the learned image embedding with a more semantic text embedding to achieve the desired knowledge transfer from seen to unseen classes.
arXiv Detail & Related papers (2023-05-09T03:10:15Z) - PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain
Adaptative Semantic Segmentation [100.6343963798169]
Unsupervised Domain Adaptation (UDA) aims to enhance the generalization of the learned model to other domains.
We propose a unified pixel- and patch-wise self-supervised learning framework, called PiPa, for domain adaptive semantic segmentation.
arXiv Detail & Related papers (2022-11-14T18:31:24Z) - Attribute Prototype Network for Any-Shot Learning [113.50220968583353]
We argue that an image representation with integrated attribute localization ability would be beneficial for any-shot, i.e. zero-shot and few-shot, image classification tasks.
We propose a novel representation learning framework that jointly learns global and local features using only class-level attributes.
arXiv Detail & Related papers (2022-04-04T02:25:40Z) - Attribute Prototype Network for Zero-Shot Learning [113.50220968583353]
We propose a novel zero-shot representation learning framework that jointly learns discriminative global and local features.
Our model points to the visual evidence of the attributes in an image, confirming the improved attribute localization ability of our image representation.
arXiv Detail & Related papers (2020-08-19T06:46:35Z) - Semi-supervised Learning with a Teacher-student Network for Generalized
Attribute Prediction [7.462336024223667]
This paper presents a study on semi-supervised learning to solve the visual attribute prediction problem.
Our method achieves competitive performance on various benchmarks for fashion attribute prediction.
arXiv Detail & Related papers (2020-07-14T02:06:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.