Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels
- URL: http://arxiv.org/abs/2203.02172v1
- Date: Fri, 4 Mar 2022 07:56:16 GMT
- Title: Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels
- Authors: Tao Pu, Tianshui Chen, Hefeng Wu, Liang Lin
- Abstract summary: We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
- Score: 86.17081952197788
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Training the multi-label image recognition models with partial labels, in
which merely some labels are known while others are unknown for each image, is
a considerably challenging and practical task. To address this task, current
algorithms mainly depend on pre-training classification or similarity models to
generate pseudo labels for the unknown labels. However, these algorithms depend
on sufficient multi-label annotations to train the models, leading to poor
performance especially with low known label proportion. In this work, we
propose to blend category-specific representation across different images to
transfer information of known labels to complement unknown labels, which can
get rid of pre-training models and thus does not depend on sufficient
annotations. To this end, we design a unified semantic-aware representation
blending (SARB) framework that exploits instance-level and prototype-level
semantic representation to complement unknown labels by two complementary
modules: 1) an instance-level representation blending (ILRB) module blends the
representations of the known labels in an image to the representations of the
unknown labels in another image to complement these unknown labels. 2) a
prototype-level representation blending (PLRB) module learns more stable
representation prototypes for each category and blends the representation of
unknown labels with the prototypes of corresponding labels to complement these
labels. Extensive experiments on the MS-COCO, Visual Genome, Pascal VOC 2007
datasets show that the proposed SARB framework obtains superior performance
over current leading competitors on all known label proportion settings, i.e.,
with the mAP improvement of 4.6%, 4.%, 2.2% on these three datasets when the
known label proportion is 10%. Codes are available at
https://github.com/HCPLab-SYSU/HCP-MLR-PL.
Related papers
- Semantic-Aware Graph Matching Mechanism for Multi-Label Image
Recognition [21.36538164675385]
Multi-label image recognition aims to predict a set of labels that present in an image.
In this paper, we treat each image as a bag of instances, and formulate the task of multi-label image recognition as an instance-label matching selection problem.
We propose an innovative Semantic-aware Graph Matching framework for Multi-Label image recognition (ML-SGM)
arXiv Detail & Related papers (2023-04-21T23:48:01Z) - Dual-Perspective Semantic-Aware Representation Blending for Multi-Label
Image Recognition with Partial Labels [70.36722026729859]
We propose a dual-perspective semantic-aware representation blending (DSRB) that blends multi-granularity category-specific semantic representation across different images.
The proposed DS consistently outperforms current state-of-the-art algorithms on all proportion label settings.
arXiv Detail & Related papers (2022-05-26T00:33:44Z) - Heterogeneous Semantic Transfer for Multi-label Recognition with Partial Labels [70.45813147115126]
Multi-label image recognition with partial labels (MLR-PL) may greatly reduce the cost of annotation and thus facilitate large-scale MLR.
We find that strong semantic correlations exist within each image and across different images.
These correlations can help transfer the knowledge possessed by the known labels to retrieve the unknown labels.
arXiv Detail & Related papers (2022-05-23T08:37:38Z) - Structured Semantic Transfer for Multi-Label Recognition with Partial
Labels [85.6967666661044]
We propose a structured semantic transfer (SST) framework that enables training multi-label recognition models with partial labels.
The framework consists of two complementary transfer modules that explore within-image and cross-image semantic correlations.
Experiments on the Microsoft COCO, Visual Genome and Pascal VOC datasets show that the proposed SST framework obtains superior performance over current state-of-the-art algorithms.
arXiv Detail & Related papers (2021-12-21T02:15:01Z) - Inferring Prototypes for Multi-Label Few-Shot Image Classification with
Word Vector Guided Attention [45.6809084493491]
Multi-label few-shot image classification (ML-FSIC) is the task of assigning descriptive labels to previously unseen images.
In this paper we propose to use word embeddings as a form of prior knowledge about the meaning of the labels.
Our model can infer prototypes for unseen labels without the need for fine-tuning any model parameters.
arXiv Detail & Related papers (2021-12-02T07:59:11Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z) - Instance-Aware Graph Convolutional Network for Multi-Label
Classification [55.131166957803345]
Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task.
We propose an instance-aware graph convolutional neural network (IA-GCN) framework for multi-label classification.
arXiv Detail & Related papers (2020-08-19T12:49:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.