Semantic-Aware Graph Matching Mechanism for Multi-Label Image
Recognition
- URL: http://arxiv.org/abs/2304.11275v1
- Date: Fri, 21 Apr 2023 23:48:01 GMT
- Title: Semantic-Aware Graph Matching Mechanism for Multi-Label Image
Recognition
- Authors: Yanan Wu, Songhe Feng and Yang Wang
- Abstract summary: Multi-label image recognition aims to predict a set of labels that present in an image.
In this paper, we treat each image as a bag of instances, and formulate the task of multi-label image recognition as an instance-label matching selection problem.
We propose an innovative Semantic-aware Graph Matching framework for Multi-Label image recognition (ML-SGM)
- Score: 21.36538164675385
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multi-label image recognition aims to predict a set of labels that present in
an image. The key to deal with such problem is to mine the associations between
image contents and labels, and further obtain the correct assignments between
images and their labels. In this paper, we treat each image as a bag of
instances, and formulate the task of multi-label image recognition as an
instance-label matching selection problem. To model such problem, we propose an
innovative Semantic-aware Graph Matching framework for Multi-Label image
recognition (ML-SGM), in which Graph Matching mechanism is introduced owing to
its good performance of excavating the instance and label relationship. The
framework explicitly establishes category correlations and instance-label
correspondences by modeling the relation among content-aware (instance) and
semantic-aware (label) category representations, to facilitate multi-label
image understanding and reduce the dependency of large amounts of training
samples for each category. Specifically, we first construct an instance spatial
graph and a label semantic graph respectively and then incorporate them into a
constructed assignment graph by connecting each instance to all labels.
Subsequently, the graph network block is adopted to aggregate and update all
nodes and edges state on the assignment graph to form structured
representations for each instance and label. Our network finally derives a
prediction score for each instance-label correspondence and optimizes such
correspondence with a weighted cross-entropy loss. Empirical results conducted
on generic multi-label image recognition demonstrate the superiority of our
proposed method. Moreover, the proposed method also shows advantages in
multi-label recognition with partial labels and multi-label few-shot learning,
as well as outperforms current state-of-the-art methods with a clear margin.
Related papers
- Towards Effective Multi-Label Recognition Attacks via Knowledge Graph
Consistency [33.250544869840155]
We show that the naive extensions of multi-class attacks to the multi-label setting lead to violating label relationships.
We propose a graph-consistent multi-label attack framework, which searches for small image perturbations that lead to misclassifying a desired target set.
arXiv Detail & Related papers (2022-07-11T19:08:32Z) - Dual-Perspective Semantic-Aware Representation Blending for Multi-Label
Image Recognition with Partial Labels [70.36722026729859]
We propose a dual-perspective semantic-aware representation blending (DSRB) that blends multi-granularity category-specific semantic representation across different images.
The proposed DS consistently outperforms current state-of-the-art algorithms on all proportion label settings.
arXiv Detail & Related papers (2022-05-26T00:33:44Z) - Graph Attention Transformer Network for Multi-Label Image Classification [50.0297353509294]
We propose a general framework for multi-label image classification that can effectively mine complex inter-label relationships.
Our proposed methods can achieve state-of-the-art performance on three datasets.
arXiv Detail & Related papers (2022-03-08T12:39:05Z) - Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels [86.17081952197788]
We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
arXiv Detail & Related papers (2022-03-04T07:56:16Z) - Structured Semantic Transfer for Multi-Label Recognition with Partial
Labels [85.6967666661044]
We propose a structured semantic transfer (SST) framework that enables training multi-label recognition models with partial labels.
The framework consists of two complementary transfer modules that explore within-image and cross-image semantic correlations.
Experiments on the Microsoft COCO, Visual Genome and Pascal VOC datasets show that the proposed SST framework obtains superior performance over current state-of-the-art algorithms.
arXiv Detail & Related papers (2021-12-21T02:15:01Z) - GM-MLIC: Graph Matching based Multi-Label Image Classification [20.118173194957052]
Multi-Label Image Classification (MLIC) aims to predict a set of labels that present in an image.
In this paper, we treat each image as a bag of instances, and reformulate the task of MLIC as an instance-label matching selection problem.
We propose a novel deep learning framework named Graph Matching based Multi-Label Image Classification (GM-MLIC)
arXiv Detail & Related papers (2021-04-30T05:36:25Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z) - SSKD: Self-Supervised Knowledge Distillation for Cross Domain Adaptive
Person Re-Identification [25.96221714337815]
Domain adaptive person re-identification (re-ID) is a challenging task due to the large discrepancy between the source domain and the target domain.
Existing methods mainly attempt to generate pseudo labels for unlabeled target images by clustering algorithms.
We propose a Self-Supervised Knowledge Distillation (SSKD) technique containing two modules, the identity learning and the soft label learning.
arXiv Detail & Related papers (2020-09-13T10:12:02Z) - Instance-Aware Graph Convolutional Network for Multi-Label
Classification [55.131166957803345]
Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task.
We propose an instance-aware graph convolutional neural network (IA-GCN) framework for multi-label classification.
arXiv Detail & Related papers (2020-08-19T12:49:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.