Instance-Aware Graph Convolutional Network for Multi-Label
Classification
- URL: http://arxiv.org/abs/2008.08407v1
- Date: Wed, 19 Aug 2020 12:49:28 GMT
- Title: Instance-Aware Graph Convolutional Network for Multi-Label
Classification
- Authors: Yun Wang, Tong Zhang, Zhen Cui, Chunyan Xu, Jian Yang
- Abstract summary: Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task.
We propose an instance-aware graph convolutional neural network (IA-GCN) framework for multi-label classification.
- Score: 55.131166957803345
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Graph convolutional neural network (GCN) has effectively boosted the
multi-label image recognition task by introducing label dependencies based on
statistical label co-occurrence of data. However, in previous methods, label
correlation is computed based on statistical information of data and therefore
the same for all samples, and this makes graph inference on labels insufficient
to handle huge variations among numerous image instances. In this paper, we
propose an instance-aware graph convolutional neural network (IA-GCN) framework
for multi-label classification. As a whole, two fused branches of sub-networks
are involved in the framework: a global branch modeling the whole image and a
region-based branch exploring dependencies among regions of interests (ROIs).
For label diffusion of instance-awareness in graph convolution, rather than
using the statistical label correlation alone, an image-dependent label
correlation matrix (LCM), fusing both the statistical LCM and an individual one
of each image instance, is constructed for graph inference on labels to inject
adaptive information of label-awareness into the learned features of the model.
Specifically, the individual LCM of each image is obtained by mining the label
dependencies based on the scores of labels about detected ROIs. In this
process, considering the contribution differences of ROIs to multi-label
classification, variational inference is introduced to learn adaptive scaling
factors for those ROIs by considering their complex distribution. Finally,
extensive experiments on MS-COCO and VOC datasets show that our proposed
approach outperforms existing state-of-the-art methods.
Related papers
- Semantic-Aware Graph Matching Mechanism for Multi-Label Image
Recognition [21.36538164675385]
Multi-label image recognition aims to predict a set of labels that present in an image.
In this paper, we treat each image as a bag of instances, and formulate the task of multi-label image recognition as an instance-label matching selection problem.
We propose an innovative Semantic-aware Graph Matching framework for Multi-Label image recognition (ML-SGM)
arXiv Detail & Related papers (2023-04-21T23:48:01Z) - Leveraging Instance Features for Label Aggregation in Programmatic Weak
Supervision [75.1860418333995]
Programmatic Weak Supervision (PWS) has emerged as a widespread paradigm to synthesize training labels efficiently.
The core component of PWS is the label model, which infers true labels by aggregating the outputs of multiple noisy supervision sources as labeling functions.
Existing statistical label models typically rely only on the outputs of LF, ignoring the instance features when modeling the underlying generative process.
arXiv Detail & Related papers (2022-10-06T07:28:53Z) - Dual-Perspective Semantic-Aware Representation Blending for Multi-Label
Image Recognition with Partial Labels [70.36722026729859]
We propose a dual-perspective semantic-aware representation blending (DSRB) that blends multi-granularity category-specific semantic representation across different images.
The proposed DS consistently outperforms current state-of-the-art algorithms on all proportion label settings.
arXiv Detail & Related papers (2022-05-26T00:33:44Z) - Graph Attention Transformer Network for Multi-Label Image Classification [50.0297353509294]
We propose a general framework for multi-label image classification that can effectively mine complex inter-label relationships.
Our proposed methods can achieve state-of-the-art performance on three datasets.
arXiv Detail & Related papers (2022-03-08T12:39:05Z) - Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels [86.17081952197788]
We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
arXiv Detail & Related papers (2022-03-04T07:56:16Z) - Multi-layered Semantic Representation Network for Multi-label Image
Classification [8.17894017454724]
Multi-label image classification (MLIC) is a fundamental and practical task, which aims to assign multiple possible labels to an image.
In recent years, many deep convolutional neural network (CNN) based approaches have been proposed which model label correlations.
This paper advances this research direction by improving the modeling of label correlations and the learning of semantic representations.
arXiv Detail & Related papers (2021-06-22T08:04:22Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.