Attention-Driven Dynamic Graph Convolutional Network for Multi-Label
Image Recognition
- URL: http://arxiv.org/abs/2012.02994v1
- Date: Sat, 5 Dec 2020 10:10:12 GMT
- Title: Attention-Driven Dynamic Graph Convolutional Network for Multi-Label
Image Recognition
- Authors: Jin Ye, Junjun He, Xiaojiang Peng, Wenhao Wu, and Yu Qiao
- Abstract summary: We propose an Attention-Driven Dynamic Graph Convolutional Network (ADD-GCN) to dynamically generate a specific graph for each image.
Experiments on public multi-label benchmarks demonstrate the effectiveness of our method.
- Score: 53.17837649440601
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent studies often exploit Graph Convolutional Network (GCN) to model label
dependencies to improve recognition accuracy for multi-label image recognition.
However, constructing a graph by counting the label co-occurrence possibilities
of the training data may degrade model generalizability, especially when there
exist occasional co-occurrence objects in test images. Our goal is to eliminate
such bias and enhance the robustness of the learnt features. To this end, we
propose an Attention-Driven Dynamic Graph Convolutional Network (ADD-GCN) to
dynamically generate a specific graph for each image. ADD-GCN adopts a Dynamic
Graph Convolutional Network (D-GCN) to model the relation of content-aware
category representations that are generated by a Semantic Attention Module
(SAM). Extensive experiments on public multi-label benchmarks demonstrate the
effectiveness of our method, which achieves mAPs of 85.2%, 96.0%, and 95.5% on
MS-COCO, VOC2007, and VOC2012, respectively, and outperforms current
state-of-the-art methods with a clear margin. All codes can be found at
https://github.com/Yejin0111/ADD-GCN.
Related papers
- Self-Attention Empowered Graph Convolutional Network for Structure
Learning and Node Embedding [5.164875580197953]
In representation learning on graph-structured data, many popular graph neural networks (GNNs) fail to capture long-range dependencies.
This paper proposes a novel graph learning framework called the graph convolutional network with self-attention (GCN-SA)
The proposed scheme exhibits an exceptional generalization capability in node-level representation learning.
arXiv Detail & Related papers (2024-03-06T05:00:31Z) - GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition [37.02054260449195]
Multi-Label Image Recognition (MLIR) is a challenging task that aims to predict multiple object labels in a single image.
We present the first fully graph convolutional model, Group K-nearest neighbor based Graph convolutional Network (GKGNet)
Our experiments demonstrate that GKGNet achieves state-of-the-art performance with significantly lower computational costs.
arXiv Detail & Related papers (2023-08-28T07:50:04Z) - Learning Hierarchical Graph Representation for Image Manipulation
Detection [50.04902159383709]
The objective of image manipulation detection is to identify and locate the manipulated regions in the images.
Recent approaches mostly adopt the sophisticated Convolutional Neural Networks (CNNs) to capture the tampering artifacts left in the images.
We propose a hierarchical Graph Convolutional Network (HGCN-Net), which consists of two parallel branches.
arXiv Detail & Related papers (2022-01-15T01:54:25Z) - GCN-SE: Attention as Explainability for Node Classification in Dynamic
Graphs [20.330666300034338]
Graph Convolutional Networks (GCNs) are a popular method from graph representation learning.
We propose a new method, GCN-SE, that attaches a set of learnable attention weights to graph snapshots at different times.
We show that GCN-SE outperforms previously proposed node classification methods on a variety of graph datasets.
arXiv Detail & Related papers (2021-10-11T20:30:35Z) - Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation [49.90178055521207]
This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation.
We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths.
In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes.
arXiv Detail & Related papers (2020-12-09T12:40:13Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z) - Multi-Level Graph Convolutional Network with Automatic Graph Learning
for Hyperspectral Image Classification [63.56018768401328]
We propose a Multi-level Graph Convolutional Network (GCN) with Automatic Graph Learning method (MGCN-AGL) for HSI classification.
By employing attention mechanism to characterize the importance among spatially neighboring regions, the most relevant information can be adaptively incorporated to make decisions.
Our MGCN-AGL encodes the long range dependencies among image regions based on the expressive representations that have been produced at local level.
arXiv Detail & Related papers (2020-09-19T09:26:20Z) - Sequential Graph Convolutional Network for Active Learning [53.99104862192055]
We propose a novel pool-based Active Learning framework constructed on a sequential Graph Convolution Network (GCN)
With a small number of randomly sampled images as seed labelled examples, we learn the parameters of the graph to distinguish labelled vs unlabelled nodes.
We exploit these characteristics of GCN to select the unlabelled examples which are sufficiently different from labelled ones.
arXiv Detail & Related papers (2020-06-18T00:55:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.