Adaptive Graph Convolution Module for Salient Object Detection
- URL: http://arxiv.org/abs/2303.09801v1
- Date: Fri, 17 Mar 2023 07:07:17 GMT
- Title: Adaptive Graph Convolution Module for Salient Object Detection
- Authors: Yongwoo Lee, Minhyeok Lee, Suhwan Cho, Sangyoun Lee
- Abstract summary: We propose an adaptive graph convolution module (AGCM) to deal with complex scenes.
Prototype features are extracted from the input image using a learnable region generation layer.
The proposed AGCM dramatically improves the SOD performance both quantitatively and quantitatively.
- Score: 7.278033100480174
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Salient object detection (SOD) is a task that involves identifying and
segmenting the most visually prominent object in an image. Existing solutions
can accomplish this use a multi-scale feature fusion mechanism to detect the
global context of an image. However, as there is no consideration of the
structures in the image nor the relations between distant pixels, conventional
methods cannot deal with complex scenes effectively. In this paper, we propose
an adaptive graph convolution module (AGCM) to overcome these limitations.
Prototype features are initially extracted from the input image using a
learnable region generation layer that spatially groups features in the image.
The prototype features are then refined by propagating information between them
based on a graph architecture, where each feature is regarded as a node.
Experimental results show that the proposed AGCM dramatically improves the SOD
performance both quantitatively and quantitatively.
Related papers
- HRGR: Enhancing Image Manipulation Detection via Hierarchical Region-aware Graph Reasoning [41.99592210429157]
We describe a new method named Hierarchical Region-aware Graph Reasoning (HRGR) to enhance image manipulation detection.
Unlike existing grid-based methods, we model image correlations based on content-coherence feature regions with irregular shapes, generated by a novel Differentiable Feature Partition strategy.
Our method is fully differentiable and can seamlessly integrate into mainstream networks in an end-to-end manner, without requiring additional supervision.
arXiv Detail & Related papers (2024-10-29T08:51:30Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - Hi-ResNet: Edge Detail Enhancement for High-Resolution Remote Sensing Segmentation [10.919956120261539]
High-resolution remote sensing (HRS) semantic segmentation extracts key objects from high-resolution coverage areas.
objects of the same category within HRS images show significant differences in scale and shape across diverse geographical environments.
We propose a High-resolution remote sensing network (Hi-ResNet) with efficient network structure designs.
arXiv Detail & Related papers (2023-05-22T03:58:25Z) - Adaptive Rotated Convolution for Rotated Object Detection [96.94590550217718]
We present Adaptive Rotated Convolution (ARC) module to handle rotated object detection problem.
In our ARC module, the convolution kernels rotate adaptively to extract object features with varying orientations in different images.
The proposed approach achieves state-of-the-art performance on the DOTA dataset with 81.77% mAP.
arXiv Detail & Related papers (2023-03-14T11:53:12Z) - Occlusion-Aware Instance Segmentation via BiLayer Network Architectures [73.45922226843435]
We propose Bilayer Convolutional Network (BCNet), where the top layer detects occluding objects (occluders) and the bottom layer infers partially occluded instances (occludees)
We investigate the efficacy of bilayer structure using two popular convolutional network designs, namely, Fully Convolutional Network (FCN) and Graph Convolutional Network (GCN)
arXiv Detail & Related papers (2022-08-08T21:39:26Z) - Semantic Labeling of High Resolution Images Using EfficientUNets and
Transformers [5.177947445379688]
We propose a new segmentation model that combines convolutional neural networks with deep transformers.
Our results demonstrate that the proposed methodology improves segmentation accuracy compared to state-of-the-art techniques.
arXiv Detail & Related papers (2022-06-20T12:03:54Z) - Learning Hierarchical Graph Representation for Image Manipulation
Detection [50.04902159383709]
The objective of image manipulation detection is to identify and locate the manipulated regions in the images.
Recent approaches mostly adopt the sophisticated Convolutional Neural Networks (CNNs) to capture the tampering artifacts left in the images.
We propose a hierarchical Graph Convolutional Network (HGCN-Net), which consists of two parallel branches.
arXiv Detail & Related papers (2022-01-15T01:54:25Z) - PnP-DETR: Towards Efficient Visual Analysis with Transformers [146.55679348493587]
Recently, DETR pioneered the solution vision tasks with transformers, it directly translates the image feature map into the object result.
Recent transformer-based image recognition model andTT show consistent efficiency gain.
arXiv Detail & Related papers (2021-09-15T01:10:30Z) - Prior-Induced Information Alignment for Image Matting [28.90998570043986]
We propose a novel network named Prior-Induced Information Alignment Matting Network (PIIAMatting)
It can efficiently model the distinction of pixel-wise response maps and the correlation of layer-wise feature maps.
PIIAMatting performs favourably against state-of-the-art image matting methods on the Alphamatting.com, Composition-1K and Distinctions-646 dataset.
arXiv Detail & Related papers (2021-06-28T07:46:59Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.