Cross-layer Feature Pyramid Network for Salient Object Detection
- URL: http://arxiv.org/abs/2002.10864v1
- Date: Tue, 25 Feb 2020 14:06:27 GMT
- Title: Cross-layer Feature Pyramid Network for Salient Object Detection
- Authors: Zun Li, Congyan Lang, Junhao Liew, Qibin Hou, Yidong Li, Jiashi Feng
- Abstract summary: We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
- Score: 102.20031050972429
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Feature pyramid network (FPN) based models, which fuse the semantics and
salient details in a progressive manner, have been proven highly effective in
salient object detection. However, it is observed that these models often
generate saliency maps with incomplete object structures or unclear object
boundaries, due to the \emph{indirect} information propagation among distant
layers that makes such fusion structure less effective. In this work, we
propose a novel Cross-layer Feature Pyramid Network (CFPN), in which direct
cross-layer communication is enabled to improve the progressive fusion in
salient object detection. Specifically, the proposed network first aggregates
multi-scale features from different layers into feature maps that have access
to both the high- and low-level information. Then, it distributes the
aggregated features to all the involved layers to gain access to richer
context. In this way, the distributed features per layer own both semantics and
salient details from all other layers simultaneously, and suffer reduced loss
of important information. Extensive experimental results over six widely used
salient object detection benchmarks and with three popular backbones clearly
demonstrate that CFPN can accurately locate fairly complete salient regions and
effectively segment the object boundaries.
Related papers
- Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection [57.883265488038134]
We propose a hierarchical graph interaction network termed HGINet for camouflaged object detection.
The network is capable of discovering imperceptible objects via effective graph interaction among the hierarchical tokenized features.
Our experiments demonstrate the superior performance of HGINet compared to existing state-of-the-art methods.
arXiv Detail & Related papers (2024-08-27T12:53:25Z) - PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection [59.355022416218624]
integration of point and voxel representations is becoming more common in LiDAR-based 3D object detection.
We propose a novel two-stage 3D object detector, called Point-Voxel Attention Fusion Network (PVAFN)
PVAFN uses a multi-pooling strategy to integrate both multi-scale and region-specific information effectively.
arXiv Detail & Related papers (2024-08-26T19:43:01Z) - Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images [5.652171904017473]
Object detection in aerial images has always been a challenging task due to the generally small size of the objects.
Most current detectors prioritize novel detection frameworks, often overlooking research on fundamental components such as feature pyramid networks.
We introduce the Cross-Layer Feature Pyramid Transformer (CFPT), a novel upsampler-free feature pyramid network designed specifically for small object detection in aerial images.
arXiv Detail & Related papers (2024-07-29T04:40:18Z) - FIPGNet:Pyramid grafting network with feature interaction strategies [0.0]
We propose a new salience object detection framework(FIPGNet), which is a pyramid graft network with feature interaction strategies.
Specifically, we propose an attention-mechanism based feature interaction strategy (FIA) that innovatively introduces spatial agent Cross Attention.
The proposed method outperforms the current 12 salient object detection methods on four indicators.
arXiv Detail & Related papers (2024-07-04T17:53:37Z) - M$^3$Net: Multilevel, Mixed and Multistage Attention Network for Salient
Object Detection [22.60675416709486]
M$3$Net is an attention network for Salient Object Detection.
Cross-attention approach to achieve the interaction between multilevel features.
Mixed Attention Block aims at modeling context at both global and local levels.
Multilevel supervision strategy to optimize the aggregated feature stage-by-stage.
arXiv Detail & Related papers (2023-09-15T12:46:14Z) - De-coupling and De-positioning Dense Self-supervised Learning [65.56679416475943]
Dense Self-Supervised Learning (SSL) methods address the limitations of using image-level feature representations when handling images with multiple objects.
We show that they suffer from coupling and positional bias, which arise from the receptive field increasing with layer depth and zero-padding.
We demonstrate the benefits of our method on COCO and on a new challenging benchmark, OpenImage-MINI, for object classification, semantic segmentation, and object detection.
arXiv Detail & Related papers (2023-03-29T18:07:25Z) - Feature Aggregation and Propagation Network for Camouflaged Object
Detection [42.33180748293329]
Camouflaged object detection (COD) aims to detect/segment camouflaged objects embedded in the environment.
Several COD methods have been developed, but they still suffer from unsatisfactory performance due to intrinsic similarities between foreground objects and background surroundings.
We propose a novel Feature Aggregation and propagation Network (FAP-Net) for camouflaged object detection.
arXiv Detail & Related papers (2022-12-02T05:54:28Z) - Saliency Detection via Global Context Enhanced Feature Fusion and Edge
Weighted Loss [6.112591965159383]
We propose a context fusion decoder network (CFDN) and near edge weighted loss (NEWLoss) function.
The CFDN creates an accurate saliency map by integrating global context information and thus suppressing the influence of the unnecessary spatial information.
NewLoss accelerates learning of obscure boundaries without additional modules by generating weight maps on object boundaries.
arXiv Detail & Related papers (2021-10-13T08:04:55Z) - Global Context-Aware Progressive Aggregation Network for Salient Object
Detection [117.943116761278]
We propose a novel network named GCPANet to integrate low-level appearance features, high-level semantic features, and global context features.
We show that the proposed approach outperforms the state-of-the-art methods both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-03-02T04:26:10Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.