Salient Object Detection via Dynamic Scale Routing
- URL: http://arxiv.org/abs/2210.13821v1
- Date: Tue, 25 Oct 2022 08:01:27 GMT
- Title: Salient Object Detection via Dynamic Scale Routing
- Authors: Zhenyu Wu, Shuai Li, Chenglizhao Chen, Hong Qin, Aimin Hao
- Abstract summary: This paper introduces the "dynamic" scale routing (as a brand-new idea) in this paper.
It will result in a generic plug-in that could directly fit the existing feature backbone.
We provide a self-adaptive bidirectional decoder design to accommodate the DPConv-based encoder best.
- Score: 62.26677215668959
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent research advances in salient object detection (SOD) could largely be
attributed to ever-stronger multi-scale feature representation empowered by the
deep learning technologies. The existing SOD deep models extract multi-scale
features via the off-the-shelf encoders and combine them smartly via various
delicate decoders. However, the kernel sizes in this commonly-used thread are
usually "fixed". In our new experiments, we have observed that kernels of small
size are preferable in scenarios containing tiny salient objects. In contrast,
large kernel sizes could perform better for images with large salient objects.
Inspired by this observation, we advocate the "dynamic" scale routing (as a
brand-new idea) in this paper. It will result in a generic plug-in that could
directly fit the existing feature backbone. This paper's key technical
innovations are two-fold. First, instead of using the vanilla convolution with
fixed kernel sizes for the encoder design, we propose the dynamic pyramid
convolution (DPConv), which dynamically selects the best-suited kernel sizes
w.r.t. the given input. Second, we provide a self-adaptive bidirectional
decoder design to accommodate the DPConv-based encoder best. The most
significant highlight is its capability of routing between feature scales and
their dynamic collection, making the inference process scale-aware. As a
result, this paper continues to enhance the current SOTA performance. Both the
code and dataset are publicly available at
https://github.com/wuzhenyubuaa/DPNet.
Related papers
- D-Net: Dynamic Large Kernel with Dynamic Feature Fusion for Volumetric Medical Image Segmentation [7.894630378784007]
We propose Dynamic Large Kernel (DLK) and Dynamic Feature Fusion (DFF) modules.
D-Net is able to effectively utilize a multi-scale large receptive field and adaptively harness global contextual information.
arXiv Detail & Related papers (2024-03-15T20:49:43Z) - RecursiveDet: End-to-End Region-based Recursive Object Detection [19.799892459080485]
Region-based object detectors like Sparse R-CNN usually have multiple cascade bounding box decoding stages.
In this paper, we find the general setting of decoding stages is actually redundant.
The RecusiveDet is able to achieve obvious performance boosts with even fewer model parameters.
arXiv Detail & Related papers (2023-07-25T16:22:58Z) - Dynamic Perceiver for Efficient Visual Recognition [87.08210214417309]
We propose Dynamic Perceiver (Dyn-Perceiver) to decouple the feature extraction procedure and the early classification task.
A feature branch serves to extract image features, while a classification branch processes a latent code assigned for classification tasks.
Early exits are placed exclusively within the classification branch, thus eliminating the need for linear separability in low-level features.
arXiv Detail & Related papers (2023-06-20T03:00:22Z) - Large Selective Kernel Network for Remote Sensing Object Detection [96.30162456627784]
We propose the Large Selective Kernel Network (LSKNet)
LSKNet can adjust its large spatial receptive field to better model the ranging context of various objects in remote sensing scenarios.
We rank 2nd place in 2022 the Greater Bay Area International Algorithm Competition.
arXiv Detail & Related papers (2023-03-16T02:00:37Z) - Lightweight Salient Object Detection in Optical Remote-Sensing Images
via Semantic Matching and Edge Alignment [61.45639694373033]
We propose a novel lightweight network for optical remote sensing images (ORSI-SOD) based on semantic matching and edge alignment, termed SeaNet.
Specifically, SeaNet includes a lightweight MobileNet-V2 for feature extraction, a dynamic semantic matching module (DSMM) for high-level features, and a portable decoder for inference.
arXiv Detail & Related papers (2023-01-07T04:33:51Z) - ParCNetV2: Oversized Kernel with Enhanced Attention [60.141606180434195]
We introduce a convolutional neural network architecture named ParCNetV2.
It extends position-aware circular convolution (ParCNet) with oversized convolutions and strengthens attention through bifurcate gate units.
Our method outperforms other pure convolutional neural networks as well as neural networks hybridizing CNNs and transformers.
arXiv Detail & Related papers (2022-11-14T07:22:55Z) - Dilated convolution with learnable spacings [6.6389732792316005]
CNNs need receptive fields (RF) to compete with visual transformers.
RFs can simply be enlarged by increasing the convolution kernel sizes.
The number of trainable parameters, which scales quadratically with the kernel's size in the 2D case, rapidly becomes prohibitive.
This paper presents a new method to increase the RF size without increasing the number of parameters.
arXiv Detail & Related papers (2021-12-07T14:54:24Z) - Efficient Person Search: An Anchor-Free Approach [86.45858994806471]
Person search aims to simultaneously localize and identify a query person from realistic, uncropped images.
To achieve this goal, state-of-the-art models typically add a re-id branch upon two-stage detectors like Faster R-CNN.
In this work, we present an anchor-free approach to efficiently tackling this challenging task, by introducing the following dedicated designs.
arXiv Detail & Related papers (2021-09-01T07:01:33Z) - HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation [14.81943833870932]
We present an improvedDepthNet, HR-Depth, with two effective strategies.
Using Resnet-18 as the encoder, HR-Depth surpasses all pre-vious state-of-the-art(SoTA) methods with the least param-eters at both high and low resolution.
arXiv Detail & Related papers (2020-12-14T09:15:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.