Rethinking Lightweight Salient Object Detection via Network Depth-Width
Tradeoff
- URL: http://arxiv.org/abs/2301.06679v1
- Date: Tue, 17 Jan 2023 03:43:25 GMT
- Title: Rethinking Lightweight Salient Object Detection via Network Depth-Width
Tradeoff
- Authors: Jia Li, Shengye Qiao, Zhirui Zhao, Chenxi Xie, Xiaowu Chen and
Changqun Xia
- Abstract summary: Existing salient object detection methods often adopt deeper and wider networks for better performance.
We propose a novel trilateral decoder framework by decoupling the U-shape structure into three complementary branches.
We show that our method achieves better efficiency-accuracy balance across five benchmarks.
- Score: 26.566339984225756
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Existing salient object detection methods often adopt deeper and wider
networks for better performance, resulting in heavy computational burden and
slow inference speed. This inspires us to rethink saliency detection to achieve
a favorable balance between efficiency and accuracy. To this end, we design a
lightweight framework while maintaining satisfying competitive accuracy.
Specifically, we propose a novel trilateral decoder framework by decoupling the
U-shape structure into three complementary branches, which are devised to
confront the dilution of semantic context, loss of spatial structure and
absence of boundary detail, respectively. Along with the fusion of three
branches, the coarse segmentation results are gradually refined in structure
details and boundary quality. Without adding additional learnable parameters,
we further propose Scale-Adaptive Pooling Module to obtain multi-scale
receptive filed. In particular, on the premise of inheriting this framework, we
rethink the relationship among accuracy, parameters and speed via network
depth-width tradeoff. With these insightful considerations, we comprehensively
design shallower and narrower models to explore the maximum potential of
lightweight SOD. Our models are purposed for different application
environments: 1) a tiny version CTD-S (1.7M, 125FPS) for resource constrained
devices, 2) a fast version CTD-M (12.6M, 158FPS) for speed-demanding scenarios,
3) a standard version CTD-L (26.5M, 84FPS) for high-performance platforms.
Extensive experiments validate the superiority of our method, which achieves
better efficiency-accuracy balance across five benchmarks.
Related papers
- Accelerating Deep Neural Networks via Semi-Structured Activation
Sparsity [0.0]
Exploiting sparsity in the network's feature maps is one of the ways to reduce its inference latency.
We propose a solution to induce semi-structured activation sparsity exploitable through minor runtime modifications.
Our approach yields a speed improvement of $1.25 times$ with a minimal accuracy drop of $1.1%$ for the ResNet18 model on the ImageNet dataset.
arXiv Detail & Related papers (2023-09-12T22:28:53Z) - Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural
Network Pruning [9.33753001494221]
Network pruning is a widely used technique to reduce computation cost and model size for deep neural networks.
In this paper, we develop a systematic weight-pruning optimization approach based on Surrogate Lagrangian relaxation.
arXiv Detail & Related papers (2023-04-08T22:48:30Z) - Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution [91.3781512926942]
Image super-resolution (SR) has witnessed extensive neural network designs from CNN to transformer architectures.
This work investigates the potential of network pruning for super-resolution iteration to take advantage of off-the-shelf network designs and reduce the underlying computational overhead.
We propose a novel Iterative Soft Shrinkage-Percentage (ISS-P) method by optimizing the sparse structure of a randomly network at each and tweaking unimportant weights with a small amount proportional to the magnitude scale on-the-fly.
arXiv Detail & Related papers (2023-03-16T21:06:13Z) - PIDNet: A Real-time Semantic Segmentation Network Inspired by PID
Controllers [6.0653144230649865]
Two-branch network architecture has shown its efficiency and effectiveness in real-time semantic segmentation tasks.
We propose a novel three-branch network architecture: PIDNet, which contains three branches to parse detailed, context and boundary information.
Our family of PIDNets achieve the best trade-off between inference speed and accuracy and their accuracy surpasses all the existing models with similar inference speed on the Cityscapes and CamVid datasets.
arXiv Detail & Related papers (2022-06-04T23:16:52Z) - An Adaptive Device-Edge Co-Inference Framework Based on Soft
Actor-Critic [72.35307086274912]
High-dimension parameter model and large-scale mathematical calculation restrict execution efficiency, especially for Internet of Things (IoT) devices.
We propose a new Deep Reinforcement Learning (DRL)-Soft Actor Critic for discrete (SAC-d), which generates the emphexit point, emphexit point, and emphcompressing bits by soft policy iterations.
Based on the latency and accuracy aware reward design, such an computation can well adapt to the complex environment like dynamic wireless channel and arbitrary processing, and is capable of supporting the 5G URL
arXiv Detail & Related papers (2022-01-09T09:31:50Z) - Joint Channel and Weight Pruning for Model Acceleration on Moblie
Devices [37.51092726022731]
pruning is a widely adopted practice to balance the computational resource consumption and the accuracy.
We present a unified framework with Joint Channel pruning and Weight pruning (JCW)
We develop a tailored multi-objective evolutionary algorithm in the JCW framework, which enables one single search to obtain the optimal candidate architectures.
arXiv Detail & Related papers (2021-10-15T11:18:42Z) - FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation [81.76975488010213]
Dense optical flow estimation plays a key role in many robotic vision tasks.
Current networks often occupy large number of parameters and require heavy computation costs.
Our proposed FastFlowNet works in the well-known coarse-to-fine manner with following innovations.
arXiv Detail & Related papers (2021-03-08T03:09:37Z) - FDFlowNet: Fast Optical Flow Estimation using a Deep Lightweight Network [12.249680550252327]
We present a lightweight yet effective model for real-time optical flow estimation, termed FDFlowNet (fast deep flownet)
We achieve better or similar accuracy on the challenging KITTI and Sintel benchmarks while being about 2 times faster than PWC-Net.
arXiv Detail & Related papers (2020-06-22T14:01:01Z) - A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration
Framework [56.57225686288006]
Weight pruning of deep neural networks (DNNs) has been proposed to satisfy the limited storage and computing capability of mobile edge devices.
Previous pruning methods mainly focus on reducing the model size and/or improving performance without considering the privacy of user data.
We propose a privacy-preserving-oriented pruning and mobile acceleration framework that does not require the private training dataset.
arXiv Detail & Related papers (2020-03-13T23:52:03Z) - Toward fast and accurate human pose estimation via soft-gated skip
connections [97.06882200076096]
This paper is on highly accurate and highly efficient human pose estimation.
We re-analyze this design choice in the context of improving both the accuracy and the efficiency over the state-of-the-art.
Our model achieves state-of-the-art results on the MPII and LSP datasets.
arXiv Detail & Related papers (2020-02-25T18:51:51Z) - Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by
Enabling Input-Adaptive Inference [119.19779637025444]
Deep networks were recently suggested to face the odds between accuracy (on clean natural images) and robustness (on adversarially perturbed images)
This paper studies multi-exit networks associated with input-adaptive inference, showing their strong promise in achieving a "sweet point" in cooptimizing model accuracy, robustness and efficiency.
arXiv Detail & Related papers (2020-02-24T00:40:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.