Lightweight Salient Object Detection in Optical Remote-Sensing Images
via Semantic Matching and Edge Alignment
- URL: http://arxiv.org/abs/2301.02778v2
- Date: Mon, 3 Apr 2023 05:02:47 GMT
- Title: Lightweight Salient Object Detection in Optical Remote-Sensing Images
via Semantic Matching and Edge Alignment
- Authors: Gongyang Li, Zhi Liu, Xinpeng Zhang, Weisi Lin
- Abstract summary: We propose a novel lightweight network for optical remote sensing images (ORSI-SOD) based on semantic matching and edge alignment, termed SeaNet.
Specifically, SeaNet includes a lightweight MobileNet-V2 for feature extraction, a dynamic semantic matching module (DSMM) for high-level features, and a portable decoder for inference.
- Score: 61.45639694373033
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Recently, relying on convolutional neural networks (CNNs), many methods for
salient object detection in optical remote sensing images (ORSI-SOD) are
proposed. However, most methods ignore the huge parameters and computational
cost brought by CNNs, and only a few pay attention to the portability and
mobility. To facilitate practical applications, in this paper, we propose a
novel lightweight network for ORSI-SOD based on semantic matching and edge
alignment, termed SeaNet. Specifically, SeaNet includes a lightweight
MobileNet-V2 for feature extraction, a dynamic semantic matching module (DSMM)
for high-level features, an edge self-alignment module (ESAM) for low-level
features, and a portable decoder for inference. First, the high-level features
are compressed into semantic kernels. Then, semantic kernels are used to
activate salient object locations in two groups of high-level features through
dynamic convolution operations in DSMM. Meanwhile, in ESAM, cross-scale edge
information extracted from two groups of low-level features is self-aligned
through L2 loss and used for detail enhancement. Finally, starting from the
highest-level features, the decoder infers salient objects based on the
accurate locations and fine details contained in the outputs of the two
modules. Extensive experiments on two public datasets demonstrate that our
lightweight SeaNet not only outperforms most state-of-the-art lightweight
methods but also yields comparable accuracy with state-of-the-art conventional
methods, while having only 2.76M parameters and running with 1.7G FLOPs for
288x288 inputs. Our code and results are available at
https://github.com/MathLee/SeaNet.
Related papers
- BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images [6.153725909241752]
We propose a lightweight semantic segmentation network called bilateral attention fusion network (BAFNet) to efficiently segment high-resolution urban remote sensing images.
BAFNet outperforms advanced lightweight models in accuracy but also demonstrates comparable performance to non-lightweight state-of-the-art methods on two datasets.
arXiv Detail & Related papers (2024-09-16T13:25:42Z) - LSKNet: A Foundation Lightweight Backbone for Remote Sensing [78.29112381082243]
This paper proposes a lightweight Large Selective Kernel Network (LSKNet) backbone.
LSKNet can adjust its large spatial receptive field to better model the ranging context of various objects in remote sensing scenarios.
Without bells and whistles, our lightweight LSKNet sets new state-of-the-art scores on standard remote sensing classification, object detection and semantic segmentation benchmarks.
arXiv Detail & Related papers (2024-03-18T12:43:38Z) - Salient Object Detection in Optical Remote Sensing Images Driven by
Transformer [69.22039680783124]
We propose a novel Global Extraction Local Exploration Network (GeleNet) for Optical Remote Sensing Images (ORSI-SOD)
Specifically, GeleNet first adopts a transformer backbone to generate four-level feature embeddings with global long-range dependencies.
Extensive experiments on three public datasets demonstrate that the proposed GeleNet outperforms relevant state-of-the-art methods.
arXiv Detail & Related papers (2023-09-15T07:14:43Z) - DPNet: Dual-Path Network for Real-time Object Detection with Lightweight
Attention [15.360769793764526]
This paper presents a dual-path network, named DPNet, with a lightweight attention scheme for real-time object detection.
DPNet achieves state-of-the-art trade-off between detection accuracy and implementation efficiency.
arXiv Detail & Related papers (2022-09-28T09:11:01Z) - Adjacent Context Coordination Network for Salient Object Detection in
Optical Remote Sensing Images [102.75699068451166]
We propose a novel Adjacent Context Coordination Network (ACCoNet) to explore the coordination of adjacent features in an encoder-decoder architecture for optical RSI-SOD.
The proposed ACCoNet outperforms 22 state-of-the-art methods under nine evaluation metrics, and runs up to 81 fps on a single NVIDIA Titan X GPU.
arXiv Detail & Related papers (2022-03-25T14:14:55Z) - Lightweight Salient Object Detection in Optical Remote Sensing Images
via Feature Correlation [93.80710126516405]
We propose a novel lightweight ORSI-SOD solution, named CorrNet, to address these issues.
By reducing the parameters and computations of each component, CorrNet ends up having only 4.09M parameters and running with 21.09G FLOPs.
Experimental results on two public datasets demonstrate that our lightweight CorrNet achieves competitive or even better performance compared with 26 state-of-the-art methods.
arXiv Detail & Related papers (2022-01-20T08:28:01Z) - An Attention-Fused Network for Semantic Segmentation of
Very-High-Resolution Remote Sensing Imagery [26.362854938949923]
We propose a novel convolutional neural network architecture, named attention-fused network (AFNet)
We achieve state-of-the-art performance with an overall accuracy of 91.7% and a mean F1 score of 90.96% on the ISPRS Vaihingen 2D dataset and the ISPRS Potsdam 2D dataset.
arXiv Detail & Related papers (2021-05-10T06:23:27Z) - Adaptive Linear Span Network for Object Skeleton Detection [56.78705071830965]
We propose adaptive linear span network (AdaLSN) to automatically configure and integrate scale-aware features for object skeleton detection.
AdaLSN substantiates its versatility by achieving significantly higher accuracy and latency trade-off.
It also demonstrates general applicability to image-to-mask tasks such as edge detection and road extraction.
arXiv Detail & Related papers (2020-11-08T12:51:14Z) - MACU-Net for Semantic Segmentation of Fine-Resolution Remotely Sensed
Images [11.047174552053626]
MACU-Net is a multi-scale skip connected and asymmetric-convolution-based U-Net for fine-resolution remotely sensed images.
Our design has the following advantages: (1) The multi-scale skip connections combine and realign semantic features contained in both low-level and high-level feature maps; (2) the asymmetric convolution block strengthens the feature representation and feature extraction capability of a standard convolution layer.
Experiments conducted on two remotely sensed datasets demonstrate that the proposed MACU-Net transcends the U-Net, U-NetPPL, U-Net 3+, amongst other benchmark approaches.
arXiv Detail & Related papers (2020-07-26T08:56:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.