Large Selective Kernel Network for Remote Sensing Object Detection
- URL: http://arxiv.org/abs/2303.09030v2
- Date: Mon, 20 Mar 2023 02:11:05 GMT
- Title: Large Selective Kernel Network for Remote Sensing Object Detection
- Authors: Yuxuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang and
Xiang Li
- Abstract summary: We propose the Large Selective Kernel Network (LSKNet)
LSKNet can adjust its large spatial receptive field to better model the ranging context of various objects in remote sensing scenarios.
We rank 2nd place in 2022 the Greater Bay Area International Algorithm Competition.
- Score: 96.30162456627784
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Recent research on remote sensing object detection has largely focused on
improving the representation of oriented bounding boxes but has overlooked the
unique prior knowledge presented in remote sensing scenarios. Such prior
knowledge can be useful because tiny remote sensing objects may be mistakenly
detected without referencing a sufficiently long-range context, and the
long-range context required by different types of objects can vary. In this
paper, we take these priors into account and propose the Large Selective Kernel
Network (LSKNet). LSKNet can dynamically adjust its large spatial receptive
field to better model the ranging context of various objects in remote sensing
scenarios. To the best of our knowledge, this is the first time that large and
selective kernel mechanisms have been explored in the field of remote sensing
object detection. Without bells and whistles, LSKNet sets new state-of-the-art
scores on standard benchmarks, i.e., HRSC2016 (98.46\% mAP), DOTA-v1.0 (81.85\%
mAP) and FAIR1M-v1.0 (47.87\% mAP). Based on a similar technique, we rank 2nd
place in 2022 the Greater Bay Area International Algorithm Competition. Code is
available at https://github.com/zcablii/Large-Selective-Kernel-Network.
Related papers
- LSKNet: A Foundation Lightweight Backbone for Remote Sensing [78.29112381082243]
This paper proposes a lightweight Large Selective Kernel Network (LSKNet) backbone.
LSKNet can adjust its large spatial receptive field to better model the ranging context of various objects in remote sensing scenarios.
Without bells and whistles, our lightweight LSKNet sets new state-of-the-art scores on standard remote sensing classification, object detection and semantic segmentation benchmarks.
arXiv Detail & Related papers (2024-03-18T12:43:38Z) - Tucker Bilinear Attention Network for Multi-scale Remote Sensing Object
Detection [10.060030309684953]
Large-scale variation of remote-sensing targets is one of main challenges in VHR remote-sensing object detection.
This paper proposes two novel modules: Guided Attention and Tucker Bilinear Attention.
Based on two modules, we build a new multi-scale remote sensing object detection framework.
arXiv Detail & Related papers (2023-03-09T15:20:03Z) - Open-Set Object Detection Using Classification-free Object Proposal and
Instance-level Contrastive Learning [25.935629339091697]
Open-set object detection (OSOD) is a promising direction to handle the problem consisting of two subtasks: objects and background separation, and open-set object classification.
We present Openset RCNN to address the challenging OSOD.
We show that our Openset RCNN can endow the robot with an open-set perception ability to support robotic rearrangement tasks in cluttered environments.
arXiv Detail & Related papers (2022-11-21T15:00:04Z) - Salient Object Detection via Dynamic Scale Routing [62.26677215668959]
This paper introduces the "dynamic" scale routing (as a brand-new idea) in this paper.
It will result in a generic plug-in that could directly fit the existing feature backbone.
We provide a self-adaptive bidirectional decoder design to accommodate the DPConv-based encoder best.
arXiv Detail & Related papers (2022-10-25T08:01:27Z) - RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object
Detection [45.10513110142015]
Current anchor-based or anchor-free label assignment paradigms incur many outlier tiny-sized ground truth samples.
We propose a Gaussian Receptive Field based Label Assignment (RFLA) strategy for tiny object detection.
Our approach outperforms the state-of-the-art competitors with 4.0 AP points on the AI-TOD dataset.
arXiv Detail & Related papers (2022-08-18T09:35:56Z) - Discovery-and-Selection: Towards Optimal Multiple Instance Learning for
Weakly Supervised Object Detection [86.86602297364826]
We propose a discoveryand-selection approach fused with multiple instance learning (DS-MIL)
Our proposed DS-MIL approach can consistently improve the baselines, reporting state-of-the-art performance.
arXiv Detail & Related papers (2021-10-18T07:06:57Z) - Oriented R-CNN for Object Detection [61.78746189807462]
This work proposes an effective and simple oriented object detection framework, termed Oriented R-CNN.
In the first stage, we propose an oriented Region Proposal Network (oriented RPN) that directly generates high-quality oriented proposals in a nearly cost-free manner.
The second stage is oriented R-CNN head for refining oriented Regions of Interest (oriented RoIs) and recognizing them.
arXiv Detail & Related papers (2021-08-12T12:47:43Z) - VarifocalNet: An IoU-aware Dense Object Detector [11.580759212782812]
We learn an Iou-aware Classification Score (IACS) as a joint representation of object presence confidence and localization accuracy.
We show that dense object detectors can achieve a more accurate ranking of candidate detections based on the IACS.
We build an IoU-aware dense object detector based on the FCOS+ATSS architecture, that we call VarifocalNet or VFNet for short.
arXiv Detail & Related papers (2020-08-31T05:12:21Z) - Dynamic Refinement Network for Oriented and Densely Packed Object
Detection [75.29088991850958]
We present a dynamic refinement network that consists of two novel components, i.e., a feature selection module (FSM) and a dynamic refinement head (DRH)
Our FSM enables neurons to adjust receptive fields in accordance with the shapes and orientations of target objects, whereas the DRH empowers our model to refine the prediction dynamically in an object-aware manner.
We perform quantitative evaluations on several publicly available benchmarks including DOTA, HRSC2016, SKU110K, and our own SKU110K-R dataset.
arXiv Detail & Related papers (2020-05-20T11:35:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.