Locality-Aware Rotated Ship Detection in High-Resolution Remote Sensing
Imagery Based on Multi-Scale Convolutional Network
- URL: http://arxiv.org/abs/2007.12326v1
- Date: Fri, 24 Jul 2020 03:01:42 GMT
- Title: Locality-Aware Rotated Ship Detection in High-Resolution Remote Sensing
Imagery Based on Multi-Scale Convolutional Network
- Authors: Lingyi Liu, Yunpeng Bai, and Ying Li
- Abstract summary: We propose a locality-aware rotated ship detection (LARSD) framework based on a multi-scale convolutional neural network (CNN)
The proposed framework applies a UNet-like multi-scale CNN to generate multi-scale feature maps with high-level information in high resolution.
To enlarge the detection dataset, we build a new high-resolution ship detection (HRSD) dataset, where 2499 images and 9269 instances were collected from Google Earth with different resolutions.
- Score: 7.984128966509492
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Ship detection has been an active and vital topic in the field of remote
sensing for a decade, but it is still a challenging problem due to the large
scale variations, the high aspect ratios, the intensive arrangement, and the
background clutter disturbance. In this letter, we propose a locality-aware
rotated ship detection (LARSD) framework based on a multi-scale convolutional
neural network (CNN) to tackle these issues. The proposed framework applies a
UNet-like multi-scale CNN to generate multi-scale feature maps with high-level
semantic information in high resolution. Then, a rotated anchor-based
regression is applied for directly predicting the probability, the edge
distances, and the angle of ships. Finally, a locality-aware score alignment is
proposed to fix the mismatch between classification results and location
results caused by the independence of each subnet. Furthermore, to enlarge the
datasets of ship detection, we build a new high-resolution ship detection
(HRSD) dataset, where 2499 images and 9269 instances were collected from Google
Earth with different resolutions. Experiments based on public dataset HRSC2016
and our HRSD dataset demonstrate that our detection method achieves
state-of-the-art performance.
Related papers
- Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery [51.83786195178233]
We design a Knowledge Discovery Network (KDN) to implement the renormalization group theory in terms of efficient feature extraction.
Renormalized connection (RC) on the KDN enables synergistic focusing'' of multi-scale features.
RCs extend the multi-level feature's divide-and-conquer'' mechanism of the FPN-based detectors to a wide range of scale-preferred tasks.
arXiv Detail & Related papers (2024-09-09T13:56:22Z) - PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network [24.54269823691119]
We present an advanced study on more challenging high-resolution salient object detection (HRSOD) from both dataset and network framework perspectives.
To compensate for the lack of HRSOD dataset, we thoughtfully collect a large-scale high resolution salient object detection dataset, called UHRSD.
All the images are finely annotated in pixel-level, far exceeding previous low-resolution SOD datasets.
arXiv Detail & Related papers (2024-08-02T09:31:21Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Learning to Holistically Detect Bridges from Large-Size VHR Remote
Sensing Imagery [40.001753733290464]
It is essential to perform holistic bridge detection in large-size very-high-resolution (VHR) RSIs.
The lack of datasets with large-size VHR RSIs limits the deep learning algorithms' performance on bridge detection.
This paper proposes a large-scale dataset named GLH-Bridge comprising 6,000 VHR RSIs sampled from diverse geographic locations.
arXiv Detail & Related papers (2023-12-05T04:15:22Z) - Pyramid Grafting Network for One-Stage High Resolution Saliency
Detection [29.013012579688347]
We propose a one-stage framework called Pyramid Grafting Network (PGNet) to extract features from different resolution images independently.
An attention-based Cross-Model Grafting Module (CMGM) is proposed to enable CNN branch to combine broken detailed information more holistically.
We contribute a new Ultra-High-Resolution Saliency Detection dataset UHRSD, containing 5,920 images at 4K-8K resolutions.
arXiv Detail & Related papers (2022-04-11T12:22:21Z) - LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution
Homography Estimation [52.63874513999119]
Cross-resolution image alignment is a key problem in multiscale giga photography.
Existing deep homography methods neglecting the explicit formulation of correspondences between them, which leads to degraded accuracy in cross-resolution challenges.
We propose a local transformer network embedded within a multiscale structure to explicitly learn correspondences between the multimodal inputs.
arXiv Detail & Related papers (2021-06-08T02:51:45Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery [43.91170581467171]
State-of-the-art object detection approaches have difficulties detecting dense, small targets with arbitrary orientation in large aerial images.
We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery.
arXiv Detail & Related papers (2020-05-28T19:57:34Z) - SCRDet++: Detecting Small, Cluttered and Rotated Objects via
Instance-Level Feature Denoising and Rotation Loss Smoothing [131.04304632759033]
Small and cluttered objects are common in real-world which are challenging for detection.
In this paper, we first innovatively introduce the idea of denoising to object detection.
Instance-level denoising on the feature map is performed to enhance the detection to small and cluttered objects.
arXiv Detail & Related papers (2020-04-28T06:03:54Z) - A Novel CNN-based Method for Accurate Ship Detection in HR Optical
Remote Sensing Images via Rotated Bounding Box [10.689750889854269]
A novel CNN-based ship detection method is proposed, by overcoming some common deficiencies of current CNN-based methods in ship detection.
We are able to predict the orientation and other variables independently, and yet more effectively, with a novel dual-branch regression network.
Experimental results demonstrate the great superiority of the proposed method in ship detection.
arXiv Detail & Related papers (2020-04-15T14:48:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.