SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in
Panchromatic Satellite Images
- URL: http://arxiv.org/abs/2402.03708v1
- Date: Tue, 6 Feb 2024 05:02:33 GMT
- Title: SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in
Panchromatic Satellite Images
- Authors: Pengming Feng, Mingjie Xie, Hongning Liu, Xuanjia Zhao, Guangjun He,
Xueliang Zhang, Jian Guan
- Abstract summary: We propose a benchmark dataset for fine-grained Ship Instance in Panchromatic satellite images, namely SISP.
SISP contains 56,693 well-annotated ship instances with four fine-grained categories across 10,000 sliced images.
Targets in the proposed SISP dataset have characteristics that are consistent with real satellite scenes.
- Score: 25.259591254585388
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Fine-grained ship instance segmentation in satellite images holds
considerable significance for monitoring maritime activities at sea. However,
existing datasets often suffer from the scarcity of fine-grained information or
pixel-wise localization annotations, as well as the insufficient image
diversity and variations, thus limiting the research of this task. To this end,
we propose a benchmark dataset for fine-grained Ship Instance Segmentation in
Panchromatic satellite images, namely SISP, which contains 56,693
well-annotated ship instances with four fine-grained categories across 10,000
sliced images, and all the images are collected from SuperView-1 satellite with
the resolution of 0.5m. Targets in the proposed SISP dataset have
characteristics that are consistent with real satellite scenes, such as high
class imbalance, various scenes, large variations in target densities and
scales, and high inter-class similarity and intra-class diversity, all of which
make the SISP dataset more suitable for real-world applications. In addition,
we introduce a Dynamic Feature Refinement-assist Instance segmentation network,
namely DFRInst, as the benchmark method for ship instance segmentation in
satellite images, which can fortify the explicit representation of crucial
features, thus improving the performance of ship instance segmentation.
Experiments and analysis are performed on the proposed SISP dataset to evaluate
the benchmark method and several state-of-the-art methods to establish
baselines for facilitating future research. The proposed dataset and source
codes will be available at: https://github.com/Justlovesmile/SISP.
Related papers
- SMART-Ship: A Comprehensive Synchronized Multi-modal Aligned Remote Sensing Targets Dataset and Benchmark for Berthed Ships Analysis [12.87083600993665]
This dataset consists of 1092 multi-modal image sets, covering 38,838 ships.<n>Each image set is acquired within one week and registered to ensuretemporal consistency.<n>We define benchmarks on five fundamental tasks and compare methods across the dataset.
arXiv Detail & Related papers (2025-08-04T13:09:58Z) - USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation [11.590111778515775]
We introduce USIS16K, a large-scale dataset comprising 16,151 high-resolution underwater images.<n>Each image is annotated with high-quality instance-level salient object masks.<n>We provide benchmark evaluations on underwater object detection and USIS tasks using USIS16K.
arXiv Detail & Related papers (2025-06-24T09:58:01Z) - Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline [14.081609886645555]
We introduce the first HRSI-SOD dataset, termed HRSSD, which includes 704 hyperspectral images and 5327 pixel-level annotated salient objects.
The HRSSD dataset poses substantial challenges for salient object detection algorithms due to large scale variation, diverse foreground-background relations, and multi-salient objects.
We propose an innovative and efficient baseline model for HRSI-SOD, termed the Deep Spectral Saliency Network (DSSN)
arXiv Detail & Related papers (2025-04-03T09:12:42Z) - A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder [26.81539884309151]
Remote sensing (RS) data provide Earth observations across multiple dimensions, encompassing critical spatial, temporal, and spectral information.
Despite various pre-training methods tailored to the characteristics of RS data, a key limitation persists: the inability to effectively integrate spatial, temporal, and spectral information within a single unified model.
We propose an Anchor-Aware Masked AutoEncoder method (A$2$-MAE), leveraging intrinsic complementary information from the different kinds of images and geo-information to reconstruct the masked patches during the pre-training phase.
arXiv Detail & Related papers (2024-06-12T11:02:15Z) - Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery [4.499833362998487]
This study explores the effectiveness of a Cut-and-Paste augmentation technique for semantic segmentation in satellite images.
We adapt this augmentation, which usually requires labeled instances, to the case of semantic segmentation.
Using the DynamicEarthNet dataset and a U-Net model for evaluation, we found that this augmentation significantly enhances the mIoU score on the test set from 37.9 to 44.1.
arXiv Detail & Related papers (2024-04-08T17:18:30Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment
Anything Model [85.85899655118087]
We develop an efficient pipeline for generating a large-scale RS segmentation dataset, dubbed SAMRS.
SAMRS totally possesses 105,090 images and 1,668,241 instances, surpassing existing high-resolution RS segmentation datasets in size by several orders of magnitude.
arXiv Detail & Related papers (2023-05-03T10:58:07Z) - DCN-T: Dual Context Network with Transformer for Hyperspectral Image
Classification [109.09061514799413]
Hyperspectral image (HSI) classification is challenging due to spatial variability caused by complex imaging conditions.
We propose a tri-spectral image generation pipeline that transforms HSI into high-quality tri-spectral images.
Our proposed method outperforms state-of-the-art methods for HSI classification.
arXiv Detail & Related papers (2023-04-19T18:32:52Z) - CEU-Net: Ensemble Semantic Segmentation of Hyperspectral Images Using
Clustering [2.741266294612776]
Clustering Ensemble U-Net (CEU-Net) is a novel semantic segmentation model for Hyperspectral images (HSIs)
CEU-Net combines spectral information extracted from convolutional neural network (CNN) training on a cluster of landscape pixels.
Our model outperforms existing state-of-the-art HSI semantic segmentation methods and gets competitive performance with and without patching.
arXiv Detail & Related papers (2022-03-09T16:51:15Z) - Multi-Content Complementation Network for Salient Object Detection in
Optical Remote Sensing Images [108.79667788962425]
salient object detection in optical remote sensing images (RSI-SOD) remains to be a challenging emerging topic.
We propose a novel Multi-Content Complementation Network (MCCNet) to explore the complementarity of multiple content for RSI-SOD.
In MCCM, we consider multiple types of features that are critical to RSI-SOD, including foreground features, edge features, background features, and global image-level features.
arXiv Detail & Related papers (2021-12-02T04:46:40Z) - Learning to Aggregate Multi-Scale Context for Instance Segmentation in
Remote Sensing Images [28.560068780733342]
A novel context aggregation network (CATNet) is proposed to improve the feature extraction process.
The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid ( SCP), and hierarchical region of interest extractor (HRoIE)
arXiv Detail & Related papers (2021-11-22T08:55:25Z) - Semantic Attention and Scale Complementary Network for Instance
Segmentation in Remote Sensing Images [54.08240004593062]
We propose an end-to-end multi-category instance segmentation model, which consists of a Semantic Attention (SEA) module and a Scale Complementary Mask Branch (SCMB)
SEA module contains a simple fully convolutional semantic segmentation branch with extra supervision to strengthen the activation of interest instances on the feature map.
SCMB extends the original single mask branch to trident mask branches and introduces complementary mask supervision at different scales.
arXiv Detail & Related papers (2021-07-25T08:53:59Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.