MS-Net: A Multi-modal Self-supervised Network for Fine-Grained
Classification of Aircraft in SAR Images
- URL: http://arxiv.org/abs/2308.14613v1
- Date: Mon, 28 Aug 2023 14:28:50 GMT
- Title: MS-Net: A Multi-modal Self-supervised Network for Fine-Grained
Classification of Aircraft in SAR Images
- Authors: Bingying Yue, Jianhao Li, Hao Shi, Yupei Wang, Honghu Zhong
- Abstract summary: This article proposes a novel multi-modal self-supervised network (MS-Net) for fine-grained classification of aircraft.
In the case of no label, the proposed algorithm achieves an accuracy of 88.46% for 17 types of air-craft classification task.
- Score: 8.54188605939881
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Synthetic aperture radar (SAR) imaging technology is commonly used to provide
24-hour all-weather earth observation. However, it still has some drawbacks in
SAR target classification, especially in fine-grained classification of
aircraft: aircrafts in SAR images have large intra-class diversity and
inter-class similarity; the number of effective samples is insufficient and
it's hard to annotate. To address these issues, this article proposes a novel
multi-modal self-supervised network (MS-Net) for fine-grained classification of
aircraft. Firstly, in order to entirely exploit the potential of multi-modal
information, a two-sided path feature extraction network (TSFE-N) is
constructed to enhance the image feature of the target and obtain the domain
knowledge feature of text mode. Secondly, a contrastive self-supervised
learning (CSSL) framework is employed to effectively learn useful
label-independent feature from unbalanced data, a similarity per-ception loss
(SPloss) is proposed to avoid network overfitting. Finally, TSFE-N is used as
the encoder of CSSL to obtain the classification results. Through a large
number of experiments, our MS-Net can effectively reduce the difficulty of
classifying similar types of aircrafts. In the case of no label, the proposed
algorithm achieves an accuracy of 88.46% for 17 types of air-craft
classification task, which has pioneering significance in the field of
fine-grained classification of aircraft in SAR images.
Related papers
- Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery [51.83786195178233]
We design a Knowledge Discovery Network (KDN) to implement the renormalization group theory in terms of efficient feature extraction.
Renormalized connection (RC) on the KDN enables synergistic focusing'' of multi-scale features.
RCs extend the multi-level feature's divide-and-conquer'' mechanism of the FPN-based detectors to a wide range of scale-preferred tasks.
arXiv Detail & Related papers (2024-09-09T13:56:22Z) - SAFE: a SAR Feature Extractor based on self-supervised learning and masked Siamese ViTs [5.961207817077044]
We propose a novel self-supervised learning framework based on masked Siamese Vision Transformers to create a General SAR Feature Extractor coined SAFE.
Our method leverages contrastive learning principles to train a model on unlabeled SAR data, extracting robust and generalizable features.
We introduce tailored data augmentation techniques specific to SAR imagery, such as sub-aperture decomposition and despeckling.
Our network competes with or surpasses other state-of-the-art methods in few-shot classification and segmentation tasks, even without being trained on the sensors used for the evaluation.
arXiv Detail & Related papers (2024-06-30T23:11:20Z) - Efficient Prompt Tuning of Large Vision-Language Model for Fine-Grained
Ship Classification [62.425462136772666]
Fine-grained ship classification in remote sensing (RS-FGSC) poses a significant challenge due to the high similarity between classes and the limited availability of labeled data.
Recent advancements in large pre-trained Vision-Language Models (VLMs) have demonstrated impressive capabilities in few-shot or zero-shot learning.
This study delves into harnessing the potential of VLMs to enhance classification accuracy for unseen ship categories.
arXiv Detail & Related papers (2024-03-13T05:48:58Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Multi-Modal Domain Fusion for Multi-modal Aerial View Object
Classification [4.438928487047433]
A novel Multi-Modal Domain Fusion(MDF) network is proposed to learn the domain invariant features from multi-modal data.
The network achieves top-10 performance in the Track-1 with an accuracy of 25.3 % and top-5 performance in Track-2 with an accuracy of 34.26 %.
arXiv Detail & Related papers (2022-12-14T05:14:02Z) - SAR-ShipNet: SAR-Ship Detection Neural Network via Bidirectional
Coordinate Attention and Multi-resolution Feature Fusion [7.323279438948967]
This paper studies a practically meaningful ship detection problem from synthetic aperture radar (SAR) images by the neural network.
We propose a SAR-ship detection neural network (call SAR-ShipNet for short), by newly developing Bidirectional Coordinate Attention (BCA) and Multi-resolution Feature Fusion (MRF) based on CenterNet.
Experimental results on the public SAR-Ship dataset show that our SAR-ShipNet achieves competitive advantages in both speed and accuracy.
arXiv Detail & Related papers (2022-03-29T12:27:04Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Attentional Feature Refinement and Alignment Network for Aircraft
Detection in SAR Imagery [24.004052923372548]
Aircraft detection in Synthetic Aperture Radar (SAR) imagery is a challenging task due to aircraft's discrete appearance, obvious intraclass variation, small size and serious background's interference.
In this paper, a single-shot detector namely Attentional Feature Refinement and Alignment Network (AFRAN) is proposed for detecting aircraft in SAR images with competitive accuracy and speed.
arXiv Detail & Related papers (2022-01-18T16:54:49Z) - Robust Semi-supervised Federated Learning for Images Automatic
Recognition in Internet of Drones [57.468730437381076]
We present a Semi-supervised Federated Learning (SSFL) framework for privacy-preserving UAV image recognition.
There are significant differences in the number, features, and distribution of local data collected by UAVs using different camera modules.
We propose an aggregation rule based on the frequency of the client's participation in training, namely the FedFreq aggregation rule.
arXiv Detail & Related papers (2022-01-03T16:49:33Z) - Multi-Objective CNN Based Algorithm for SAR Despeckling [1.933681537640272]
This paper proposes a convolutional neural network (CNN) with a multi-objective cost function taking care of SAR image properties.
Experiments on simulated and real SAR images show the accuracy of the proposed method compared to the State-of-Art despeckling algorithms.
arXiv Detail & Related papers (2020-06-16T10:15:42Z) - DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning [122.51237307910878]
We develop methods for few-shot image classification from a new perspective of optimal matching between image regions.
We employ the Earth Mover's Distance (EMD) as a metric to compute a structural distance between dense image representations.
To generate the important weights of elements in the formulation, we design a cross-reference mechanism.
arXiv Detail & Related papers (2020-03-15T08:13:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.