Related papers: Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery

Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery

URL: http://arxiv.org/abs/2201.07124v2
Date: Wed, 19 Jan 2022 04:37:26 GMT
Title: Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery
Authors: Yan Zhao, Lingjun Zhao, Zhong Liu, Dewen Hu, Gangyao Kuang, Li Liu
Abstract summary: Aircraft detection in Synthetic Aperture Radar (SAR) imagery is a challenging task due to aircraft's discrete appearance, obvious intraclass variation, small size and serious background's interference. In this paper, a single-shot detector namely Attentional Feature Refinement and Alignment Network (AFRAN) is proposed for detecting aircraft in SAR images with competitive accuracy and speed.
Score: 24.004052923372548
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Aircraft detection in Synthetic Aperture Radar (SAR) imagery is a challenging task in SAR Automatic Target Recognition (SAR ATR) areas due to aircraft's extremely discrete appearance, obvious intraclass variation, small size and serious background's interference. In this paper, a single-shot detector namely Attentional Feature Refinement and Alignment Network (AFRAN) is proposed for detecting aircraft in SAR images with competitive accuracy and speed. Specifically, three significant components including Attention Feature Fusion Module (AFFM), Deformable Lateral Connection Module (DLCM) and Anchor-guided Detection Module (ADM), are carefully designed in our method for refining and aligning informative characteristics of aircraft. To represent characteristics of aircraft with less interference, low-level textural and high-level semantic features of aircraft are fused and refined in AFFM throughly. The alignment between aircraft's discrete back-scatting points and convolutional sampling spots is promoted in DLCM. Eventually, the locations of aircraft are predicted precisely in ADM based on aligned features revised by refined anchors. To evaluate the performance of our method, a self-built SAR aircraft sliced dataset and a large scene SAR image are collected. Extensive quantitative and qualitative experiments with detailed analysis illustrate the effectiveness of the three proposed components. Furthermore, the topmost detection accuracy and competitive speed are achieved by our method compared with other domain-specific,e.g., DAPN, PADN, and general CNN-based methods,e.g., FPN, Cascade R-CNN, SSD, RefineDet and RPDet.

Related papers

High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery [4.833513511627847]
Unmanned Aerial Vehicle-based Object Detection (UAV-OD) faces substantial challenges, including small target sizes, high-density distributions, and cluttered backgrounds in UAV imagery.<n>We propose HEGS-DETR, a comprehensively enhanced, real-time Detection Transformer framework tailored for UAVs.<n> Experiments on the VisDrone dataset demonstrate that HEGS-DETR achieves a 5.1% AP50 and 3.8% AP increase over the baseline, while maintaining real-time speed and reducing parameter count by 4M.
arXiv Detail & Related papers (2025-07-01T14:56:56Z)
J-DDL: Surface Damage Detection and Localization System for Fighter Aircraft [18.53607676786071]
We propose a smart surface damage detection and localization system for fighter aircraft, termed J-DDL.<n>J-DDL integrates 2D images and 3D point clouds of the entire aircraft surface, captured using a combined system of laser scanners and cameras.<n>Key innovations include lightweight Fasternet blocks for efficient feature extraction, an optimized neck architecture, and the introduction of a novel loss function, Inner-CIOU.
arXiv Detail & Related papers (2025-06-12T09:05:35Z)
MTSGL: Multi-Task Structure Guided Learning for Robust and Interpretable SAR Aircraft Recognition [16.88286091071643]
We propose a multi-task structure guided learning (MTSGL) network for robust and interpretable SAR aircraft recognition. The proposed MTSGL includes a structural semantic awareness (SSA) module and a structural consistency regularization (SCR) module. In conclusion, the MTSGL is presented with the expert-level aircraft prior knowledge and structure guided learning paradigm, aiming to comprehend the aircraft concept in a way analogous to the human cognitive process.
arXiv Detail & Related papers (2025-04-23T07:27:08Z)
Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models [4.6570959687411975]
Translating SAR images into optical images is a promising solution to enhance interpretation and support downstream tasks. This study proposes a keypoint-guided diffusion model (KeypointDiff) for SAR-to-optical image translation of unpaired aircraft targets.
arXiv Detail & Related papers (2025-03-25T16:05:49Z)
Physics-Guided Detector for SAR Airplanes [48.11882103050703]
We propose a novel physics-guided detector (PGD) learning paradigm for SAR airplanes. It comprehensively investigate their discreteness and variability to improve the detection performance. The experiments demonstrate the flexibility and effectiveness of the proposed PGD.
arXiv Detail & Related papers (2024-11-19T07:41:09Z)
MS-Net: A Multi-modal Self-supervised Network for Fine-Grained Classification of Aircraft in SAR Images [8.54188605939881]
This article proposes a novel multi-modal self-supervised network (MS-Net) for fine-grained classification of aircraft. In the case of no label, the proposed algorithm achieves an accuracy of 88.46% for 17 types of air-craft classification task.
arXiv Detail & Related papers (2023-08-28T14:28:50Z)
Efficient Real-time Smoke Filtration with 3D LiDAR for Search and Rescue with Autonomous Heterogeneous Robotic Systems [56.838297900091426]
Smoke and dust affect the performance of any mobile robotic platform due to their reliance on onboard perception systems. This paper proposes a novel modular computation filtration pipeline based on intensity and spatial information.
arXiv Detail & Related papers (2023-08-14T16:48:57Z)
Multi-Modal Domain Fusion for Multi-modal Aerial View Object Classification [4.438928487047433]
A novel Multi-Modal Domain Fusion(MDF) network is proposed to learn the domain invariant features from multi-modal data. The network achieves top-10 performance in the Track-1 with an accuracy of 25.3 % and top-5 performance in Track-2 with an accuracy of 34.26 %.
arXiv Detail & Related papers (2022-12-14T05:14:02Z)
Spatio-Temporal-Frequency Graph Attention Convolutional Network for Aircraft Recognition Based on Heterogeneous Radar Network [24.666924145375397]
This paper proposes a knowledge-and-data-driven graph neural network-based collaboration learning model for reliable aircraft recognition in a heterogeneous radar network. A graph attention convolutional network (STFGACN) is developed to distill semantic features from the radar cross-section signals received by the network.
arXiv Detail & Related papers (2022-04-15T07:39:32Z)
Context-Preserving Instance-Level Augmentation and Deformable Convolution Networks for SAR Ship Detection [50.53262868498824]
Shape deformation of targets in SAR image due to random orientation and partial information loss is an essential challenge in SAR ship detection. We propose a data augmentation method to train a deep network that is robust to partial information loss within the targets.
arXiv Detail & Related papers (2022-02-14T07:01:01Z)
Rethinking Drone-Based Search and Rescue with Aerial Person Detection [79.76669658740902]
The visual inspection of aerial drone footage is an integral part of land search and rescue (SAR) operations today. We propose a novel deep learning algorithm to automate this aerial person detection (APD) task. We present the novel Aerial Inspection RetinaNet (AIR) algorithm as the combination of these contributions.
arXiv Detail & Related papers (2021-11-17T21:48:31Z)
RRNet: Relational Reasoning Network with Parallel Multi-scale Attention for Salient Object Detection in Optical Remote Sensing Images [82.1679766706423]
Salient object detection (SOD) for optical remote sensing images (RSIs) aims at locating and extracting visually distinctive objects/regions from the optical RSIs. We propose a relational reasoning network with parallel multi-scale attention for SOD in optical RSIs. Our proposed RRNet outperforms the existing state-of-the-art SOD competitors both qualitatively and quantitatively.
arXiv Detail & Related papers (2021-10-27T07:18:32Z)
MRDet: A Multi-Head Network for Accurate Oriented Object Detection in Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors. To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network. Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z)
PENet: Object Detection using Points Estimation in Aerial Images [9.33900415971554]
A novel network structure, Points Estimated Network (PENet), is proposed in this work to answer these challenges. PENet uses a Mask Resampling Module (MRM) to augment the imbalanced datasets, a coarse anchor-free detector (CPEN) to effectively predict the center points of the small object clusters, and a fine anchor-free detector FPEN to locate the precise positions of the small objects. Our experiments on aerial datasets visDrone and UAVDT showed that PENet achieved higher precision results than existing state-of-the-art approaches.
arXiv Detail & Related papers (2020-01-22T19:43:17Z)

This list is automatically generated from the titles and abstracts of the papers in this site.