Fast Fourier Convolution Based Remote Sensor Image Object Detection for
Earth Observation
- URL: http://arxiv.org/abs/2209.00551v1
- Date: Thu, 1 Sep 2022 15:50:58 GMT
- Title: Fast Fourier Convolution Based Remote Sensor Image Object Detection for
Earth Observation
- Authors: Gu Lingyun, Eugene Popov, Dong Ge
- Abstract summary: We propose a Frequency-aware Feature Pyramid Framework (FFPF) for remote sensing object detection.
F-ResNet is proposed to perceive spectral context information by plugging frequency-domain convolution into each stage of the backbone.
The BS-FPN is designed to use a bilateral sampling strategy and skip connections to better model the association of object features at different scales.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Remote sensor image object detection is an important technology for Earth
observation, and is used in various tasks such as forest fire monitoring and
ocean monitoring. Despite significant progress, image object detection still
struggles with remote sensor images and small-scale objects, because small
objects occupy only a few pixels. Numerous existing studies
have demonstrated that an effective way to promote small object detection is to
introduce the spatial context. Meanwhile, recent research on image
classification has shown that spectral convolution operations can perceive
long-range spatial dependence more efficiently in the frequency domain than in
the spatial domain. Inspired by this observation, we propose a Frequency-aware
Feature Pyramid Framework (FFPF) for remote sensing object detection, which
consists of a novel Frequency-aware ResNet (F-ResNet) and a Bilateral
Spectral-aware Feature Pyramid Network (BS-FPN). Specifically, the F-ResNet is
proposed to perceive the spectral context information by plugging the frequency
domain convolution into each stage of the backbone, extracting richer features
of small objects. To the best of our knowledge, this is the first work to
introduce frequency-domain convolution into the remote sensing object detection
task. In addition, the BS-FPN is designed to use a bilateral sampling strategy
and skip connections to better model the association of object features at
different scales, toward unleashing the potential of the spectral context
information from F-ResNet. Extensive experiments are conducted for object
detection on the optical remote sensing image datasets DIOR and DOTA. The
experimental results demonstrate the excellent performance of our method. It
achieves an average accuracy (mAP) without any tricks.
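The abstract's point that a frequency-domain operation can capture long-range spatial dependence can be illustrated with the convolution theorem. The block below is a minimal 1-D sketch using only the Python standard library. It is not the paper's F-ResNet or its convolution layer; the helper names (`dft`, `spectral_conv`, `circular_conv`) and the toy signal and kernel are assumptions made for illustration.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform of a 1-D sequence."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n) for t in range(n))
        out.append(s / n if inverse else s)
    return out


def spectral_conv(signal, kernel):
    """Circular convolution computed as a pointwise product of spectra."""
    assert len(signal) == len(kernel)
    product = [a * b for a, b in zip(dft(signal), dft(kernel))]
    return [v.real for v in dft(product, inverse=True)]


def circular_conv(signal, kernel):
    """Reference circular convolution computed directly in the spatial domain."""
    n = len(signal)
    return [sum(signal[j] * kernel[(i - j) % n] for j in range(n)) for i in range(n)]


# Toy input (hypothetical): a single active sample standing in for a small object.
signal = [0.0] * 8
signal[3] = 1.0

# Toy kernel (hypothetical) that spans the whole signal, i.e. a global receptive field.
kernel = [1.0 / 8] * 8

spectral = spectral_conv(signal, kernel)
direct = circular_conv(signal, kernel)
```

In this toy case the kernel covers the entire signal, yet its spectrum has a single non-zero coefficient, so one pointwise multiplication in the frequency domain spreads the isolated sample to every output position. The direct spatial computation gives the same values up to floating-point error.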
Related papers
- Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images [1.662438436885552]
Multi-modal fusion has been shown to enhance accuracy by fusing data from multiple modalities.
We propose a novel multi-modal fusion strategy for mapping relationships between different channels at the early stage.
By addressing fusion in the early stage, as opposed to mid or late-stage methods, our method achieves competitive and even superior performance compared to existing techniques.
arXiv Detail & Related papers (2023-10-21T00:56:11Z)
- AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z)
- Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in VIS and NIR Scenario [87.72258480670627]
Existing frequency-domain face forgery detection methods find that GAN-forged images have obvious grid-like visual artifacts in the frequency spectrum compared to real images.
This paper proposes a Cosine Transform-based Forgery Clue Augmentation Network (FCAN-DCT) to achieve a more comprehensive spatial-temporal feature representation.
arXiv Detail & Related papers (2022-07-05T09:27:53Z)
- Enhanced Single-shot Detector for Small Object Detection in Remote Sensing Images [33.84369068593722]
We propose an image pyramid single-shot detector (IPSSD) for small-scale object detection.
In IPSSD, a single-shot detector is combined with an image pyramid network to extract semantically strong features for generating candidate regions.
The proposed network can enhance the small-scale features from a feature pyramid network.
arXiv Detail & Related papers (2022-05-12T07:35:07Z)
- RRNet: Relational Reasoning Network with Parallel Multi-scale Attention for Salient Object Detection in Optical Remote Sensing Images [82.1679766706423]
Salient object detection (SOD) for optical remote sensing images (RSIs) aims at locating and extracting visually distinctive objects/regions from the optical RSIs.
We propose a relational reasoning network with parallel multi-scale attention for SOD in optical RSIs.
Our proposed RRNet outperforms the existing state-of-the-art SOD competitors both qualitatively and quantitatively.
arXiv Detail & Related papers (2021-10-27T07:18:32Z)
- Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds [155.388487263872]
We propose a new infrared small-dim target detection method with the transformer.
We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range.
We also design a feature enhancement module to learn more features of small-dim targets.
arXiv Detail & Related papers (2021-09-29T12:23:41Z)
- CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote Sensing Images [0.9462808515258465]
In this paper, we discuss the role of discriminative features in object detection.
We then propose a Critical Feature Capturing Network (CFC-Net) to improve detection accuracy.
We show that our method achieves superior detection performance compared with many state-of-the-art approaches.
arXiv Detail & Related papers (2021-01-18T02:31:09Z)
- Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing Images [193.77450545067967]
We propose an end-to-end Dense Attention Fluid Network (DAFNet) for salient object detection in optical remote sensing images (RSIs).
A Global Context-aware Attention (GCA) module is proposed to adaptively capture long-range semantic context relationships.
We construct a new and challenging optical RSI dataset for SOD that contains 2,000 images with pixel-wise saliency annotations.
arXiv Detail & Related papers (2020-11-26T06:14:10Z)
- SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing [131.04304632759033]
Small and cluttered objects are common in the real world and are challenging to detect.
In this paper, we introduce the idea of denoising to object detection for the first time.
Instance-level denoising on the feature map is performed to enhance the detection of small and cluttered objects.
arXiv Detail & Related papers (2020-04-28T06:03:54Z)
- Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network [9.135036713000513]
A generative adversarial network (GAN)-based model called enhanced super-resolution GAN (ESRGAN) shows remarkable image enhancement performance.
We propose a new edge-enhanced super-resolution GAN (EESRGAN) to improve the image quality of remote sensing images.
arXiv Detail & Related papers (2020-03-20T03:07:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.