Fast Fourier Convolution Based Remote Sensor Image Object Detection for
Earth Observation
- URL: http://arxiv.org/abs/2209.00551v1
- Date: Thu, 1 Sep 2022 15:50:58 GMT
- Title: Fast Fourier Convolution Based Remote Sensor Image Object Detection for
Earth Observation
- Authors: Gu Lingyun, Eugene Popov, Dong Ge
- Abstract summary: We propose a Frequency-aware Feature Pyramid Framework (FFPF) for remote sensing object detection.
F-ResNet is proposed to perceive the spectral context information by plugging the frequency domain convolution into each stage of the backbone.
The BSFPN is designed to use a bilateral sampling strategy and skipping connection to better model the association of object features at different scales.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Remote sensor image object detection is an important technology for Earth
observation, and is used in various tasks such as forest fire monitoring and
ocean monitoring. Image object detection technology, despite the significant
developments, is struggling to handle remote sensor images and small-scale
objects, due to the limited pixels of small objects. Numerous existing studies
have demonstrated that an effective way to promote small object detection is to
introduce the spatial context. Meanwhile, recent researches for image
classification have shown that spectral convolution operations can perceive
long-term spatial dependence more efficiently in the frequency domain than
spatial domain. Inspired by this observation, we propose a Frequency-aware
Feature Pyramid Framework (FFPF) for remote sensing object detection, which
consists of a novel Frequency-aware ResNet (F-ResNet) and a Bilateral
Spectral-aware Feature Pyramid Network (BS-FPN). Specifically, the F-ResNet is
proposed to perceive the spectral context information by plugging the frequency
domain convolution into each stage of the backbone, extracting richer features
of small objects. To the best of our knowledge, this is the first work to
introduce frequency-domain convolution into remote sensing object detection
task. In addition, the BSFPN is designed to use a bilateral sampling strategy
and skipping connection to better model the association of object features at
different scales, towards unleashing the potential of the spectral context
information from F-ResNet. Extensive experiments are conducted for object
detection in the optical remote sensing image dataset (DIOR and DOTA). The
experimental results demonstrate the excellent performance of our method. It
achieves an average accuracy (mAP) without any tricks.
Related papers
- United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images [21.76732661032257]
We propose a novel United Domain Cognition Network (UDCNet) to jointly explore the global-local information in the frequency and spatial domains.
Experimental results demonstrate the superiority of the proposed UDCNet over 24 state-of-the-art models.
arXiv Detail & Related papers (2024-11-11T04:12:27Z) - Frequency-Spatial Entanglement Learning for Camouflaged Object Detection [34.426297468968485]
Existing methods attempt to reduce the impact of pixel similarity by maximizing the distinguishing ability of spatial features with complicated design.
We propose a new approach to address this issue by jointly exploring the representation in the frequency and spatial domains, introducing the Frequency-Spatial Entanglement Learning (FSEL) method.
Our experiments demonstrate the superiority of our FSEL over 21 state-of-the-art methods, through comprehensive quantitative and qualitative comparisons in three widely-used datasets.
arXiv Detail & Related papers (2024-09-03T07:58:47Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in
VIS and NIR Scenario [87.72258480670627]
Existing face forgery detection methods based on frequency domain find that the GAN forged images have obvious grid-like visual artifacts in the frequency spectrum compared to the real images.
This paper proposes a Cosine Transform-based Forgery Clue Augmentation Network (FCAN-DCT) to achieve a more comprehensive spatial-temporal feature representation.
arXiv Detail & Related papers (2022-07-05T09:27:53Z) - RRNet: Relational Reasoning Network with Parallel Multi-scale Attention
for Salient Object Detection in Optical Remote Sensing Images [82.1679766706423]
Salient object detection (SOD) for optical remote sensing images (RSIs) aims at locating and extracting visually distinctive objects/regions from the optical RSIs.
We propose a relational reasoning network with parallel multi-scale attention for SOD in optical RSIs.
Our proposed RRNet outperforms the existing state-of-the-art SOD competitors both qualitatively and quantitatively.
arXiv Detail & Related papers (2021-10-27T07:18:32Z) - Infrared Small-Dim Target Detection with Transformer under Complex
Backgrounds [155.388487263872]
We propose a new infrared small-dim target detection method with the transformer.
We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range.
We also design a feature enhancement module to learn more features of small-dim targets.
arXiv Detail & Related papers (2021-09-29T12:23:41Z) - CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented
Object Detection in Remote Sensing Images [0.9462808515258465]
In this paper, we discuss the role of discriminative features in object detection.
We then propose a Critical Feature Capturing Network (CFC-Net) to improve detection accuracy.
We show that our method achieves superior detection performance compared with many state-of-the-art approaches.
arXiv Detail & Related papers (2021-01-18T02:31:09Z) - Dense Attention Fluid Network for Salient Object Detection in Optical
Remote Sensing Images [193.77450545067967]
We propose an end-to-end Dense Attention Fluid Network (DAFNet) for salient object detection in optical remote sensing images (RSIs)
A Global Context-aware Attention (GCA) module is proposed to adaptively capture long-range semantic context relationships.
We construct a new and challenging optical RSI dataset for SOD that contains 2,000 images with pixel-wise saliency annotations.
arXiv Detail & Related papers (2020-11-26T06:14:10Z) - SCRDet++: Detecting Small, Cluttered and Rotated Objects via
Instance-Level Feature Denoising and Rotation Loss Smoothing [131.04304632759033]
Small and cluttered objects are common in real-world which are challenging for detection.
In this paper, we first innovatively introduce the idea of denoising to object detection.
Instance-level denoising on the feature map is performed to enhance the detection to small and cluttered objects.
arXiv Detail & Related papers (2020-04-28T06:03:54Z) - Small-Object Detection in Remote Sensing Images with End-to-End
Edge-Enhanced GAN and Object Detector Network [9.135036713000513]
A generative adversarial network (GAN)-based model called enhanced super-resolution GAN (ESRGAN) shows remarkable image enhancement performance.
We propose a new edge-enhanced super-resolution GAN (EESRGAN) to improve the image quality of remote sensing images.
arXiv Detail & Related papers (2020-03-20T03:07:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.