Frequency Perception Network for Camouflaged Object Detection
- URL: http://arxiv.org/abs/2308.08924v1
- Date: Thu, 17 Aug 2023 11:30:46 GMT
- Title: Frequency Perception Network for Camouflaged Object Detection
- Authors: Runmin Cong, Mengyao Sun, Sanyi Zhang, Xiaofei Zhou, Wei Zhang, and
Yao Zhao
- Abstract summary: We propose a novel learnable and separable frequency perception mechanism driven by the semantic hierarchy in the frequency domain.
Our entire network adopts a two-stage model, including a frequency-guided coarse localization stage and a detail-preserving fine localization stage.
Compared with the currently existing models, our proposed method achieves competitive performance in three popular benchmark datasets.
- Score: 51.26386921922031
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Camouflaged object detection (COD) aims to accurately detect objects hidden
in the surrounding environment. However, the existing COD methods mainly locate
camouflaged objects in the RGB domain, their performance has not been fully
exploited in many challenging scenarios. Considering that the features of the
camouflaged object and the background are more discriminative in the frequency
domain, we propose a novel learnable and separable frequency perception
mechanism driven by the semantic hierarchy in the frequency domain. Our entire
network adopts a two-stage model, including a frequency-guided coarse
localization stage and a detail-preserving fine localization stage. With the
multi-level features extracted by the backbone, we design a flexible frequency
perception module based on octave convolution for coarse positioning. Then, we
design the correction fusion module to step-by-step integrate the high-level
features through the prior-guided correction and cross-layer feature channel
association, and finally combine them with the shallow features to achieve the
detailed correction of the camouflaged objects. Compared with the currently
existing models, our proposed method achieves competitive performance in three
popular benchmark datasets both qualitatively and quantitatively.
Related papers
- DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images [20.11145540094807]
We propose a network aided by explicit frequency domain transform to calibrate convolutional biases and pay more attention to high-frequencies.
We design TransDeno, a dynamic frequency domain attention module that performs as a transform domain soft thresholding operation.
Our plug-and-play TransDeno sets state-of-the-art scores on multiple SAR target detection datasets.
arXiv Detail & Related papers (2024-06-05T01:05:26Z) - Unveiling Camouflage: A Learnable Fourier-based Augmentation for
Camouflaged Object Detection and Instance Segmentation [27.41886911999097]
We propose a learnable augmentation method for camouflaged object detection (COD) and camouflaged instance segmentation (CIS)
Our proposed augmentation method boosts the performance of camouflaged object detectors and camouflaged instance segmenters by large margins.
arXiv Detail & Related papers (2023-08-29T22:43:46Z) - Feature Shrinkage Pyramid for Camouflaged Object Detection with
Transformers [34.42710399235461]
Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection.
They suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders.
We propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features.
arXiv Detail & Related papers (2023-03-26T20:50:58Z) - Feature Aggregation and Propagation Network for Camouflaged Object
Detection [42.33180748293329]
Camouflaged object detection (COD) aims to detect/segment camouflaged objects embedded in the environment.
Several COD methods have been developed, but they still suffer from unsatisfactory performance due to intrinsic similarities between foreground objects and background surroundings.
We propose a novel Feature Aggregation and propagation Network (FAP-Net) for camouflaged object detection.
arXiv Detail & Related papers (2022-12-02T05:54:28Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Adaptive Frequency Learning in Two-branch Face Forgery Detection [66.91715092251258]
We propose Adaptively learn Frequency information in the two-branch Detection framework, dubbed AFD.
We liberate our network from the fixed frequency transforms, and achieve better performance with our data- and task-dependent transform layers.
arXiv Detail & Related papers (2022-03-27T14:25:52Z) - High-resolution Iterative Feedback Network for Camouflaged Object
Detection [128.893782016078]
Spotting camouflaged objects that are visually assimilated into the background is tricky for object detection algorithms.
We aim to extract the high-resolution texture details to avoid the detail degradation that causes blurred vision in edges and boundaries.
We introduce a novel HitNet to refine the low-resolution representations by high-resolution features in an iterative feedback manner.
arXiv Detail & Related papers (2022-03-22T11:20:21Z) - Generalizing Face Forgery Detection with High-frequency Features [63.33397573649408]
Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
The first is the multi-scale high-frequency feature extraction module that extracts high-frequency noises at multiple scales.
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
arXiv Detail & Related papers (2021-03-23T08:19:21Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.