FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection
- URL: http://arxiv.org/abs/2503.11030v1
- Date: Fri, 14 Mar 2025 02:55:19 GMT
- Title: FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection
- Authors: Ming Deng, Sijin Sun, Zihao Li, Xiaochuan Hu, Xing Wu,
- Abstract summary: Camouflaged Object Detection (COD) is challenging due to the strong similarity between camouflaged objects and their surroundings.<n>Existing methods mainly rely on spatial local features, failing to capture global information.<n>To address this, the Frequency-Assisted Mamba-Like Linear Attention Network (FMNet) is proposed.
- Score: 7.246630480680039
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Camouflaged Object Detection (COD) is challenging due to the strong similarity between camouflaged objects and their surroundings, which complicates identification. Existing methods mainly rely on spatial local features, failing to capture global information, while Transformers increase computational costs.To address this, the Frequency-Assisted Mamba-Like Linear Attention Network (FMNet) is proposed, which leverages frequency-domain learning to efficiently capture global features and mitigate ambiguity between objects and the background. FMNet introduces the Multi-Scale Frequency-Assisted Mamba-Like Linear Attention (MFM) module, integrating frequency and spatial features through a multi-scale structure to handle scale variations while reducing computational complexity. Additionally, the Pyramidal Frequency Attention Extraction (PFAE) module and the Frequency Reverse Decoder (FRD) enhance semantics and reconstruct features. Experimental results demonstrate that FMNet outperforms existing methods on multiple COD datasets, showcasing its advantages in both performance and efficiency. Code available at https://anonymous.4open.science/r/FMNet-3CE5.
Related papers
- Frequency-Integrated Transformer for Arbitrary-Scale Super-Resolution [8.303267303436613]
Methods based on implicit neural representation have demonstrated remarkable capabilities in arbitrary-scale super-resolution (ASSR) tasks.
We propose a novel network called Frequency-Integrated Transformer (FIT) to incorporate frequency information to enhance ASSR performance.
arXiv Detail & Related papers (2025-04-26T06:12:49Z) - Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation [33.49405456617909]
We propose the Adaptive Frequency Enhancement Network (AFENet), which integrates two key components: the Adaptive Frequency and Spatial feature Interaction Module (AFSIM) and the Selective feature Fusion Module (SFM)
AFSIM dynamically separates and modulates high- and low-frequency features according to the content of the input image.
SFM selectively fuses global context and local detailed features to enhance the network's representation capability.
arXiv Detail & Related papers (2025-04-03T14:42:49Z) - Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model [8.392891463947661]
We propose Wavelet-Mamba, which integrates wavelet transform with the state-space model (SSM)
Wavelet-SSM module incorporates wavelet-based frequency domain feature extraction and global information extraction through SSM.
Our method achieves both visually compelling results and superior performance compared to current state-of-the-art methods.
arXiv Detail & Related papers (2025-03-24T06:25:44Z) - Spatial and Frequency Domain Adaptive Fusion Network for Image Deblurring [0.0]
Image deblurring aims to reconstruct a latent sharp image from its corresponding blurred one.<n>We propose a spatial-frequency domain adaptive fusion network (SFAFNet) to address this limitation.<n>Our SFAFNet performs favorably compared to state-of-the-art approaches on commonly used benchmarks.
arXiv Detail & Related papers (2025-02-20T02:43:55Z) - FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation [50.9040167152168]
We experimentally quantify the contrast sensitivity function of CNNs and compare it with that of the human visual system.
We propose the Wavelet-Guided Spectral Pooling Module (WSPM) to enhance and balance image features across the frequency domain.
To further emulate the human visual system, we introduce the Frequency Domain Enhanced Receptive Field Block (FE-RFB)
We develop FE-UNet, a model that utilizes SAM2 as its backbone and incorporates Hiera-Large as a pre-trained block.
arXiv Detail & Related papers (2025-02-06T07:24:34Z) - Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning [50.74383395813782]
We propose a novel Frequency and Spatial Mutual Learning Network (FSMNet) to explore global dependencies across different modalities.
The proposed FSMNet achieves state-of-the-art performance for the Multi-Contrast MR Reconstruction task with different acceleration factors.
arXiv Detail & Related papers (2024-09-21T12:02:47Z) - Frequency-Spatial Entanglement Learning for Camouflaged Object Detection [34.426297468968485]
Existing methods attempt to reduce the impact of pixel similarity by maximizing the distinguishing ability of spatial features with complicated design.
We propose a new approach to address this issue by jointly exploring the representation in the frequency and spatial domains, introducing the Frequency-Spatial Entanglement Learning (FSEL) method.
Our experiments demonstrate the superiority of our FSEL over 21 state-of-the-art methods, through comprehensive quantitative and qualitative comparisons in three widely-used datasets.
arXiv Detail & Related papers (2024-09-03T07:58:47Z) - Frequency Perception Network for Camouflaged Object Detection [51.26386921922031]
We propose a novel learnable and separable frequency perception mechanism driven by the semantic hierarchy in the frequency domain.<n>Our entire network adopts a two-stage model, including a frequency-guided coarse localization stage and a detail-preserving fine localization stage.<n>Compared with the currently existing models, our proposed method achieves competitive performance in three popular benchmark datasets.
arXiv Detail & Related papers (2023-08-17T11:30:46Z) - Adaptive Frequency Filters As Efficient Global Token Mixers [100.27957692579892]
We show that adaptive frequency filters can serve as efficient global token mixers.
We take AFF token mixers as primary neural operators to build a lightweight neural network, dubbed AFFNet.
arXiv Detail & Related papers (2023-07-26T07:42:28Z) - MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing [15.847332787718852]
This paper proposes a novel de-mosaicing framework, the MSFA-frequency-aware Transformer network (FDM-Net)
The advantage of Maformer is that it can leverage the MSFA information and non-local dependencies present in the data.
Our experimental results demonstrate that FDM-Net outperforms state-of-the-art methods with 6dB PSNR, and reconstructs high-fidelity details successfully.
arXiv Detail & Related papers (2023-03-23T16:27:30Z) - Adaptive Frequency Learning in Two-branch Face Forgery Detection [66.91715092251258]
We propose Adaptively learn Frequency information in the two-branch Detection framework, dubbed AFD.
We liberate our network from the fixed frequency transforms, and achieve better performance with our data- and task-dependent transform layers.
arXiv Detail & Related papers (2022-03-27T14:25:52Z) - Deep Frequency Filtering for Domain Generalization [55.66498461438285]
Deep Neural Networks (DNNs) have preferences for some frequency components in the learning process.
We propose Deep Frequency Filtering (DFF) for learning domain-generalizable features.
We show that applying our proposed DFF on a plain baseline outperforms the state-of-the-art methods on different domain generalization tasks.
arXiv Detail & Related papers (2022-03-23T05:19:06Z) - FMNet: Latent Feature-wise Mapping Network for Cleaning up Noisy
Micro-Doppler Spectrogram [2.9849405664643585]
noisy surroundings cause uninterpretable motion patterns on the micro-Doppler spectrogram.
radar returns often suffer from multipath, clutter and interference.
We propose a latent feature-wise mapping strategy, called Feature Mapping Network (FMNet), to transform measured spectrograms.
arXiv Detail & Related papers (2021-07-09T19:20:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.