UIU-Net: U-Net in U-Net for Infrared Small Object Detection
- URL: http://arxiv.org/abs/2212.00968v1
- Date: Fri, 2 Dec 2022 04:52:26 GMT
- Title: UIU-Net: U-Net in U-Net for Infrared Small Object Detection
- Authors: Xin Wu and Danfeng Hong and Jocelyn Chanussot
- Abstract summary: We propose a simple and effective U-Net in U-Net'' framework, UIU-Net for short, and detect small objects in infrared images.
As the name suggests, UIU-Net embeds a tiny U-Net into a larger U-Net backbone, enabling the multi-level and multi-scale representation learning of objects.
The proposed UIU-Net also produces powerful generalization performance for video sequence infrared small object datasets.
- Score: 36.72184013409837
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning-based infrared small object detection methods currently rely heavily
on the classification backbone network. This tends to result in tiny object
loss and feature distinguishability limitations as the network depth increases.
Furthermore, small objects in infrared images are frequently emerged bright and
dark, posing severe demands for obtaining precise object contrast information.
For this reason, we in this paper propose a simple and effective ``U-Net in
U-Net'' framework, UIU-Net for short, and detect small objects in infrared
images. As the name suggests, UIU-Net embeds a tiny U-Net into a larger U-Net
backbone, enabling the multi-level and multi-scale representation learning of
objects. Moreover, UIU-Net can be trained from scratch, and the learned
features can enhance global and local contrast information effectively. More
specifically, the UIU-Net model is divided into two modules: the
resolution-maintenance deep supervision (RM-DS) module and the
interactive-cross attention (IC-A) module. RM-DS integrates Residual U-blocks
into a deep supervision network to generate deep multi-scale
resolution-maintenance features while learning global context information.
Further, IC-A encodes the local context information between the low-level
details and high-level semantic features. Extensive experiments conducted on
two infrared single-frame image datasets, i.e., SIRST and Synthetic datasets,
show the effectiveness and superiority of the proposed UIU-Net in comparison
with several state-of-the-art infrared small object detection methods. The
proposed UIU-Net also produces powerful generalization performance for video
sequence infrared small object datasets, e.g., ATR ground/air video sequence
dataset. The codes of this work are available openly at
\url{https://github.com/danfenghong/IEEE_TIP_UIU-Net}.
Related papers
- HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection [16.92362922379821]
We propose a deep learning method to improve infrared small object detection performance.
The method includes the parallelized patch-aware attention (PPA) module, dimension-aware selective integration (DASI) module, and multi-dilated channel refiner (MDCR) module.
arXiv Detail & Related papers (2024-03-16T02:45:42Z) - ILNet: Low-level Matters for Salient Infrared Small Target Detection [5.248337726304453]
Infrared small target detection is a technique for finding small targets from infrared clutter background.
Due to the dearth of high-level semantic information, small infrared target features are weakened in the deep layers of the CNN.
We propose an infrared low-level network (ILNet) that considers infrared small targets as salient areas with little semantic information.
arXiv Detail & Related papers (2023-09-24T14:09:37Z) - Point-aware Interaction and CNN-induced Refinement Network for RGB-D
Salient Object Detection [95.84616822805664]
We introduce CNNs-assisted Transformer architecture and propose a novel RGB-D SOD network with Point-aware Interaction and CNN-induced Refinement.
In order to alleviate the block effect and detail destruction problems brought by the Transformer naturally, we design a CNN-induced refinement (CNNR) unit for content refinement and supplementation.
arXiv Detail & Related papers (2023-08-17T11:57:49Z) - Local Contrast and Global Contextual Information Make Infrared Small
Object Salient Again [5.324958606516871]
Infrared small object detection (ISOS) aims to segment small objects only covered with several pixels from clutter background in infrared images.
It's of great challenge due to: 1) small objects lack of sufficient intensity, shape and texture information; 2) small objects are easily lost in the process where detection models, say deep neural networks, obtain high-level semantic features and image-level receptive fields through successive downsampling.
This paper proposes a reliable detection model for ISOS, dubbed UCFNet, which can handle well the two issues.
Experiments on several public datasets demonstrate that our method significantly outperforms the state-of-the
arXiv Detail & Related papers (2023-01-28T05:18:13Z) - iSmallNet: Densely Nested Network with Label Decoupling for Infrared
Small Target Detection [35.4167548966077]
iSmallNet is a densely nested network with label decoupling for infrared small object detection.
We develop two key modules to boost the overall performance.
Experiments on NUAA-SIRST and NUDT-SIRST clearly show the superiority of iSmallNet over 11 state-of-the-art detectors.
arXiv Detail & Related papers (2022-10-29T10:27:54Z) - Infrared Small-Dim Target Detection with Transformer under Complex
Backgrounds [155.388487263872]
We propose a new infrared small-dim target detection method with the transformer.
We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range.
We also design a feature enhancement module to learn more features of small-dim targets.
arXiv Detail & Related papers (2021-09-29T12:23:41Z) - RGB-D Saliency Detection via Cascaded Mutual Information Minimization [122.8879596830581]
Existing RGB-D saliency detection models do not explicitly encourage RGB and depth to achieve effective multi-modal learning.
We introduce a novel multi-stage cascaded learning framework via mutual information minimization to "explicitly" model the multi-modal information between RGB image and depth data.
arXiv Detail & Related papers (2021-09-15T12:31:27Z) - Dense Nested Attention Network for Infrared Small Target Detection [36.654692765557726]
Single-frame infrared small target (SIRST) detection aims at separating small targets from clutter backgrounds.
Existing CNN-based methods cannot be directly applied for infrared small targets.
We propose a dense nested attention network (DNANet) in this paper.
arXiv Detail & Related papers (2021-06-01T13:45:35Z) - KiU-Net: Overcomplete Convolutional Architectures for Biomedical Image
and Volumetric Segmentation [71.79090083883403]
"Traditional" encoder-decoder based approaches perform poorly in detecting smaller structures and are unable to segment boundary regions precisely.
We propose KiU-Net which has two branches: (1) an overcomplete convolutional network Kite-Net which learns to capture fine details and accurate edges of the input, and (2) U-Net which learns high level features.
The proposed method achieves a better performance as compared to all the recent methods with an additional benefit of fewer parameters and faster convergence.
arXiv Detail & Related papers (2020-10-04T19:23:33Z) - Accurate RGB-D Salient Object Detection via Collaborative Learning [101.82654054191443]
RGB-D saliency detection shows impressive ability on some challenge scenarios.
We propose a novel collaborative learning framework where edge, depth and saliency are leveraged in a more efficient way.
arXiv Detail & Related papers (2020-07-23T04:33:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.