iSmallNet: Densely Nested Network with Label Decoupling for Infrared
Small Target Detection
- URL: http://arxiv.org/abs/2210.16561v2
- Date: Thu, 29 Jun 2023 10:51:43 GMT
- Title: iSmallNet: Densely Nested Network with Label Decoupling for Infrared
Small Target Detection
- Authors: Zhiheng Hu, Yongzhen Wang, Peng Li, Jie Qin, Haoran Xie, Mingqiang Wei
- Abstract summary: iSmallNet is a densely nested network with label decoupling for infrared small object detection.
We develop two key modules to boost the overall performance.
Experiments on NUAA-SIRST and NUDT-SIRST clearly show the superiority of iSmallNet over 11 state-of-the-art detectors.
- Score: 35.4167548966077
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Small targets are often submerged in cluttered backgrounds of infrared
images. Conventional detectors tend to generate false alarms, while CNN-based
detectors lose small targets in deep layers. To this end, we propose iSmallNet,
a multi-stream densely nested network with label decoupling for infrared small
object detection. On the one hand, to fully exploit the shape information of
small targets, we decouple the original labeled ground-truth (GT) map into an
interior map and a boundary one. The GT map, in collaboration with the two
additional maps, tackles the unbalanced distribution of small object
boundaries. On the other hand, two key modules are delicately designed and
incorporated into the proposed network to boost the overall performance. First,
to maintain small targets in deep layers, we develop a multi-scale nested
interaction module to explore a wide range of context information. Second, we
develop an interior-boundary fusion module to integrate multi-granularity
information. Experiments on NUAA-SIRST and NUDT-SIRST clearly show the
superiority of iSmallNet over 11 state-of-the-art detectors.
Related papers
- HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection [16.92362922379821]
We propose a deep learning method to improve infrared small object detection performance.
The method includes the parallelized patch-aware attention (PPA) module, dimension-aware selective integration (DASI) module, and multi-dilated channel refiner (MDCR) module.
arXiv Detail & Related papers (2024-03-16T02:45:42Z) - ILNet: Low-level Matters for Salient Infrared Small Target Detection [5.248337726304453]
Infrared small target detection is a technique for finding small targets from infrared clutter background.
Due to the dearth of high-level semantic information, small infrared target features are weakened in the deep layers of the CNN.
We propose an infrared low-level network (ILNet) that considers infrared small targets as salient areas with little semantic information.
arXiv Detail & Related papers (2023-09-24T14:09:37Z) - UIU-Net: U-Net in U-Net for Infrared Small Object Detection [36.72184013409837]
We propose a simple and effective U-Net in U-Net'' framework, UIU-Net for short, and detect small objects in infrared images.
As the name suggests, UIU-Net embeds a tiny U-Net into a larger U-Net backbone, enabling the multi-level and multi-scale representation learning of objects.
The proposed UIU-Net also produces powerful generalization performance for video sequence infrared small object datasets.
arXiv Detail & Related papers (2022-12-02T04:52:26Z) - Infrared Small-Dim Target Detection with Transformer under Complex
Backgrounds [155.388487263872]
We propose a new infrared small-dim target detection method with the transformer.
We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range.
We also design a feature enhancement module to learn more features of small-dim targets.
arXiv Detail & Related papers (2021-09-29T12:23:41Z) - Dense Nested Attention Network for Infrared Small Target Detection [36.654692765557726]
Single-frame infrared small target (SIRST) detection aims at separating small targets from clutter backgrounds.
Existing CNN-based methods cannot be directly applied for infrared small targets.
We propose a dense nested attention network (DNANet) in this paper.
arXiv Detail & Related papers (2021-06-01T13:45:35Z) - Cross-Modality 3D Object Detection [63.29935886648709]
We present a novel two-stage multi-modal fusion network for 3D object detection.
The whole architecture facilitates two-stage fusion.
Our experiments on the KITTI dataset show that the proposed multi-stage fusion helps the network to learn better representations.
arXiv Detail & Related papers (2020-08-16T11:01:20Z) - Suppress and Balance: A Simple Gated Network for Salient Object
Detection [89.88222217065858]
We propose a simple gated network (GateNet) to solve both issues at once.
With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder.
In addition, we adopt the atrous spatial pyramid pooling based on the proposed "Fold" operation (Fold-ASPP) to accurately localize salient objects of various scales.
arXiv Detail & Related papers (2020-07-16T02:00:53Z) - Cross-layer Feature Pyramid Network for Salient Object Detection [102.20031050972429]
We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
arXiv Detail & Related papers (2020-02-25T14:06:27Z) - NETNet: Neighbor Erasing and Transferring Network for Better Single Shot
Object Detection [170.30694322460045]
We propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features.
A single-shot network called NETNet is constructed for scale-aware object detection.
arXiv Detail & Related papers (2020-01-18T15:21:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.