Related papers: Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection

Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection

URL: http://arxiv.org/abs/2507.02454v1
Date: Thu, 03 Jul 2025 09:11:31 GMT
Title: Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection
Authors: Weiwei Duan, Luping Ji, Shengjia Chen, Sicheng Zhu, Jianghong Huang, Mao Ye,
Abstract summary: Moving infrared small target detection faces huge challenges due to tiny target size and weak background contrast.<n>Currently, most existing methods are fully-supervised, heavily relying on a large number of manual target-wise annotations.<n>This paper proposes a new weakly-supervised contrastive learning (WeCoL) scheme, only requires simple target quantity prompts during model training.
Score: 11.930404803127358
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Different from general object detection, moving infrared small target detection faces huge challenges due to tiny target size and weak background contrast.Currently, most existing methods are fully-supervised, heavily relying on a large number of manual target-wise annotations. However, manually annotating video sequences is often expensive and time-consuming, especially for low-quality infrared frame images. Inspired by general object detection, non-fully supervised strategies ($e.g.$, weakly supervised) are believed to be potential in reducing annotation requirements. To break through traditional fully-supervised frameworks, as the first exploration work, this paper proposes a new weakly-supervised contrastive learning (WeCoL) scheme, only requires simple target quantity prompts during model training.Specifically, in our scheme, based on the pretrained segment anything model (SAM), a potential target mining strategy is designed to integrate target activation maps and multi-frame energy accumulation.Besides, contrastive learning is adopted to further improve the reliability of pseudo-labels, by calculating the similarity between positive and negative samples in feature subspace.Moreover, we propose a long-short term motion-aware learning scheme to simultaneously model the local motion patterns and global motion trajectory of small targets.The extensive experiments on two public datasets (DAUB and ITSDT-15K) verify that our weakly-supervised scheme could often outperform early fully-supervised methods. Even, its performance could reach over 90\% of state-of-the-art (SOTA) fully-supervised ones.

Related papers

From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision [18.555485444818835]
We construct an innovative Progressive Active Learning (PAL) framework for single point supervision.<n>We propose a model pre-start concept, which focuses on automatically selecting a portion of easy samples.<n>We show that existing SIRST detection networks equipped with our PAL framework have achieved state-of-the-art (SOTA) results on multiple public datasets.
arXiv Detail & Related papers (2024-12-15T11:08:49Z)
Robust infrared small target detection using self-supervised and a contrario paradigms [1.2224547302812558]
We introduce a novel approach that combines a contrario paradigm with Self-Supervised Learning (SSL) to improve Infrared Small Target Detection (IRSTD) On the one hand, the integration of an a contrario criterion into a YOLO detection head enhances feature map responses for small and unexpected objects while effectively controlling false alarms. Our findings show that instance discrimination methods outperform masked image modeling strategies when applied to YOLO-based small object detection.
arXiv Detail & Related papers (2024-10-09T21:08:57Z)
Refined Infrared Small Target Detection Scheme with Single-Point Supervision [2.661766509317245]
We propose an innovative refined infrared small target detection scheme with single-point supervision. The proposed scheme achieves state-of-the-art (SOTA) performance. Notably, the proposed scheme won the third place in the "ICPR 2024 Resource-Limited Infrared Small Target Detection Challenge Track 1: Weakly Supervised Infrared Small Target Detection"
arXiv Detail & Related papers (2024-08-05T18:49:58Z)
ACTRESS: Active Retraining for Semi-supervised Visual Grounding [52.08834188447851]
A previous study, RefTeacher, makes the first attempt to tackle this task by adopting the teacher-student framework to provide pseudo confidence supervision and attention-based supervision. This approach is incompatible with current state-of-the-art visual grounding models, which follow the Transformer-based pipeline. Our paper proposes the ACTive REtraining approach for Semi-Supervised Visual Grounding, abbreviated as ACTRESS.
arXiv Detail & Related papers (2024-07-03T16:33:31Z)
Enhancing Infrared Small Target Detection Robustness with Bi-Level Adversarial Framework [61.34862133870934]
We propose a bi-level adversarial framework to promote the robustness of detection in the presence of distinct corruptions. Our scheme remarkably improves 21.96% IOU across a wide array of corruptions and notably promotes 4.97% IOU on the general benchmark.
arXiv Detail & Related papers (2023-09-03T06:35:07Z)
Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner. We design a semantic-guided self-supervised learning model to extract high-level semantic features from images. We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z)
Semi-supervised learning made simple with self-supervised clustering [65.98152950607707]
Self-supervised learning models have been shown to learn rich visual representations without requiring human annotations. We propose a conceptually simple yet empirically powerful approach to turn clustering-based self-supervised methods into semi-supervised learners.
arXiv Detail & Related papers (2023-06-13T01:09:18Z)
Sample, Crop, Track: Self-Supervised Mobile 3D Object Detection for Urban Driving LiDAR [43.971680545189756]
We propose a new self-supervised mobile object detection approach called SCT. This uses both motion cues and expected object sizes to improve detection performance. We significantly outperform the state-of-the-art self-supervised mobile object detection method TCR on the KITTI tracking benchmark.
arXiv Detail & Related papers (2022-09-21T16:12:46Z)
Texture-guided Saliency Distilling for Unsupervised Salient Object Detection [67.10779270290305]
We propose a novel USOD method to mine rich and accurate saliency knowledge from both easy and hard samples. Our method achieves state-of-the-art USOD performance on RGB, RGB-D, RGB-T, and video SOD benchmarks.
arXiv Detail & Related papers (2022-07-13T02:01:07Z)
Activation to Saliency: Forming High-Quality Labels for Unsupervised Salient Object Detection [54.92703325989853]
We propose a two-stage Activation-to-Saliency (A2S) framework that effectively generates high-quality saliency cues. No human annotations are involved in our framework during the whole training process. Our framework reports significant performance compared with existing USOD methods.
arXiv Detail & Related papers (2021-12-07T11:54:06Z)
Cascade Attentive Dropout for Weakly Supervised Object Detection [7.697578661762592]
Weakly supervised object detection (WSOD) aims to classify and locate objects with only image-level supervision. Many WSOD approaches adopt multiple instance learning as the initial model, which is prone to converge to the most discriminative object regions. We propose a novel cascade attentive dropout strategy to alleviate the part domination problem, together with an improved global context module.
arXiv Detail & Related papers (2020-11-20T08:08:13Z)
Progressive Object Transfer Detection [84.48927705173494]
We propose a novel Progressive Object Transfer Detection (POTD) framework. First, POTD can leverage various object supervision of different domains effectively into a progressive detection procedure. Second, POTD consists of two delicate transfer stages, i.e., Low-Shot Transfer Detection (LSTD), and Weakly-Supervised Transfer Detection (WSTD)
arXiv Detail & Related papers (2020-02-12T00:16:24Z)

This list is automatically generated from the titles and abstracts of the papers in this site.