Fugu-MT 論文翻訳(概要): Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection

論文の概要: Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection

arxiv url: http://arxiv.org/abs/2604.09991v1
Date: Sat, 11 Apr 2026 02:31:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:15.782186
Title: Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection
Title（参考訳）: 赤外小ターゲット検出のためのスケールロス関数とガウス形状変換の再検討
Authors: Hao Li, Man Fung Zhuo,
Abstract要約: 赤外線小目標検出は、非単調なスケール損失関数からの不安定性のトレーニングと、空間的注意の欠如という2つの永続的な課題に直面している。本稿では,予測マスクと地上真実との符号付き面積差に応じて予測を重み付けするエンフディフに基づくスケール損失を提案する。空間側では、赤外小目標の中心集中強度プロファイルと一致するように、学習可能なスケールパラメータを持つEmphGaussian型畳み込みを導入する。
参考スコア（独自算出の注目度）: 5.238802422720473
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Infrared small target detection still faces two persistent challenges: training instability from non-monotonic scale loss functions, and inadequate spatial attention due to generic convolution kernels that ignore the physical imaging characteristics of small targets. In this paper, we revisit both aspects. For the loss side, we propose a \emph{diff-based scale loss} that weights predictions according to the signed area difference between the predicted mask and the ground truth, yielding strictly monotonic gradients and stable convergence. We further analyze a family of four scale loss variants to understand how their geometric properties affect detection behavior. For the spatial side, we introduce \emph{Gaussian-shaped convolution} with a learnable scale parameter to match the center-concentrated intensity profile of infrared small targets, and augment it with a \emph{rotated pinwheel mask} that adaptively aligns the kernel with target orientation via a straight-through estimator. Extensive experiments on IRSTD-1k, NUDT-SIRST, and SIRST-UAVB demonstrate consistent improvements in $mIoU$, $P_d$, and $F_a$ over state-of-the-art methods. We release our anonymous code and pretrained models.
Abstract（参考訳）: 赤外線小ターゲット検出は、非単調スケール損失関数からの不安定性のトレーニングと、小さなターゲットの物理的イメージング特性を無視した一般的な畳み込みカーネルによる空間的注意の不足という、2つの永続的な課題に直面している。本稿では,両側面について考察する。損失側には,予測マスクと地上真実との符号付き領域差に基づいて予測を重み付けし,厳密な単調勾配と安定収束を与える「emph{diff-based scale loss」を提案する。さらに,その幾何学的特性が検出行動にどう影響するかを理解するために,4種類のスケール損失変異の族を解析する。空間側では、赤外小目標の中心集中強度プロファイルと一致させるために、学習可能なスケールパラメータを持つ \emph{Gaussian-shaped convolution} を導入し、ストレートスルー推定器を介してカーネルと目標方向を適応的にアライメントする \emph{rotated pinwheel mask} で拡張する。 IRSTD-1k、NUDT-SIRST、SIRST-UAVBの大規模な実験では、mIoU$、$P_d$、$F_a$の最先端メソッドに対する一貫した改善が示されている。匿名のコードと事前訓練されたモデルをリリースします。

論文の概要: Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection

関連論文リスト