Fugu-MT 論文翻訳(概要): One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control

論文の概要: One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control

arxiv url: http://arxiv.org/abs/2603.18093v1
Date: Wed, 18 Mar 2026 09:32:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-20 17:19:05.760015
Title: One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control
Title（参考訳）: ワン・ツー・モーア:注意制御による高忠実度トレーニングフリーな異常発生
Authors: Haoxiang Rao, Zhao Wang, Chenyang Si, Yan Lyu, Yuanyi Duan, Fang Zhao, Caifeng Shan,
Abstract要約: 工業的異常検出 (AD) は, 正常な画像が多数存在するが, 異常な画像が不足していることが特徴である。我々は,より現実的な異常を合成するために,訓練不要な小ショット異常生成手法,すなわちO2MAGを提案する。
参考スコア（独自算出の注目度）: 31.036186754161225
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Industrial anomaly detection (AD) is characterized by an abundance of normal images but a scarcity of anomalous ones. Although numerous few-shot anomaly synthesis methods have been proposed to augment anomalous data for downstream AD tasks, most existing approaches require time-consuming training and struggle to learn distributions that are faithful to real anomalies, thereby restricting the efficacy of AD models trained on such data. To address these limitations, we propose a training-free few-shot anomaly generation method, namely O2MAG, which leverages the self-attention in One reference anomalous image to synthesize More realistic anomalies, supporting effective downstream anomaly detection. Specifically, O2MAG manipulates three parallel diffusion processes via self-attention grafting and incorporates the anomaly mask to mitigate foreground-background query confusion, synthesizing text-guided anomalies that closely adhere to real anomalous distributions. To bridge the semantic gap between the encoded anomaly text prompts and the true anomaly semantics, Anomaly-Guided Optimization is further introduced to align the synthesis process with the target anomalous distribution, steering the generation toward realistic and text-consistent anomalies. Moreover, to mitigate faint anomaly synthesis inside anomaly masks, Dual-Attention Enhancement is adopted during generation to reinforce both self- and cross-attention on masked regions. Extensive experiments validate the effectiveness of O2MAG, demonstrating its superior performance over prior state-of-the-art methods on downstream AD tasks.
Abstract（参考訳）: 工業的異常検出 (AD) は, 正常な画像が多数存在するが, 異常な画像が不足していることが特徴である。下流ADタスクの異常データを増大させるためには、多数の数発の異常合成法が提案されているが、既存のアプローチの多くは、実際の異常に忠実な分布の学習に時間を要するため、そのようなデータに基づいて訓練されたADモデルの有効性を制限している。これらの制約に対処するために,1つの参照異常画像における自己アテンションを利用して,より現実的な異常を合成し,効果的な下流異常検出を支援する,トレーニング不要な小ショット異常生成手法であるO2MAGを提案する。具体的には、O2MAGは、自己アテンショングラフトにより3つの平行拡散過程を操作し、異常マスクを組み込んで、前景と背景のクエリの混乱を緩和し、実際の異常分布に密着したテキスト誘導異常を合成する。符号化された異常テキストプロンプトと真の異常セマンティクスとのセマンティクスギャップを埋めるため、Anomaly-Guided Optimizationが導入された。さらに、異常マスク内におけるかすかな異常合成を緩和するために、世代間に二重注意増強を採用し、マスク領域における自己および交差注意の強化を図った。大規模な実験によりO2MAGの有効性が検証され、下流ADタスクにおける先行技術よりも優れた性能を示した。

論文の概要: One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control

関連論文リスト