Fugu-MT 論文翻訳(概要): AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer

論文の概要: AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer

arxiv url: http://arxiv.org/abs/2511.06687v1
Date: Mon, 10 Nov 2025 04:20:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-11 21:18:45.070189
Title: AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer
Title（参考訳）: AnoStyler:軽量スタイル転送によるテキスト駆動の局所的異常生成
Authors: Yulim So, Seokho Kang,
Abstract要約: AnoStylerはテキスト誘導型転送としてゼロショット異常生成をフレーム化する。 CLIPベースの損失関数で訓練された軽量なU-Netモデルを用いて、通常の画像を視覚的にリアルな異常画像にスタイリングする。
参考スコア（独自算出の注目度）: 14.610038284393164
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Anomaly generation has been widely explored to address the scarcity of anomaly images in real-world data. However, existing methods typically suffer from at least one of the following limitations, hindering their practical deployment: (1) lack of visual realism in generated anomalies; (2) dependence on large amounts of real images; and (3) use of memory-intensive, heavyweight model architectures. To overcome these limitations, we propose AnoStyler, a lightweight yet effective method that frames zero-shot anomaly generation as text-guided style transfer. Given a single normal image along with its category label and expected defect type, an anomaly mask indicating the localized anomaly regions and two-class text prompts representing the normal and anomaly states are generated using generalizable category-agnostic procedures. A lightweight U-Net model trained with CLIP-based loss functions is used to stylize the normal image into a visually realistic anomaly image, where anomalies are localized by the anomaly mask and semantically aligned with the text prompts. Extensive experiments on the MVTec-AD and VisA datasets show that AnoStyler outperforms existing anomaly generation methods in generating high-quality and diverse anomaly images. Furthermore, using these generated anomalies helps enhance anomaly detection performance.
Abstract（参考訳）: 実世界のデータにおける異常画像の不足に対処するために、異常生成が広く研究されている。しかし、既存の手法では、(1)生成した異常における視覚的リアリズムの欠如、(2)大量の実画像への依存、(3)メモリ集約の重厚なモデルアーキテクチャの使用など、現実的な展開を妨げている。これらの制限を克服するために,テキスト誘導型転送としてゼロショット異常生成をフレーム化する軽量かつ効果的なAnoStylerを提案する。局所異常領域を示す異常マスクと、正常状態と異常状態を表す2種類のテキストプロンプトとを、そのカテゴリラベルと期待欠陥タイプとともに1つの正常画像を生成する。 CLIPベースの損失関数で訓練された軽量なU-Netモデルは、正常な画像を視覚的にリアルな異常画像にスタイリングするために使用され、異常は異常マスクによって局所化され、テキストプロンプトに意味的に整合する。 MVTec-ADとVisAデータセットの大規模な実験により、AnoStylerは、高品質で多様な異常画像を生成するために、既存の異常生成方法より優れていることが示された。さらに、これらの生成異常を用いることで、異常検出性能が向上する。

論文の概要: AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer

関連論文リスト