Fugu-MT 論文翻訳(概要): MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection

論文の概要: MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection

arxiv url: http://arxiv.org/abs/2603.13507v1
Date: Fri, 13 Mar 2026 18:36:44 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.248225
Title: MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection
Title（参考訳）: MIRAGE: モデルに依存しない産業的リアルな異常生成と視覚的異常検出のための評価
Authors: Jinwei Hu, Francesco Borsatti, Arianna Stropeni, Davide Dalle Pezze, Manuel Barusco, Gian Antonio Susto,
Abstract要約: MIRAGEは、リアルな異常画像生成とピクセルレベルのマスク生成のための完全に自動化されたパイプラインである。当社のパイプラインでは,生成モデルをAPIコールを通じてブラックボックスとしてアクセスしています。マスク生成を大規模に行うためには、軽量でトレーニング不要なデュアルブランチセマンティックチェンジ検出モジュールを導入する。
参考スコア（独自算出の注目度）: 11.313360174738465
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Industrial visual anomaly detection (VAD) methods are typically trained on normal samples only, yet performance improves substantially when even limited anomalous data is available. Existing anomaly generation approaches either require real anomalous examples, demand expensive hardware, or produce synthetic defects that lack realism. We present MIRAGE (Model-agnostic Industrial Realistic Anomaly Generation and Evaluation), a fully automated pipeline for realistic anomalous image generation and pixel-level mask creation that requires no training and no anomalous images. Our pipeline accesses any generative model as a black box via API calls, uses a VLM for automatic defect prompt generation, and includes a CLIP-based quality filter to retain only well-aligned generated images. For mask generation at scale, we introduce a lightweight, training-free dual-branch semantic change detection module combining text-conditioned Grounding DINO features with fine-grained YOLOv26-Seg structural features. We benchmark four generation methods using Gemini 2.5 Flash Image (Nano Banana) as the generative backbone, evaluating performance on MVTec AD and VisA across two distinct tasks: (i) downstream anomaly segmentation and (ii) visual quality of the generated images, assessed via standard metrics (IS, IC-LPIPS) and a human perceptual study involving 31 participants and 1,550 pairwise votes. The results demonstrate that MIRAGE offers a scalable, accessible foundation for anomaly-aware industrial inspection that requires no real defect data. As a final contribution, we publicly release a large-scale dataset comprising 500 image-mask pairs per category for every MVTec AD and VisA class, over 13,000 pairs in total, alongside all generation prompts and pipeline code.
Abstract（参考訳）: 産業用視覚異常検出法(VAD)は通常、通常のサンプルのみで訓練されるが、限られた異常データであっても性能は大幅に向上する。既存の異常生成アプローチは、実際の異常な例を必要とするか、高価なハードウェアを必要とするか、現実性に欠ける合成欠陥を生成するかのいずれかである。 MIRAGE(Model-Agnostic Industrial Realistic Anomaly Generation and Evaluation)は、リアルな異常画像生成と画素レベルのマスク生成のための完全自動化パイプラインであり、トレーニングを必要とせず、異常画像も不要である。パイプラインはAPIコールを通じて生成モデルをブラックボックスとしてアクセスし、自動欠陥プロンプト生成にVLMを使用し、CLIPベースの品質フィルタを組み込んで、整列した生成画像のみを保持する。マスク生成を大規模に行うために,テキストコンディショニングDINO機能とYOLOv26-Seg構造を併用した,軽量でトレーニング不要な2分岐意味変化検出モジュールを導入する。 Gemini 2.5 Flash Image (Nano Banana) を生成バックボーンとし、MVTec ADとVisAの性能を2つの異なるタスクで評価した。 (i)下流異常セグメンテーション (i)標準指標(IS,IC-LPIPS)を用いて評価した生成画像の視覚的品質と,31人の参加者と1,550人の対決による人間の知覚調査を行った。その結果,MIRAGEは異常データを必要としない産業検査のためのスケーラブルでアクセシブルな基盤を提供することを示した。最後のコントリビューションとして、MVTec ADとVisAクラス毎に500のイメージマスクペアからなる大規模データセットを公開し、すべての生成プロンプトとパイプラインコードとともに合計13,000以上のペアを出力した。

論文の概要: MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection

関連論文リスト