Fugu-MT 論文翻訳(概要): Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

論文の概要: Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

arxiv url: http://arxiv.org/abs/2606.09670v1
Date: Mon, 08 Jun 2026 15:52:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:07.486091
Title: Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision
Title（参考訳）: Dual-Teacher Supervision による特徴再構成に基づく異常検出を実現する Visual Prompting
Authors: Mateo Diaz-Bone, Daniel Caraballo, Florian Scheidegger, Thomas Frick, Mattia Rigotti, Andrea Bartezzaghi, Roy Assaf, Niccolo Avogaro, Yagmur G. Cinar, Brown Ebouky, Filip M. Janicki, Piotr S. Kluska, Cezary Skura, Cristiano Malossi,
Abstract要約: 前景のマスキングを用いてオブジェクトを分離する視覚的プロンプトパイプラインを導入する。また,学習者モデルにおける教師の解凍機構を導入し,ドメイン適応性を向上させる。 AeBADデータセットの課題に対して,前回の最先端よりも3.5パーセントの改善を実現した。
参考スコア（独自算出の注目度）: 7.548589503544904
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent Anomaly Detection methods achieve perfect detection and segmentation scores on well-established datasets, such as MVTec. However, many of these methods face challenges when foundational assumptions - such as consistent object scale, viewpoint, background, illumination, and centered placement - are violated. Those variations that occur render anomaly detection methods unusable in many real-world scenarios. To address these limitations, we introduce three key contributions: (1) a visual prompting pipeline that isolates objects using foreground-background masking; (2) a mechanism for unfreezing the teacher in student-teacher models to improve domain adaptability; and (3) a data augmentation strategy leveraging diffusion-generated synthetic images to enhance anomaly detection performance. We achieve a 3.5 percentage point improvement over the previous state-of-the-art on the challenging AeBAD dataset by using the Masked Multiscale Reconstruction (MMR) model as our backbone.
Abstract（参考訳）: 最近の異常検出手法は、MVTecのような確立されたデータセット上で完全な検出とセグメンテーションスコアを達成する。しかし、これらの手法の多くは、一貫性のあるオブジェクトスケール、視点、背景、照明、中心的な配置など、基本的な仮定が違反する場合に直面する。多くの現実のシナリオでは使用不可能なレンダリング異常検出メソッドが発生する。これらの制約に対処するため,(1)前景マスキングを用いて物体を分離する視覚的プロンプトパイプライン,(2)教師がドメイン適応性を向上させるために教師を解放するメカニズム,(3)拡散合成画像を利用したデータ拡張戦略による異常検出性能の向上,の3つの主要な貢献を紹介する。我々は,Masked Multiscale Reconstruction(MMR)モデルをバックボーンとして使用することにより,従来のAeBADデータセットよりも3.5パーセントの改善を実現した。

論文の概要: Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

関連論文リスト