Fugu-MT 論文翻訳(概要): PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments

論文の概要: PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments

arxiv url: http://arxiv.org/abs/2508.14504v1
Date: Wed, 20 Aug 2025 07:53:13 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-21 16:52:41.382869
Title: PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments
Title（参考訳）: PB-IAD:動的製造環境における意味的産業異常検出のためのマルチモーダル基礎モデルの利用
Authors: Bernd Hofmann, Albert Scheck, Joerg Franke, Patrick Bruendl,
Abstract要約: 本稿では, PB-IAD (Prompt-based Industrial Anomaly Detection) を提案する。データ分散性、アジャイル適応性、ドメインユーザ中心性という、動的運用環境の3つの重要な要件に対処する。 PatchCoreのような異常検出のための最先端のメソッドにベンチマークされる。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The detection of anomalies in manufacturing processes is crucial to ensure product quality and identify process deviations. Statistical and data-driven approaches remain the standard in industrial anomaly detection, yet their adaptability and usability are constrained by the dependence on extensive annotated datasets and limited flexibility under dynamic production conditions. Recent advances in the perception capabilities of foundation models provide promising opportunities for their adaptation to this downstream task. This paper presents PB-IAD (Prompt-based Industrial Anomaly Detection), a novel framework that leverages the multimodal and reasoning capabilities of foundation models for industrial anomaly detection. Specifically, PB-IAD addresses three key requirements of dynamic production environments: data sparsity, agile adaptability, and domain user centricity. In addition to the anomaly detection, the framework includes a prompt template that is specifically designed for iteratively implementing domain-specific process knowledge, as well as a pre-processing module that translates domain user inputs into effective system prompts. This user-centric design allows domain experts to customise the system flexibly without requiring data science expertise. The proposed framework is evaluated by utilizing GPT-4.1 across three distinct manufacturing scenarios, two data modalities, and an ablation study to systematically assess the contribution of semantic instructions. Furthermore, PB-IAD is benchmarked to state-of-the-art methods for anomaly detection such as PatchCore. The results demonstrate superior performance, particularly in data-sparse scenarios and low-shot settings, achieved solely through semantic instructions.
Abstract（参考訳）: 製造プロセスにおける異常の検出は、製品品質の確保とプロセスの偏差の特定に不可欠である。統計的およびデータ駆動型アプローチは、産業的異常検出の標準であり続けているが、それらの適応性とユーザビリティは、広範な注釈付きデータセットへの依存と、動的生産条件下での柔軟性の制限によって制約される。基礎モデルの認識能力の最近の進歩は、この下流課題に適応するための有望な機会を提供する。本稿では, PB-IAD (Prompt-based Industrial Anomaly Detection) を提案する。具体的には、PB-IADは、データ分散性、アジャイル適応性、ドメインユーザ中心性という、動的生産環境の3つの重要な要件に対処する。このフレームワークは異常検出に加えて、ドメイン固有のプロセス知識を反復的に実装するために特別に設計されたプロンプトテンプレートと、ドメインユーザの入力を効果的なシステムプロンプトに変換する前処理モジュールを含んでいる。このユーザ中心の設計により、ドメインの専門家は、データサイエンスの専門知識を必要とせずに、柔軟にシステムをカスタマイズできる。提案フレームワークは,3つの異なる製造シナリオ,2つのデータモダリティ,および意味的指示の貢献を体系的に評価するアブレーションスタディを用いて,GPT-4.1の評価を行う。さらに、PB-IADはPatchCoreのような異常検出のための最先端の手法にベンチマークされる。その結果、特にデータスパースシナリオやローショット設定において、セマンティックインストラクションによってのみ達成される優れたパフォーマンスが示された。

論文の概要: PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments

関連論文リスト