Fugu-MT 論文翻訳(概要): Unified Unsupervised Anomaly Detection via Matching Cost Filtering

論文の概要: Unified Unsupervised Anomaly Detection via Matching Cost Filtering

arxiv url: http://arxiv.org/abs/2510.03363v2
Date: Wed, 08 Oct 2025 12:00:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 14:21:18.186925
Title: Unified Unsupervised Anomaly Detection via Matching Cost Filtering
Title（参考訳）: マッチングコストフィルタリングによる統一的教師なし異常検出
Authors: Zhe Zhang, Mingxiu Cai, Gaochang Wu, Jing Zhang, Lingqiao Liu, Dacheng Tao, Tianyou Chai, Xiatian Zhu,
Abstract要約: 教師なし異常検出(UAD)は、通常のトレーニングデータのみを用いて画像レベルの異常と画素レベルの異常を識別することを目的としている。 UADモデルの異常コスト量を補正するための汎用的なポストホック精錬フレームワークであるUnified Cost Filtering (UCF) を提案する。
参考スコア（独自算出の注目度）: 113.43366521994396
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Unsupervised anomaly detection (UAD) aims to identify image- and pixel-level anomalies using only normal training data, with wide applications such as industrial inspection and medical analysis, where anomalies are scarce due to privacy concerns and cold-start constraints. Existing methods, whether reconstruction-based (restoring normal counterparts) or embedding-based (pretrained representations), fundamentally conduct image- or feature-level matching to generate anomaly maps. Nonetheless, matching noise has been largely overlooked, limiting their detection ability. Beyond earlier focus on unimodal RGB-based UAD, recent advances expand to multimodal scenarios, e.g., RGB--3D and RGB--Text, enabled by point cloud sensing and vision--language models. Despite shared challenges, these lines remain largely isolated, hindering a comprehensive understanding and knowledge transfer. In this paper, we advocate unified UAD for both unimodal and multimodal settings in the matching perspective. Under this insight, we present Unified Cost Filtering (UCF), a generic post-hoc refinement framework for refining anomaly cost volume of any UAD model. The cost volume is constructed by matching a test sample against normal samples from the same or different modalities, followed by a learnable filtering module with multi-layer attention guidance from the test sample, mitigating matching noise and highlighting subtle anomalies. Comprehensive experiments on 22 diverse benchmarks demonstrate the efficacy of UCF in enhancing a variety of UAD methods, consistently achieving new state-of-the-art results in both unimodal (RGB) and multimodal (RGB--3D, RGB--Text) UAD scenarios. Code and models will be released at https://github.com/ZHE-SAPI/CostFilter-AD.
Abstract（参考訳）: 教師なし異常検出(UAD)は、通常のトレーニングデータのみを用いて画像レベルの異常と画素レベルの異常を識別することを目的としており、産業検査や医療分析などの幅広い応用において、プライバシー上の懸念やコールドスタート制約により異常が不足している。既存の手法では、再構成ベース(通常の表現を復元する)や埋め込みベース(事前制限された表現)のいずれでも、画像レベルのマッチングを根本的に実施して、異常マップを生成する。それでも、一致したノイズはほとんど見落とされ、検出能力は制限されている。従来は単調なRGBベースのUDDに重点を置いていたが、最近の進歩は、ポイントクラウドセンシングとビジョン言語モデルによって実現されたマルチモーダルシナリオ(例えば、RGB--3D、RGB-Text)にまで拡大している。共有された課題にもかかわらず、これらの線は主として孤立しており、包括的な理解と知識の伝達を妨げる。本稿では,一元的および多元的設定の統一的 UAD を,マッチングの観点から提唱する。この知見に基づき,UADモデルの異常コスト量を補修するための汎用的なポストホック改質フレームワークであるUnified Cost Filtering (UCF)を提案する。コスト容積は、同一又は異なるモダリティの通常のサンプルとテストサンプルをマッチングし、次いで、テストサンプルから多層注意誘導付き学習可能なフィルタリングモジュールを作成し、マッチングノイズを軽減し、微妙な異常を強調することによって構成される。 22の多種多様なベンチマークに関する総合的な実験は、UAD手法の強化におけるUCFの有効性を実証し、Unimodal (RGB) と Multimodal (RGB--3D, RGB-Text) UAD のシナリオにおいて、新しい最先端の結果を一貫して達成している。コードとモデルはhttps://github.com/ZHE-SAPI/CostFilter-ADでリリースされる。

論文の概要: Unified Unsupervised Anomaly Detection via Matching Cost Filtering

関連論文リスト