Fugu-MT 論文翻訳(概要): ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection

論文の概要: ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection

arxiv url: http://arxiv.org/abs/2510.18342v1
Date: Tue, 21 Oct 2025 06:51:30 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:13.035761
Title: ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection
Title（参考訳）: ShortcutBreaker: Multi-class Unsupervised Anomaly Detection のための大域摂動注意付き低域雑音ボツネック
Authors: Peng Tang, Xiaoxiao Yan, Xiaobin Hu, Yuning Cui, Donghao Luo, Jiangning Zhang, Pengcheng Xu, Jinlong Peng, Qingdong He, Feiyue Huang, Song Xue, Tobias Lasser,
Abstract要約: ShortcutBreakerはMUADタスクのための新しい統合された機能再構成フレームワークである。ショートカットの問題に対処する2つの重要なイノベーションが特徴だ。提案手法は,4つのデータセットに対して,99.8%,98.9%,90.6%,87.8%の顕著な画像レベルのAUROCを実現する。
参考スコア（独自算出の注目度）: 59.89803740308262
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-class unsupervised anomaly detection (MUAD) has garnered growing research interest, as it seeks to develop a unified model for anomaly detection across multiple classes, i.e., eliminating the need to train separate models for distinct objects and thereby saving substantial computational resources. Under the MUAD setting, while advanced Transformer-based architectures have brought significant performance improvements, identity shortcuts persist: they directly copy inputs to outputs, narrowing the gap in reconstruction errors between normal and abnormal cases, and thereby making the two harder to distinguish. Therefore, we propose ShortcutBreaker, a novel unified feature-reconstruction framework for MUAD tasks, featuring two key innovations to address the issue of shortcuts. First, drawing on matrix rank inequality, we design a low-rank noisy bottleneck (LRNB) to project highdimensional features into a low-rank latent space, and theoretically demonstrate its capacity to prevent trivial identity reproduction. Second, leveraging ViTs global modeling capability instead of merely focusing on local features, we incorporate a global perturbation attention to prevent information shortcuts in the decoders. Extensive experiments are performed on four widely used anomaly detection benchmarks, including three industrial datasets (MVTec-AD, ViSA, and Real-IAD) and one medical dataset (Universal Medical). The proposed method achieves a remarkable image-level AUROC of 99.8%, 98.9%, 90.6%, and 87.8% on these four datasets, respectively, consistently outperforming previous MUAD methods across different scenarios.
Abstract（参考訳）: MUAD(Multi-class Unsupervised Anomaly Detection)は、複数のクラスにまたがる異常検出のための統一モデルを開発することを目的として、研究の関心が高まっている。 MUAD設定の下では、高度なTransformerベースのアーキテクチャは大幅なパフォーマンス向上をもたらしたが、IDショートカットは出力に直接入力をコピーし、通常のケースと異常ケースの間の再構成エラーのギャップを狭め、区別を困難にしている。そこで本稿では,MUADタスクのための新しい統合機能再構築フレームワークであるShortcutBreakerを提案する。まず,行列階不等式に基づいて,高次元特徴を低ランク潜在空間に投影する低ランク雑音ボトルネック(LRNB)を設計し,その有効性を理論的に示す。第2に、局所的な特徴にのみ焦点をあてるのではなく、ViTのグローバルモデリング機能を活用して、デコーダにおける情報ショートカットを防ぐために、グローバルな摂動注意を組み込む。 3つの産業用データセット(MVTec-AD, ViSA, Real-IAD)と1つの医療用データセット(Universal Medical)を含む、広く使用されている4つの異常検出ベンチマークで、広範囲にわたる実験が行われた。提案手法は,これらの4つのデータセットにおいて,99.8%,98.9%,90.6%,87.8%の顕著な画像レベルのAUROCを実現する。

論文の概要: ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection

関連論文リスト