Fugu-MT 論文翻訳(概要): AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection

論文の概要: AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection

arxiv url: http://arxiv.org/abs/2508.06203v1
Date: Fri, 08 Aug 2025 10:33:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-11 20:39:06.198197
Title: AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection
Title（参考訳）: AnomalyMoE:Unified Visual Anomaly Detectionのための言語自由ジェネリストモデルを目指して
Authors: Zhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen, Wei Ge, Ming Tang, Jinqiao Wang,
Abstract要約: AnomalyMoEは、Mixture-of-Expertsアーキテクチャに基づいた、新しく普遍的な異常検出フレームワークである。我々の重要な洞察は、複雑な異常検出問題を3つの異なる意味階層に分解することである。 AnomalyMoEはパッチ、コンポーネント、グローバルレベルで3つの専用の専門家ネットワークを使用している。
参考スコア（独自算出の注目度）: 29.06542941993374
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Anomaly detection is a critical task across numerous domains and modalities, yet existing methods are often highly specialized, limiting their generalizability. These specialized models, tailored for specific anomaly types like textural defects or logical errors, typically exhibit limited performance when deployed outside their designated contexts. To overcome this limitation, we propose AnomalyMoE, a novel and universal anomaly detection framework based on a Mixture-of-Experts (MoE) architecture. Our key insight is to decompose the complex anomaly detection problem into three distinct semantic hierarchies: local structural anomalies, component-level semantic anomalies, and global logical anomalies. AnomalyMoE correspondingly employs three dedicated expert networks at the patch, component, and global levels, and is specialized in reconstructing features and identifying deviations at its designated semantic level. This hierarchical design allows a single model to concurrently understand and detect a wide spectrum of anomalies. Furthermore, we introduce an Expert Information Repulsion (EIR) module to promote expert diversity and an Expert Selection Balancing (ESB) module to ensure the comprehensive utilization of all experts. Experiments on 8 challenging datasets spanning industrial imaging, 3D point clouds, medical imaging, video surveillance, and logical anomaly detection demonstrate that AnomalyMoE establishes new state-of-the-art performance, significantly outperforming specialized methods in their respective domains.
Abstract（参考訳）: 異常検出は、多くの領域やモダリティにまたがる重要なタスクであるが、既存の手法はしばしば高度に特殊化され、一般化可能性を制限する。これらの特殊なモデルは、テクスチャ欠陥や論理的エラーのような特定の異常タイプに適したもので、通常、指定されたコンテキスト外に配置されたときに限られたパフォーマンスを示す。この制限を克服するために,Mixture-of-Experts (MoE)アーキテクチャに基づいた,新しい,普遍的な異常検出フレームワークであるAnomalyMoEを提案する。我々の重要な洞察は、複雑な異常検出問題を、局所的な構造的異常、コンポーネントレベルの意味的異常、大域的な論理的異常の3つの異なる階層に分解することである。 AnomalyMoEはパッチ、コンポーネント、グローバルレベルで3つの専門的ネットワークを使用し、特徴の再構築と指定されたセマンティックレベルでの偏差の特定に特化している。この階層的設計により、単一のモデルで広い範囲の異常を同時に理解し、検出することができる。さらに、専門家の多様性を促進するためのエキスパートインフォメーション・リパルス(EIR)モジュールと、専門家全員の総合的な利用を保証する専門家選択バランス(ESB)モジュールも導入する。産業画像、3Dポイントクラウド、医用画像、ビデオ監視、論理異常検出を含む8つの挑戦的なデータセットの実験は、AnomalyMoEが新しい最先端のパフォーマンスを確立し、それぞれのドメインにおける特殊手法を著しく上回っていることを示す。

論文の概要: AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection

関連論文リスト