Fugu-MT 論文翻訳(概要): Generalist Multi-Class Anomaly Detection via Distillation to Two Heterogeneous Student Networks

論文の概要: Generalist Multi-Class Anomaly Detection via Distillation to Two Heterogeneous Student Networks

arxiv url: http://arxiv.org/abs/2509.24448v1
Date: Mon, 29 Sep 2025 08:31:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.864564
Title: Generalist Multi-Class Anomaly Detection via Distillation to Two Heterogeneous Student Networks
Title（参考訳）: 2つの異種学生ネットワークへの蒸留による総合的多クラス異常検出
Authors: Hangil Park, Yongmin Seo, Tae-Kyun Kim,
Abstract要約: 異常検出は、様々な現実世界のアプリケーションにおいて重要な役割を果たす。最近の手法では、一般的な異常検出に対処しようと試みているが、その性能はデータセット固有の設定や単一クラスタスクに敏感である。本稿では,このギャップを埋めるために,知識蒸留(KD)に基づく新しい二重モデルアンサンブル手法を提案する。
参考スコア（独自算出の注目度）: 11.543429175824905
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Anomaly detection (AD) plays an important role in various real-world applications. Recent advancements in AD, however, are often biased towards industrial inspection, struggle to generalize to broader tasks like semantic anomaly detection and vice versa. Although recent methods have attempted to address general anomaly detection, their performance remains sensitive to dataset-specific settings and single-class tasks. In this paper, we propose a novel dual-model ensemble approach based on knowledge distillation (KD) to bridge this gap. Our framework consists of a teacher and two student models: an Encoder-Decoder model, specialized in detecting patch-level minor defects for industrial AD and an Encoder-Encoder model, optimized for semantic AD. Both models leverage a shared pre-trained encoder (DINOv2) to extract high-quality feature representations. The dual models are jointly learned using the Noisy-OR objective, and the final anomaly score is obtained using the joint probability via local and semantic anomaly scores derived from the respective models. We evaluate our method on eight public benchmarks under both single-class and multi-class settings: MVTec-AD, MVTec-LOCO, VisA and Real-IAD for industrial inspection and CIFAR-10/100, FMNIST and View for semantic anomaly detection. The proposed method achieved state-of-the-art accuracies in both domains, in multi-class as well as single-class settings, demonstrating generalization across multiple domains of anomaly detection. Our model achieved an image-level AUROC of 99.7% on MVTec-AD and 97.8% on CIFAR-10, which is significantly better than the prior general AD models in multi-class settings and even higher than the best specialist models on individual benchmarks.
Abstract（参考訳）: 異常検出(AD)は、様々な現実世界の応用において重要な役割を果たす。しかし、ADの最近の進歩は、しばしば産業検査に偏り、意味的異常検出のようなより広範なタスクに一般化するのに苦労する。最近の手法は一般的な異常検出に対処しようとするが、その性能はデータセット固有の設定や単一クラスタスクに敏感である。本稿では,このギャップを埋めるため,知識蒸留(KD)に基づく新しい二重モデルアンサンブル手法を提案する。我々のフレームワークは教師と学生の2つのモデルで構成されている。Encoder-Decoderモデルは、産業用ADのパッチレベルのマイナー欠陥の検出に特化しており、Encoder-Encoderモデルは意味的ADに最適化されている。どちらのモデルも共有事前訓練エンコーダ(DINOv2)を利用して高品質な特徴表現を抽出する。両モデルはノイズオーダの目的を用いて共同学習し,各モデルから得られた局所的および意味的異常スコアを用いて最終異常スコアを求める。 MVTec-AD, MVTec-LOCO, VisA, Real-IADを産業検査用, CIFAR-10/100, FMNIST, Viewを意味異常検出用とした。提案手法は,複数の領域にまたがる異常検出の一般化を実証し,両領域の最先端の精度をマルチクラスおよびシングルクラス設定で達成した。画像レベルのAUROCはMVTec-ADで99.7%,CIFAR-10で97.8%,マルチクラス環境では従来の一般的なADモデルよりも大幅に向上し,個々のベンチマークで最高のスペシャリストモデルよりも高くなった。

関連論文リスト

Learning Multi-view Multi-class Anomaly Detection [10.199404082194947]
MVMCAD(Multi-View Multi-Class Anomaly Detection Model)を導入し、複数のビューからの情報を統合して異常を正確に識別する。具体的には、凍結エンコーダの前にプリエンコーダの事前拡張機構を追加する半凍結エンコーダを提案する。 AAM(Anomaly Amplification Module)は、グローバルトークンのインタラクションをモデル化し、通常のリージョンを抑圧する。
論文参考訳（メタデータ） (2025-04-30T03:59:58Z)
Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark [101.23684938489413]
異常検出(AD)は、しばしば産業品質検査や医学的病変検査のための異常の検出に焦点が当てられている。この研究はまず、COCOをADフィールドに拡張することにより、大規模で汎用的なCOCO-ADデータセットを構築する。セグメンテーション分野のメトリクスにインスパイアされた我々は、より実用的なしきい値に依存したAD固有のメトリクスをいくつか提案する。
論文参考訳（メタデータ） (2024-04-16T17:38:26Z)
Absolute-Unified Multi-Class Anomaly Detection via Class-Agnostic Distribution Alignment [27.375917265177847]
教師なし異常検出(UAD)メソッドは、各オブジェクトカテゴリごとに別々のモデルを構築する。近年の研究では、複数のクラス、すなわちモデル統一 UAD に対する統一モデルのトレーニングが提案されている。我々は,クラス情報,すなわちtextitabsolute-unified UADを使わずに,マルチクラス異常検出に対処する,シンプルかつ強力な手法を提案する。
論文参考訳（メタデータ） (2024-03-31T15:50:52Z)
Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference [67.36605226797887]
統一型異常検出(MINT-AD)のためのマルチクラスインプリシトニューラル表現変換器を提案する。マルチクラス分布を学習することにより、モデルが変換器デコーダのクラス対応クエリ埋め込みを生成する。 MINT-ADは、カテゴリと位置情報を特徴埋め込み空間に投影することができ、さらに分類と事前確率損失関数によって監督される。
論文参考訳（メタデータ） (2024-03-21T08:08:31Z)
Open-Vocabulary Video Anomaly Detection [57.552523669351636]
監視の弱いビデオ異常検出(VAD)は、ビデオフレームが正常であるか異常であるかを識別するためにビデオレベルラベルを利用する際、顕著な性能を達成した。近年の研究は、より現実的な、オープンセットのVADに取り組み、異常や正常なビデオから見えない異常を検出することを目的としている。本稿ではさらに一歩前進し、未確認および未確認の異常を検知・分類するために訓練済みの大規模モデルを活用することを目的とした、オープン語彙ビデオ異常検出(OVVAD)について検討する。
論文参考訳（メタデータ） (2023-11-13T02:54:17Z)
Anomaly Detection via Multi-Scale Contrasted Memory [3.0170109896527086]
マルチスケールの標準プロトタイプをトレーニング中に記憶し,異常偏差値を計算する2段階の異常検出器を新たに導入する。 CIFAR-10の誤差相対改善率を最大35%とすることにより,多種多様なオブジェクト,スタイル,局所異常に対する最先端性能を高い精度で向上させる。
論文参考訳（メタデータ） (2022-11-16T16:58:04Z)
Unsupervised Anomaly Detection with Adversarial Mirrored AutoEncoders [51.691585766702744]
本稿では,識別器のミラー化ワッサースタイン損失を利用して,よりセマンティックレベルの再構築を行う逆自動エンコーダの変種を提案する。我々は,再建基準の代替として,異常スコアの代替尺度を提案した。提案手法は,OOD検出ベンチマークにおける異常検出の最先端手法よりも優れている。
論文参考訳（メタデータ） (2020-03-24T08:26:58Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。