Fugu-MT 論文翻訳(概要): InCTRLv2: Generalist Residual Models for Few-Shot Anomaly Detection and Segmentation

論文の概要: InCTRLv2: Generalist Residual Models for Few-Shot Anomaly Detection and Segmentation

arxiv url: http://arxiv.org/abs/2604.04632v1
Date: Mon, 06 Apr 2026 12:30:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:19.187189
Title: InCTRLv2: Generalist Residual Models for Few-Shot Anomaly Detection and Segmentation
Title（参考訳）: InCTRLv2:Few-Shot異常検出とセグメンテーションのための一般残留モデル
Authors: Jiawen Zhu, Mengjia Niu, Guansong Pang,
Abstract要約: InCTRLv2は、二重ブランチフレームワークの下で、異常知覚の2つの新しい視点を導入している。識別異常スコア学習(DASL)と1クラス異常スコア学習(OASL)の2つの新しいモジュールによって実現される。 10個のADデータセットの実験により、InCTRLv2は、さまざまな設定にわたる異常検出とセグメンテーションタスクの両方において、SotAのパフォーマンスを達成することが示された。
参考スコア（独自算出の注目度）: 40.431502791639936
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While recent anomaly detection (AD) methods have made substantial progress in recognizing abnormal patterns within specific domains, most of them are specialist models that are trained on large training samples from a specific target dataset, struggling to generalize to unseen datasets. To address this limitation, the paradigm of Generalist Anomaly Detection (GAD) has emerged in recent years, aiming to learn a single generalist model to detect anomalies across diverse domains without retraining. To this end, this work introduces InCTRLv2, a novel few-shot Generalist Anomaly Detection and Segmentation (GADS) framework that significantly extends our previously proposed GAD model, InCTRL. Building on the idea of learning in-context residuals with few-shot normal examples to detect anomalies as in InCTRL, InCTRLv2 introduces two new, complementary perspectives of anomaly perception under a dual-branch framework. This is accomplished by two novel modules upon InCTRL: i) Discriminative Anomaly Score Learning (DASL) with both normal and abnormal data in the main branch, which learns a semantic-guided abnormality and normality space that supports the classification of query samples from both the abnormality and normality perspectives; and ii) One-class Anomaly Score Learning (OASL) using only the normal data, which learns generalized normality patterns in a semantic space via an auxiliary branch, focusing on detecting anomalies through the lens of normality solely. Both branches are guided by rich visual-text semantic priors encoded by large-scale vision-language models. Together, they offer a dual semantic perspective for AD: one emphasizes normal-abnormal discriminations, while the other emphasizes normality-deviated semantics. Extensive experiments on ten AD datasets demonstrate that InCTRLv2 achieves SotA performance in both anomaly detection and segmentation tasks across various settings.
Abstract（参考訳）: 最近の異常検出(AD)法は、特定のドメイン内の異常パターンを認識するためにかなりの進歩を遂げているが、そのほとんどは、特定のターゲットデータセットからの大規模なトレーニングサンプルに基づいて訓練され、目に見えないデータセットへの一般化に苦慮しているスペシャリストモデルである。この制限に対処するために、GAD(Generalist Anomaly Detection)のパラダイムが近年出現し、複数のドメインにまたがる異常をリトレーニングすることなく検出する1つのジェネラリストモデルを学ぶことを目指している。そこで本研究では,GADS(Generalist Anomaly Detection and Segmentation)フレームワークであるInCTRL2を紹介する。 InCTRLのように、異常を検出するために、ごく普通の例で文脈内残差を学習するというアイデアに基づいて、InCTRLv2は、二重ブランチフレームワークの下での異常知覚を補完する2つの新しい視点を導入する。これはInCTRL上の2つの新しいモジュールによって達成される。一メインブランチにおける正常データと異常データの両方を有する識別的異常スコア学習(DASL)であって、異常データと正常データの両方の観点からクエリサンプルの分類を支援する意味誘導異常と正常性空間を学習すること。二正規データのみを用いて一級異常スコア学習(OASL)を行い、補助枝を介して意味空間における一般化正規性パターンを学習し、正規性のレンズを通して異常を検出することのみに焦点を当てた。両方のブランチは、大規模な視覚言語モデルでエンコードされたリッチなビジュアルテキストセマンティクスによってガイドされる。両者はADの2つの意味論的な視点を提供し、一方は正常な非正規な差別を強調し、もう一方は正常さを逸脱した意味論を強調している。 10個のADデータセットに対する大規模な実験により、InCTRLv2は様々な設定で異常検出とセグメンテーションタスクの両方でSotA性能を達成することが示された。

論文の概要: InCTRLv2: Generalist Residual Models for Few-Shot Anomaly Detection and Segmentation

関連論文リスト