Fugu-MT 論文翻訳(概要): GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module

論文の概要: GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module

arxiv url: http://arxiv.org/abs/2603.07566v1
Date: Sun, 08 Mar 2026 10:02:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:14.813762
Title: GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module
Title（参考訳）: GRD-Net:関心注意モジュールの領域を用いた生成・再構成・識別異常検出
Authors: Niccolò Ferrari, Michele Fraccaroli, Evelina Lamma,
Abstract要約: 2ブロックからなる新しいアーキテクチャを提案する。最初のブロックはGAN(Generative Adversarial Network)であり、残余のオートエンコーダ(ResAE)に基づいて再構成および復調処理を行う。 2番目のブロックは画像のセグメンテーションを生成し、欠陥を発見できる。
参考スコア（独自算出の注目度）: 0.0509780930114934
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Anomaly detection is nowadays increasingly used in industrial applications and processes. One of the main fields of the appliance is the visual inspection for surface anomaly detection, which aims to spot regions that deviate from regularity and consequently identify abnormal products. Defect localization is a key task, that usually is achieved using a basic comparison between generated image and the original one, implementing some blob-analysis or image-editing algorithms, in the post-processing step, which is very biased towards the source dataset, and they are unable to generalize. Furthermore, in industrial applications, the totality of the image is not always interesting but could be one or some regions of interest (ROIs), where only in those areas there are relevant anomalies to be spotted. For these reasons, we propose a new architecture composed by two blocks. The first block is a Generative Adversarial Network (GAN), based on a residual autoencoder (ResAE), to perform reconstruction and denoising processes, while the second block produces image segmentation, spotting defects. This method learns from a dataset composed of good products and generated synthetic defects. The discriminative network is trained using a ROI for each image contained in the training dataset. The network will learn in which area anomalies are relevant. This approach guarantees the reduction of using pre-processing algorithms, formerly developed with blob-analysis and image-editing procedures. To test our model we used challenging MVTec anomaly detection datasets and an industrial large dataset of pharmaceutical BFS strips of vials. This set constitutes a more realistic use case of the aforementioned network.
Abstract（参考訳）: 異常検出は、近年、産業応用やプロセスでますます使われている。装置の主な分野の1つは、表面異常検出のための視覚検査であり、これは規則性から逸脱する領域を特定し、結果として異常品を特定することを目的としている。欠陥ローカライゼーションは重要なタスクであり、通常は生成された画像と元の画像の基本的な比較を使って達成され、いくつかのブロブ分析や画像編集アルゴリズムを実装します。さらに、産業アプリケーションでは、画像の総量は必ずしも面白くないが、1つまたは一部の関心領域(ROI)になり得る。これらの理由から,2ブロックからなる新しいアーキテクチャを提案する。第1ブロックは残留オートエンコーダ(ResAE)に基づくGAN(Generative Adversarial Network)であり、再構成および復調処理を行う。この方法は、優れた製品と生成された合成欠陥からなるデータセットから学習する。識別ネットワークは、トレーニングデータセットに含まれる各画像のROIを使用してトレーニングされる。ネットワークはどの領域の異常が関係しているかを学習する。このアプローチは、以前はブロブ分析と画像編集の手順で開発された前処理アルゴリズムの使用を減らすことを保証する。我々のモデルをテストするために、挑戦的なMVTec異常検出データセットと工業用BFSスニペットのBFSスニペットの大規模データセットを使用した。このセットは前述のネットワークのより現実的なユースケースを構成する。

論文の概要: GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module

関連論文リスト