DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly
Detection
- URL: http://arxiv.org/abs/2211.11317v2
- Date: Tue, 21 Mar 2023 09:18:20 GMT
- Title: DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly
Detection
- Authors: Xuan Zhang, Shiyu Li, Xi Li, Ping Huang, Jiulong Shan, Ting Chen
- Abstract summary: We propose an improved model called DeSTSeg, which integrates a pre-trained teacher network, a denoising student encoder-decoder, and a segmentation network into one framework.
Our method achieves state-of-the-art performance, 98.6% on image-level AUC, 75.8% on pixel-level average precision, and 76.4% on instance-level average precision.
- Score: 18.95747313320397
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Visual anomaly detection, an important problem in computer vision, is usually
formulated as a one-class classification and segmentation task. The
student-teacher (S-T) framework has proved to be effective in solving this
challenge. However, previous works based on S-T only empirically applied
constraints on normal data and fused multi-level information. In this study, we
propose an improved model called DeSTSeg, which integrates a pre-trained
teacher network, a denoising student encoder-decoder, and a segmentation
network into one framework. First, to strengthen the constraints on anomalous
data, we introduce a denoising procedure that allows the student network to
learn more robust representations. From synthetically corrupted normal images,
we train the student network to match the teacher network feature of the same
images without corruption. Second, to fuse the multi-level S-T features
adaptively, we train a segmentation network with rich supervision from
synthetic anomaly masks, achieving a substantial performance improvement.
Experiments on the industrial inspection benchmark dataset demonstrate that our
method achieves state-of-the-art performance, 98.6% on image-level AUC, 75.8%
on pixel-level average precision, and 76.4% on instance-level average
precision.
Related papers
- Teacher Encoder-Student Decoder Denoising Guided Segmentation Network for Anomaly Detection [15.545036112870841]
We propose a novel model named PFADSeg, which integrates a pre-trained teacher network, a denoising student network with multi-scale feature fusion, and a guided anomaly segmentation network into a unified framework.
evaluated on the MVTec AD dataset, PFADSeg achieves state-of-the-art results with an image-level AUC of 98.9%, a pixel-level mean precision of 76.4%, and an instance-level mean precision of 78.7%.
arXiv Detail & Related papers (2025-01-21T12:55:04Z) - Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention [59.19580789952102]
This paper proposes a novel semi-supervised Multi-Scale Uncertainty and Cross-Teacher-Student Attention (MUCA) model for RS image semantic segmentation tasks.
MUCA constrains the consistency among feature maps at different layers of the network by introducing a multi-scale uncertainty consistency regularization.
MUCA utilizes a Cross-Teacher-Student attention mechanism to guide the student network, guiding the student network to construct more discriminative feature representations.
arXiv Detail & Related papers (2025-01-18T11:57:20Z) - CUCL: Codebook for Unsupervised Continual Learning [129.91731617718781]
The focus of this study is on Unsupervised Continual Learning (UCL), as it presents an alternative to Supervised Continual Learning.
We propose a method named Codebook for Unsupervised Continual Learning (CUCL) which promotes the model to learn discriminative features to complete the class boundary.
Our method significantly boosts the performances of supervised and unsupervised methods.
arXiv Detail & Related papers (2023-11-25T03:08:50Z) - Self-Supervised Versus Supervised Training for Segmentation of Organoid
Images [2.6242820867975127]
Large amounts of microscopic image data sets remain unlabeled, preventing their effective exploitation using deep-learning algorithms.
Self-supervised learning (SSL) is a promising solution based on learning intrinsic features under a pretext task that is similar to the main task without requiring labels.
A ResNet50 U-Net was first trained to restore images of liver progenitor organoids from augmented images using the Structural Similarity Index Metric (SSIM), alone, and using SSIM combined with L1 loss.
For comparison, we used the same U-Net architecture to train two supervised models, one utilizing the ResNet50 encoder
arXiv Detail & Related papers (2023-11-19T01:57:55Z) - Prompt Tuning for Parameter-efficient Medical Image Segmentation [79.09285179181225]
We propose and investigate several contributions to achieve a parameter-efficient but effective adaptation for semantic segmentation on two medical imaging datasets.
We pre-train this architecture with a dedicated dense self-supervision scheme based on assignments to online generated prototypes.
We demonstrate that the resulting neural network model is able to attenuate the gap between fully fine-tuned and parameter-efficiently adapted models.
arXiv Detail & Related papers (2022-11-16T21:55:05Z) - Deep Semantic Statistics Matching (D2SM) Denoising Network [70.01091467628068]
We introduce the Deep Semantic Statistics Matching (D2SM) Denoising Network.
It exploits semantic features of pretrained classification networks, then it implicitly matches the probabilistic distribution of clear images at the semantic feature space.
By learning to preserve the semantic distribution of denoised images, we empirically find our method significantly improves the denoising capabilities of networks.
arXiv Detail & Related papers (2022-07-19T14:35:42Z) - Graph Consistency based Mean-Teaching for Unsupervised Domain Adaptive
Person Re-Identification [54.58165777717885]
This paper proposes a Graph Consistency based Mean-Teaching (GCMT) method with constructing the Graph Consistency Constraint (GCC) between teacher and student networks.
Experiments on three datasets, i.e., Market-1501, DukeMTMCreID, and MSMT17, show that proposed GCMT outperforms state-of-the-art methods by clear margin.
arXiv Detail & Related papers (2021-05-11T04:09:49Z) - Robust Self-Ensembling Network for Hyperspectral Image Classification [38.84831094095329]
We propose a robust self-ensembling network (RSEN) to address this problem.
The proposed RSEN consists of twoworks including a base network and an ensemble network.
We show that the proposed algorithm can yield competitive performance compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-04-08T13:33:14Z) - Hyperspectral Denoising Using Unsupervised Disentangled Spatio-Spectral
Deep Priors [10.65207459525818]
In recent years, data-driven neural network priors have shown promising performance for RGB natural image denoising.
Data-driven priors are hard to acquire for hyperspectral images due to the lack of training data.
This work puts forth an unsupervised DIP framework that is based on the classic-spectral decomposition of HSIs.
arXiv Detail & Related papers (2021-02-24T14:38:51Z) - Pairwise Relation Learning for Semi-supervised Gland Segmentation [90.45303394358493]
We propose a pairwise relation-based semi-supervised (PRS2) model for gland segmentation on histology images.
This model consists of a segmentation network (S-Net) and a pairwise relation network (PR-Net)
We evaluate our model against five recent methods on the GlaS dataset and three recent methods on the CRAG dataset.
arXiv Detail & Related papers (2020-08-06T15:02:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.