Student-Teacher Feature Pyramid Matching for Unsupervised Anomaly
Detection
- URL: http://arxiv.org/abs/2103.04257v1
- Date: Sun, 7 Mar 2021 04:25:04 GMT
- Title: Student-Teacher Feature Pyramid Matching for Unsupervised Anomaly
Detection
- Authors: Guodong Wang, Shumin Han, Errui Ding, Di Huang
- Abstract summary: Anomaly detection is a challenging task and usually formulated as an unsupervised learning problem for the unexpectedness of anomalies.
This paper proposes a simple yet powerful approach to this issue, which is implemented in the student-teacher framework.
Very competitive results are delivered on three major benchmarks, significantly superior to the state of the art ones.
- Score: 38.523251117024984
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Anomaly detection is a challenging task and usually formulated as an
unsupervised learning problem for the unexpectedness of anomalies. This paper
proposes a simple yet powerful approach to this issue, which is implemented in
the student-teacher framework for its advantages but substantially extends it
in terms of both accuracy and efficiency. Given a strong model pre-trained on
image classification as the teacher, we distill the knowledge into a single
student network with the identical architecture to learn the distribution of
anomaly-free images and this one-step transfer preserves the crucial clues as
much as possible. Moreover, we integrate the multi-scale feature matching
strategy into the framework, and this hierarchical feature alignment enables
the student network to receive a mixture of multi-level knowledge from the
feature pyramid under better supervision, thus allowing to detect anomalies of
various sizes. The difference between feature pyramids generated by the two
networks serves as a scoring function indicating the probability of anomaly
occurring. Due to such operations, our approach achieves accurate and fast
pixel-level anomaly detection. Very competitive results are delivered on three
major benchmarks, significantly superior to the state of the art ones. In
addition, it makes inferences at a very high speed (with 100 FPS for images of
the size at 256x256), at least dozens of times faster than the latest
counterparts.
Related papers
- Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention [59.19580789952102]
This paper proposes a novel semi-supervised Multi-Scale Uncertainty and Cross-Teacher-Student Attention (MUCA) model for RS image semantic segmentation tasks.
MUCA constrains the consistency among feature maps at different layers of the network by introducing a multi-scale uncertainty consistency regularization.
MUCA utilizes a Cross-Teacher-Student attention mechanism to guide the student network, guiding the student network to construct more discriminative feature representations.
arXiv Detail & Related papers (2025-01-18T11:57:20Z) - Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection [4.0679780034913335]
A knowledge-distillation based multi-class anomaly detection promises a low latency with a reasonably good performance but with a significant drop as compared to one-class version.
We propose a DCAM (Distributed Convolutional Attention Module) which improves the distillation process between teacher and student networks.
arXiv Detail & Related papers (2024-05-10T13:25:39Z) - Dual-Student Knowledge Distillation Networks for Unsupervised Anomaly
Detection [2.06682776181122]
Student-teacher networks (S-T) are favored in unsupervised anomaly detection.
However, vanilla S-T networks are not stable.
We propose a novel dual-student knowledge distillation architecture.
arXiv Detail & Related papers (2024-02-01T09:32:39Z) - Prior Knowledge Guided Network for Video Anomaly Detection [1.389970629097429]
Video Anomaly Detection (VAD) involves detecting anomalous events in videos.
We propose a Prior Knowledge Guided Network(PKG-Net) for the VAD task.
arXiv Detail & Related papers (2023-09-04T15:57:07Z) - MixedTeacher : Knowledge Distillation for fast inference textural
anomaly detection [4.243356707599485]
unsupervised learning for anomaly detection has been at the heart of image processing research.
We propose a new method based on the promising concept of knowledge distillation.
The proposed texture anomaly detector has an outstanding capability to detect defects in any texture and a fast inference time compared to the SOTA methods.
arXiv Detail & Related papers (2023-06-16T14:14:20Z) - EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level
Latencies [1.1602089225841632]
We propose a lightweight feature extractor that processes an image in less than a millisecond on a modern GPU.
We then use a student-teacher approach to detect anomalous features.
We evaluate our method, called EfficientAD, on 32 datasets from three industrial anomaly detection dataset collections.
arXiv Detail & Related papers (2023-03-25T18:48:33Z) - Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw
Puzzles [67.39567701983357]
Video Anomaly Detection (VAD) is an important topic in computer vision.
Motivated by the recent advances in self-supervised learning, this paper addresses VAD by solving an intuitive yet challenging pretext task.
Our method outperforms state-of-the-art counterparts on three public benchmarks.
arXiv Detail & Related papers (2022-07-20T19:49:32Z) - Warp Consistency for Unsupervised Learning of Dense Correspondences [116.56251250853488]
Key challenge in learning dense correspondences is lack of ground-truth matches for real image pairs.
We propose Warp Consistency, an unsupervised learning objective for dense correspondence regression.
Our approach sets a new state-of-the-art on several challenging benchmarks, including MegaDepth, RobotCar and TSS.
arXiv Detail & Related papers (2021-04-07T17:58:22Z) - Multi-attentional Deepfake Detection [79.80308897734491]
Face forgery by deepfake is widely spread over the internet and has raised severe societal concerns.
We propose a new multi-attentional deepfake detection network. Specifically, it consists of three key components: 1) multiple spatial attention heads to make the network attend to different local parts; 2) textural feature enhancement block to zoom in the subtle artifacts in shallow features; 3) aggregate the low-level textural feature and high-level semantic features guided by the attention maps.
arXiv Detail & Related papers (2021-03-03T13:56:14Z) - Recurrent Multi-view Alignment Network for Unsupervised Surface
Registration [79.72086524370819]
Learning non-rigid registration in an end-to-end manner is challenging due to the inherent high degrees of freedom and the lack of labeled training data.
We propose to represent the non-rigid transformation with a point-wise combination of several rigid transformations.
We also introduce a differentiable loss function that measures the 3D shape similarity on the projected multi-view 2D depth images.
arXiv Detail & Related papers (2020-11-24T14:22:42Z) - Anomaly Detection in Video via Self-Supervised and Multi-Task Learning [113.81927544121625]
Anomaly detection in video is a challenging computer vision problem.
In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level.
arXiv Detail & Related papers (2020-11-15T10:21:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.