Target before Shooting: Accurate Anomaly Detection and Localization
under One Millisecond via Cascade Patch Retrieval
- URL: http://arxiv.org/abs/2308.06748v1
- Date: Sun, 13 Aug 2023 11:49:05 GMT
- Title: Target before Shooting: Accurate Anomaly Detection and Localization
under One Millisecond via Cascade Patch Retrieval
- Authors: Hanxi Li, Jianfei Hu, Bo Li, Hao Chen, Yongbin Zheng, Chunhua Shen
- Abstract summary: We re-examine the "matching" nature of Anomaly Detection (AD)
We propose a new AD framework that simultaneously enjoys new records of AD accuracy and dramatically high running speed.
- Score: 49.45246833329707
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, by re-examining the "matching" nature of Anomaly Detection
(AD), we propose a new AD framework that simultaneously enjoys new records of
AD accuracy and dramatically high running speed. In this framework, the anomaly
detection problem is solved via a cascade patch retrieval procedure that
retrieves the nearest neighbors for each test image patch in a coarse-to-fine
fashion. Given a test sample, the top-K most similar training images are first
selected based on a robust histogram matching process. Secondly, the nearest
neighbor of each test patch is retrieved over the similar geometrical locations
on those "global nearest neighbors", by using a carefully trained local metric.
Finally, the anomaly score of each test image patch is calculated based on the
distance to its "local nearest neighbor" and the "non-background" probability.
The proposed method is termed "Cascade Patch Retrieval" (CPR) in this work.
Different from the conventional patch-matching-based AD algorithms, CPR selects
proper "targets" (reference images and locations) before "shooting"
(patch-matching). On the well-acknowledged MVTec AD, BTAD and MVTec-3D AD
datasets, the proposed algorithm consistently outperforms all the comparing
SOTA methods by remarkable margins, measured by various AD metrics.
Furthermore, CPR is extremely efficient. It runs at the speed of 113 FPS with
the standard setting while its simplified version only requires less than 1 ms
to process an image at the cost of a trivial accuracy drop. The code of CPR is
available at https://github.com/flyinghu123/CPR.
Related papers
- Graspness Discovery in Clutters for Fast and Accurate Grasp Detection [57.81325062171676]
"graspness" is a quality based on geometry cues that distinguishes graspable areas in cluttered scenes.
We develop a neural network named cascaded graspness model to approximate the searching process.
Experiments on a large-scale benchmark, GraspNet-1Billion, show that our method outperforms previous arts by a large margin.
arXiv Detail & Related papers (2024-06-17T02:06:47Z) - Patch-based Selection and Refinement for Early Object Detection [20.838511460733038]
We propose a novel set of algorithms that divide the image into patches, select patches with objects at various scales, elaborate the details of a small object, and detect it as early as possible.
Our approach is built upon a transformer-based network and integrates the diffusion model to improve the detection accuracy.
Our algorithms enhance the mAP for small objects from 1.03 to 8.93, and reduce the data volume in computation by more than 77%.
arXiv Detail & Related papers (2023-11-03T23:41:13Z) - A Hierarchical Descriptor Framework for On-the-Fly Anatomical Location
Matching between Longitudinal Studies [0.07499722271664144]
We propose a method to match anatomical locations between pairs of medical images in longitudinal comparisons.
The matching is made possible by computing a descriptor of the query point in a source image.
A hierarchical search operation finds the corresponding point with the most similar descriptor in the target image.
arXiv Detail & Related papers (2023-08-11T18:01:27Z) - 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration
between Images and Point Clouds [38.425876064671435]
We propose 2D3D-MATR, a detection-free method for accurate and robust registration between images and point clouds.
Our method adopts a coarse-to-fine pipeline where it first computes coarse correspondences between downsampled patches of the input image and the point cloud.
To resolve the scale ambiguity in patch matching, we construct a multi-scale pyramid for each image patch and learn to find for each point patch the best matching image patch at a proper resolution level.
arXiv Detail & Related papers (2023-08-10T16:10:54Z) - Stagnation Detection in Highly Multimodal Fitness Landscapes [0.0]
Stagnation detection has been proposed as a mechanism for randomized searchs to escape from local optima.
In this paper, we investigate a new mechanism called radius memory which can be added to stagnation detection to control the search radius more carefully.
We implement this idea in an algorithm called SD-RLS$textm$ and show compared to previous variants of stagnation detection that it yields speed-ups.
arXiv Detail & Related papers (2021-04-09T14:33:52Z) - DetCo: Unsupervised Contrastive Learning for Object Detection [64.22416613061888]
Unsupervised contrastive learning achieves great success in learning image representations with CNN.
We present a novel contrastive learning approach, named DetCo, which fully explores the contrasts between global image and local image patches.
DetCo consistently outperforms supervised method by 1.6/1.2/1.0 AP on Mask RCNN-C4/FPN/RetinaNet with 1x schedule.
arXiv Detail & Related papers (2021-02-09T12:47:20Z) - Image Denoising by Gaussian Patch Mixture Model and Low Rank Patches [0.0]
A new method is proposed by solving two issues: how to improve similar patches matching accuracy and build an appropriate low rank matrix approximation model for Gaussian noise.
By solving the two issues, experimental results are reported to show that the proposed approach outperforms the state-of-the-art denoising methods.
arXiv Detail & Related papers (2020-11-20T09:28:04Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - Making Affine Correspondences Work in Camera Geometry Computation [62.7633180470428]
Local features provide region-to-region rather than point-to-point correspondences.
We propose guidelines for effective use of region-to-region matches in the course of a full model estimation pipeline.
Experiments show that affine solvers can achieve accuracy comparable to point-based solvers at faster run-times.
arXiv Detail & Related papers (2020-07-20T12:07:48Z) - Frustratingly Simple Few-Shot Object Detection [98.42824677627581]
We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task.
Such a simple approach outperforms the meta-learning methods by roughly 220 points on current benchmarks.
arXiv Detail & Related papers (2020-03-16T00:29:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.