Related papers: High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis

URL: http://arxiv.org/abs/2304.06433v2
Date: Mon, 4 Dec 2023 15:07:49 GMT
Title: High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis
Authors: Andrei-Timotei Ardelean and Tim Weyrich
Abstract summary: We propose a novel method for Zero-Shot Anomaly Localization on textures. The task refers to identifying abnormal regions in an otherwise homogeneous image. As opposed to using holistic distances between distributions, the proposed approach allows pinpointing the non-conformity of a pixel in a local context. We validate our solution on several datasets and obtain more than a 40% reduction in error over the previous state of the art on the MVTec AD dataset.
Score: 3.085407950646415
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We propose a novel method for Zero-Shot Anomaly Localization on textures. The task refers to identifying abnormal regions in an otherwise homogeneous image. To obtain a high-fidelity localization, we leverage a bijective mapping derived from the 1-dimensional Wasserstein Distance. As opposed to using holistic distances between distributions, the proposed approach allows pinpointing the non-conformity of a pixel in a local context with increased precision. By aggregating the contribution of the pixel to the errors of all nearby patches we obtain a reliable anomaly score estimate. We validate our solution on several datasets and obtain more than a 40% reduction in error over the previous state of the art on the MVTec AD dataset in a zero-shot setting. Also see https://reality.tf.fau.de/pub/ardelean2024highfidelity.html.
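The bijective mapping mentioned in the abstract can be illustrated on plain numbers. For two equal-sized 1-D samples, pairing values of the same rank is an optimal transport plan, so the mean per-element cost equals the 1-D Wasserstein-1 distance, and each element's cost shows how far that element deviates from the reference. The following is a minimal sketch of that general idea only, not the authors' implementation; the function name `rank_matching` and the sample values are hypothetical.

```python
def rank_matching(source, target):
    """Pair each source value with the target value of the same rank.

    For two equal-sized 1-D samples, this sorted pairing is an optimal
    transport plan, so the mean of the per-element costs is the
    1-D Wasserstein-1 distance between the samples.
    """
    if len(source) != len(target):
        raise ValueError("samples must have the same size")
    # Indices of the source values, ordered by ascending value.
    order = sorted(range(len(source)), key=lambda i: source[i])
    sorted_target = sorted(target)
    matched = [0.0] * len(source)
    for rank, i in enumerate(order):
        matched[i] = sorted_target[rank]
    # Per-element transport cost: how far each source value must move.
    costs = [abs(s - m) for s, m in zip(source, matched)]
    return matched, costs


# Hypothetical 1-D feature values; the second entry is the outlier.
patch = [0.1, 0.9, 0.2, 0.15]
reference = [0.12, 0.18, 0.1, 0.2]

matched, costs = rank_matching(patch, reference)
distance = sum(costs) / len(costs)  # 1-D Wasserstein-1 distance
most_anomalous = costs.index(max(costs))
```

Because the pairing is one-to-one, the total distance decomposes into per-element terms, which is what makes it possible to attribute the discrepancy to individual elements rather than to the sample as a whole.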

Related papers

Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection [6.344680297236473]
This work focuses on the problem of detecting and localizing anomalies in textures. We propose a real-time method, named QFCA, which implements a quantized version of the feature correspondence analysis (FCA) algorithm. By carefully adapting the patch statistics comparison to work on histograms of quantized values, we obtain a 10x speedup with little to no loss in accuracy.
arXiv Detail & Related papers (2025-10-17T12:48:59Z)
A Single Image Is All You Need: Zero-Shot Anomaly Localization Without Training Data [4.861045498353029]
Anomaly detection in images is typically addressed by learning from collections of training data or relying on reference samples. We propose a single-image anomaly localization method that leverages the inductive bias of convolutional neural networks. Our method is named Single Shot Decomposition Network (SSDnet).
arXiv Detail & Related papers (2025-09-22T19:29:20Z)
ZeroStereo: Zero-shot Stereo Matching from Single Images [17.560148513475387]
We propose ZeroStereo, a novel stereo image generation pipeline for zero-shot stereo matching. Our approach synthesizes high-quality right images by leveraging pseudo disparities generated by a monocular depth estimation model. Our pipeline achieves state-of-the-art zero-shot generalization across multiple datasets with only a dataset volume comparable to Scene Flow.
arXiv Detail & Related papers (2025-01-15T08:43:48Z)
Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement [7.302792947244082]
In this work, we propose a novel method that shifts the focus from a deterministic pixel-by-pixel comparison to a statistical perspective. The core idea is to introduce spatial entropy into the loss function to measure the distribution difference between predictions and targets. Specifically, we equip the entropy with diffusion models and aim for superior accuracy and enhanced perceptual quality over an l1-based noise matching loss.
arXiv Detail & Related papers (2024-04-15T12:35:10Z)
DVMNet++: Rethinking Relative Pose Estimation for Unseen Objects [59.51874686414509]
Existing approaches typically predict 3D translation utilizing the ground-truth object bounding box and approximate 3D rotation with a large number of discrete hypotheses. We present a Deep Voxel Matching Network (DVMNet++) that computes the relative object pose in a single pass. Our approach delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods.
arXiv Detail & Related papers (2024-03-20T15:41:32Z)
Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields [52.11475771410058]
Recent approaches to arbitrary-scale single image super-resolution (ASR) use neural fields to represent continuous signals that can be sampled at arbitrary resolutions. Existing methods attempt to mitigate this by approximating an integral version of the field at each scaling factor, compromising both fidelity and generalization. We introduce neural heat fields, a novel neural field formulation that inherently models a physically exact PSF. Our formulation enables analytically correct anti-aliasing at any desired output resolution, and -- unlike supersampling -- at no additional cost.
arXiv Detail & Related papers (2023-11-29T14:01:28Z)
Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame. We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers. We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z)
Convolutional Cross-View Pose Estimation [9.599356978682108]
We propose a novel end-to-end method for cross-view pose estimation. Our method is validated on the VIGOR and KITTI datasets. On the Oxford RobotCar dataset, our method can reliably estimate the ego-vehicle's pose over time.
arXiv Detail & Related papers (2023-03-09T13:52:28Z)
PNI : Industrial Anomaly Detection using Position and Neighborhood Information [6.316693022958221]
We propose a new algorithm, PNI, which estimates the normal distribution using conditional probability given neighborhood features. We conducted experiments on the MVTec AD benchmark dataset and achieved state-of-the-art performance, with 99.56% and 98.98% AUROC scores in anomaly detection and localization, respectively.
arXiv Detail & Related papers (2022-11-22T23:45:27Z)
CroCo: Cross-Modal Contrastive learning for localization of Earth Observation data [62.96337162094726]
It is of interest to localize a ground-based LiDAR point cloud on remote sensing imagery. We propose a contrastive learning-based method that trains on DEM and high-resolution optical imagery. In the best scenario, a Top-1 score of 0.71 and a Top-5 score of 0.81 are obtained.
arXiv Detail & Related papers (2022-04-14T15:55:00Z)
Region-aware Attention for Image Inpainting [33.22497212024083]
We propose a novel region-aware attention (RA) module for inpainting images. By avoiding directly calculating the correlation between each pixel pair in a single sample, the misleading influence of invalid information in holes can be avoided. A learnable region dictionary (LRD) is introduced to store important information in the entire dataset. Our methods can generate semantically plausible results with realistic details.
arXiv Detail & Related papers (2022-04-03T06:26:22Z)
Global and Local Alignment Networks for Unpaired Image-to-Image Translation [170.08142745705575]
The goal of unpaired image-to-image translation is to produce an output image reflecting the target domain's style. Due to the lack of attention to the content change in existing methods, semantic information from source images suffers from degradation during translation. We introduce a novel approach, Global and Local Alignment Networks (GLA-Net). Our method effectively generates sharper and more realistic images than existing approaches.
arXiv Detail & Related papers (2021-11-19T18:01:54Z)
Fully Convolutional Cross-Scale-Flows for Image-based Defect Detection [24.0966076588569]
We tackle the problem of automatic defect detection without requiring any image samples of defective parts. We propose a novel fully convolutional cross-scale normalizing flow (CS-Flow) that jointly processes multiple feature maps of different scales. Our work sets a new state-of-the-art in image-level defect detection on the benchmark datasets Magnetic Tile Defects and MVTec AD, showing a 100% AUROC on 4 out of 15 classes.
arXiv Detail & Related papers (2021-10-06T15:35:13Z)
Feature Space Targeted Attacks by Statistic Alignment [74.40447383387574]
Feature space targeted attacks perturb images by modulating their intermediate feature maps. The current choice of pixel-wise Euclidean Distance to measure the discrepancy is questionable because it unreasonably imposes a spatial-consistency constraint on the source and target features. We propose two novel approaches called Pair-wise Alignment Attack and Global-wise Alignment Attack, which attempt to measure similarities between feature maps by high-order statistics.
arXiv Detail & Related papers (2021-05-25T03:46:39Z)
Wasserstein Distances for Stereo Disparity Estimation [62.09272563885437]
Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values. This leads to inaccurate results when the true depth or disparity does not match any of these values. We address these issues using a new neural network architecture that is capable of outputting arbitrary depth values.
arXiv Detail & Related papers (2020-07-06T21:37:50Z)
Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields. To better train this efficient generator, in addition to the frequently used VGG feature matching loss, we design a novel self-guided regression loss. We also employ a discriminator with local and global branches to ensure local-global content consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)

This list is automatically generated from the titles and abstracts of the papers on this site.