View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis
- URL: http://arxiv.org/abs/2406.18012v3
- Date: Mon, 19 May 2025 18:23:14 GMT
- Title: View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis
- Authors: Subin Varghese, Vedhus Hoskere,
- Abstract summary: We introduce and formalize Scene Anomaly Detection (Scene AD) as the task of unsupervised, pixel-wise anomaly localization.<n>We evaluate progress in Scene AD using ToyCity, the first multi-object, multi-view real-image dataset.<n>Our experiments demonstrate that OmniAD, when used with augmented views, yields a 64.33% increase in pixel-wise (F_1) score over Reverse Distillation with no augmentation.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The built environment, encompassing critical infrastructure such as bridges and buildings, requires diligent monitoring of unexpected anomalies or deviations from a normal state in captured imagery. Anomaly detection methods could aid in automating this task; however, deploying anomaly detection effectively in such environments presents significant challenges that have not been evaluated before. These challenges include camera viewpoints that vary, the presence of multiple objects within a scene, and the absence of labeled anomaly data for training. To address these comprehensively, we introduce and formalize Scene Anomaly Detection (Scene AD) as the task of unsupervised, pixel-wise anomaly localization under these specific real-world conditions. Evaluating progress in Scene AD required the development of ToyCity, the first multi-object, multi-view real-image dataset, for unsupervised anomaly detection. Our initial evaluations using ToyCity revealed that established anomaly detection baselines struggle to achieve robust pixel-level localization. To address this, two data augmentation strategies were created to generate additional synthetic images of non-anomalous regions to enhance generalizability. However, the addition of these synthetic images alone only provided minor improvements. Thus, OmniAD, a refinement of the Reverse Distillation methodology, was created to establish a stronger baseline. Our experiments demonstrate that OmniAD, when used with augmented views, yields a 64.33\% increase in pixel-wise \(F_1\) score over Reverse Distillation with no augmentation. Collectively, this work offers the Scene AD task definition, the ToyCity benchmark, the view synthesis augmentation approaches, and the OmniAD method. Project Page: https://drags99.github.io/OmniAD/
Related papers
- Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline [63.96226274616927]
A new framework called Track Any Anomalous Object (TAO) introduces a granular video anomaly detection pipeline.<n>Unlike methods that assign anomaly scores to every pixel, our approach transforms the problem into pixel-level tracking of anomalous objects.<n>Experiments demonstrate that TAO sets new benchmarks in accuracy and robustness.
arXiv Detail & Related papers (2025-06-05T15:49:39Z) - Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach [69.01456182499486]
textbfBR-Gen is a large-scale dataset of 150,000 locally forged images with diverse scene-aware annotations.<n>textbfNFA-ViT is a Noise-guided Forgery Amplification Vision Transformer that enhances the detection of localized forgeries.
arXiv Detail & Related papers (2025-04-16T09:57:23Z) - Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detections [50.343419243749054]
Anomaly Detection (AD) involves identifying deviations from normal data distributions.
We propose a novel approach that conditions the prompts of the text encoder based on image context extracted from the vision encoder.
Our method achieves state-of-the-art performance, improving performance by 2% to 29% across different metrics on 14 datasets.
arXiv Detail & Related papers (2025-04-15T10:42:25Z) - A Dataset for Semantic Segmentation in the Presence of Unknowns [49.795683850385956]
Existing datasets allow evaluation of only knowns or unknowns - but not both.
We propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments.
The dataset is twice larger than existing anomaly segmentation datasets.
arXiv Detail & Related papers (2025-03-28T10:31:01Z) - BOOTPLACE: Bootstrapped Object Placement with Detection Transformers [23.300369070771836]
We introduce BOOTPLACE, a novel paradigm that formulates object placement as a placement-by-detection problem.<n> Experimental results on established benchmarks demonstrate BOOTPLACE's superior performance in object repositioning.
arXiv Detail & Related papers (2025-03-27T21:21:20Z) - AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations [12.35831157851407]
AnomalyCD technique learns to identify anomalous changes by learning from the historical normal change pattern.
AnomalyCDM is designed as a two-stage workflow to enhance the efficiency, and has the ability to process the unseen images directly.
arXiv Detail & Related papers (2024-09-09T14:47:57Z) - UMAD: University of Macau Anomaly Detection Benchmark Dataset [26.25955201927986]
We introduce the first benchmark dataset specifically for anomaly detection with reference in robotic patrolling scenarios.
Our benchmark dataset is elaborated such that each query image can find a corresponding reference based on accurate robot localization.
Besides the proposed benchmark dataset, we evaluate the baseline models of ADr on this dataset.
arXiv Detail & Related papers (2024-08-22T16:32:19Z) - Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts [57.01985221057047]
This paper introduces a novel method that learnstemporal prompt embeddings for weakly supervised video anomaly detection and localization (WSVADL) based on pre-trained vision-language models (VLMs)
Our method achieves state-of-theart performance on three public benchmarks for the WSVADL task.
arXiv Detail & Related papers (2024-08-12T03:31:29Z) - Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis [15.748043194987075]
This work aims to bridge the gap by leveraging an open-world object detector and an OoD detector via virtual outlier.
Our approach empowers our overall object detector architecture to learn anomaly-aware feature representations without relying on class labels.
Our method establishes state-of-the-art performance on object-level anomaly detection, achieving an average recall score improvement of over 5.4% for natural images.
arXiv Detail & Related papers (2024-07-22T16:16:38Z) - GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features [68.14842693208465]
GeneralAD is an anomaly detection framework designed to operate in semantic, near-distribution, and industrial settings.
We propose a novel self-supervised anomaly generation module that employs straightforward operations like noise addition and shuffling to patch features.
We extensively evaluated our approach on ten datasets, achieving state-of-the-art results in six and on-par performance in the remaining.
arXiv Detail & Related papers (2024-07-17T09:27:41Z) - ATAC-Net: Zoomed view works better for Anomaly Detection [1.024113475677323]
ATAC-Net is a framework that trains to detect anomalies from a minimal set of known prior anomalies.
We substantiate its superiority to some of the current state-of-the-art techniques in a comparable setting.
arXiv Detail & Related papers (2024-06-20T15:18:32Z) - RAD: A Comprehensive Dataset for Benchmarking the Robustness of Image Anomaly Detection [4.231702796492545]
This study introduces a Robust Anomaly Detection dataset with free views, uneven illuminations, and blurry collections.
RAD aims to identify foreign objects on working platforms as anomalies.
We assess and analyze 11 state-of-the-art unsupervised and zero-shot methods on RAD.
arXiv Detail & Related papers (2024-06-11T11:39:44Z) - Anomaly Detection by Context Contrasting [57.695202846009714]
Anomaly detection focuses on identifying samples that deviate from the norm.
Recent advances in self-supervised learning have shown great promise in this regard.
We propose Con$$, which learns through context augmentations.
arXiv Detail & Related papers (2024-05-29T07:59:06Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach [49.995833831087175]
This work proposes a novel method for generating generic Video-temporal PAs by inpainting a masked out region of an image.
In addition, we present a simple unified framework to detect real-world anomalies under the OCC setting.
Our method performs on par with other existing state-of-the-art PAs generation and reconstruction based methods under the OCC setting.
arXiv Detail & Related papers (2023-11-27T13:14:06Z) - That's BAD: Blind Anomaly Detection by Implicit Local Feature Clustering [28.296651124677556]
Setting blind anomaly detection (BAD) can be converted into a local outlier detection problem.
We propose a novel method named PatchCluster that can accurately detect image- and pixel-level anomalies.
Experimental results show that PatchCluster shows a promising performance without the knowledge of normal data.
arXiv Detail & Related papers (2023-07-06T18:17:43Z) - Unsupervised Visual Defect Detection with Score-Based Generative Model [17.610722842950555]
We focus on the unsupervised visual defect detection and localization tasks.
We propose a novel framework based on the recent score-based generative models.
We evaluate our method on several datasets to demonstrate its effectiveness.
arXiv Detail & Related papers (2022-11-29T11:06:29Z) - Self-Calibrating Anomaly and Change Detection for Autonomous Inspection
Robots [0.07366405857677225]
A visual anomaly or change detection algorithm identifies regions of an image that differ from a reference image or dataset.
We propose a comprehensive deep learning framework for detecting anomalies and changes in a priori unknown environments.
arXiv Detail & Related papers (2022-08-26T09:52:12Z) - Self-Supervised Predictive Convolutional Attentive Block for Anomaly
Detection [97.93062818228015]
We propose to integrate the reconstruction-based functionality into a novel self-supervised predictive architectural building block.
Our block is equipped with a loss that minimizes the reconstruction error with respect to the masked area in the receptive field.
We demonstrate the generality of our block by integrating it into several state-of-the-art frameworks for anomaly detection on image and video.
arXiv Detail & Related papers (2021-11-17T13:30:31Z) - CutPaste: Self-Supervised Learning for Anomaly Detection and
Localization [59.719925639875036]
We propose a framework for building anomaly detectors using normal training data only.
We first learn self-supervised deep representations and then build a generative one-class classifier on learned representations.
Our empirical study on MVTec anomaly detection dataset demonstrates the proposed algorithm is general to be able to detect various types of real-world defects.
arXiv Detail & Related papers (2021-04-08T19:04:55Z) - Unsupervised Two-Stage Anomaly Detection [18.045265572566276]
Anomaly detection from a single image is challenging since anomaly data is always rare and can be with highly unpredictable types.
We propose a two-stage approach, which generates high-fidelity yet anomaly-free reconstructions.
Our method outperforms state-of-the-arts on four anomaly detection datasets.
arXiv Detail & Related papers (2021-03-22T08:57:27Z) - A Background-Agnostic Framework with Adversarial Training for Abnormal
Event Detection in Video [120.18562044084678]
Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years.
We propose a background-agnostic framework that learns from training videos containing only normal events.
arXiv Detail & Related papers (2020-08-27T18:39:24Z) - OIAD: One-for-all Image Anomaly Detection with Disentanglement Learning [23.48763375455514]
We propose a One-for-all Image Anomaly Detection system based on disentangled learning using only clean samples.
Our experiments with three datasets show that OIAD can detect over $90%$ of anomalies while maintaining a low false alarm rate.
arXiv Detail & Related papers (2020-01-18T09:57:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.