A Unified Benchmark for the Unknown Detection Capability of Deep Neural
Networks
- URL: http://arxiv.org/abs/2112.00337v2
- Date: Tue, 1 Aug 2023 03:45:57 GMT
- Title: A Unified Benchmark for the Unknown Detection Capability of Deep Neural
Networks
- Authors: Jihyo Kim, Jiin Koo, Sangheum Hwang
- Abstract summary: We introduce the unknown detection task, an integration of previous individual tasks.
We find that Deep Ensemble consistently outperforms the other approaches in detecting unknowns.
- Score: 2.578242050187029
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep neural networks have achieved outstanding performance over various
tasks, but they have a critical issue: over-confident predictions even for
completely unknown samples. Many studies have been proposed to successfully
filter out these unknown samples, but they only considered narrow and specific
tasks, referred to as misclassification detection, open-set recognition, or
out-of-distribution detection. In this work, we argue that these tasks should
be treated as fundamentally an identical problem because an ideal model should
possess detection capability for all those tasks. Therefore, we introduce the
unknown detection task, an integration of previous individual tasks, for a
rigorous examination of the detection capability of deep neural networks on a
wide spectrum of unknown samples. To this end, unified benchmark datasets on
different scales were constructed and the unknown detection capabilities of
existing popular methods were subject to comparison. We found that Deep
Ensemble consistently outperforms the other approaches in detecting unknowns;
however, all methods are only successful for a specific type of unknown. The
reproducible code and benchmark datasets are available at
https://github.com/daintlab/unknown-detection-benchmarks .
Related papers
- A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [52.228708947607636]
This paper introduces a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods.
The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics.
We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z) - Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors [57.7003399760813]
We explore advanced Large Language Models (LLMs) and their specialized variants, contributing to this field in several ways.
We uncover a significant correlation between topics and detection performance.
These investigations shed light on the adaptability and robustness of these detection methods across diverse topics.
arXiv Detail & Related papers (2023-12-20T10:53:53Z) - Deep Learning Object Detection Approaches to Source Identification [15.813217907813778]
We propose a system that manages to alleviate the failure cases encountered when using traditional source identification algorithms.
Our contributions include framing source identification as an object detection problem, the publication of a spectrogram object detection dataset, and evaluation of the RetinaNet and YOLOv5 object detection models trained on the dataset.
arXiv Detail & Related papers (2022-10-27T02:08:46Z) - Meta-learning One-class Classifiers with Eigenvalue Solvers for
Supervised Anomaly Detection [55.888835686183995]
We propose a neural network-based meta-learning method for supervised anomaly detection.
We experimentally demonstrate that the proposed method achieves better performance than existing anomaly detection and few-shot learning methods.
arXiv Detail & Related papers (2021-03-01T01:43:04Z) - Open Set Recognition with Conditional Probabilistic Generative Models [51.40872765917125]
We propose Conditional Probabilistic Generative Models (CPGM) for open set recognition.
CPGM can detect unknown samples but also classify known classes by forcing different latent features to approximate conditional Gaussian distributions.
Experiment results on multiple benchmark datasets reveal that the proposed method significantly outperforms the baselines.
arXiv Detail & Related papers (2020-08-12T06:23:49Z) - AutoOD: Automated Outlier Detection via Curiosity-guided Search and
Self-imitation Learning [72.99415402575886]
Outlier detection is an important data mining task with numerous practical applications.
We propose AutoOD, an automated outlier detection framework, which aims to search for an optimal neural network model.
Experimental results on various real-world benchmark datasets demonstrate that the deep model identified by AutoOD achieves the best performance.
arXiv Detail & Related papers (2020-06-19T18:57:51Z) - NADS: Neural Architecture Distribution Search for Uncertainty Awareness [79.18710225716791]
Machine learning (ML) systems often encounter Out-of-Distribution (OoD) errors when dealing with testing data coming from a distribution different from training data.
Existing OoD detection approaches are prone to errors and even sometimes assign higher likelihoods to OoD samples.
We propose Neural Architecture Distribution Search (NADS) to identify common building blocks among all uncertainty-aware architectures.
arXiv Detail & Related papers (2020-06-11T17:39:07Z) - Conditional Gaussian Distribution Learning for Open Set Recognition [10.90687687505665]
We propose Conditional Gaussian Distribution Learning (CGDL) for open set recognition.
In addition to detecting unknown samples, this method can also classify known samples by forcing different latent features to approximate different Gaussian models.
Experiments on several standard image reveal that the proposed method significantly outperforms the baseline method and achieves new state-of-the-art results.
arXiv Detail & Related papers (2020-03-19T14:32:08Z) - A Post-processing Method for Detecting Unknown Intent of Dialogue System
via Pre-trained Deep Neural Network Classifier [23.25650237235107]
We propose SofterMax and deep novelty detection (SMDN) for detecting unknown intent in dialogue systems.
Our method can be flexibly applied on top of any classifiers trained in deep neural networks without changing the model architecture.
Our method classifies examples that differ from known intents as unknown and does not require any examples or prior knowledge of it.
arXiv Detail & Related papers (2020-03-07T03:29:01Z) - $\text{A}^3$: Activation Anomaly Analysis [0.7734726150561088]
We show that the hidden activation values contain information useful to distinguish between normal and anomalous samples.
Our approach combines three neural networks in a purely data-driven end-to-end model.
Thanks to the anomaly network, our method even works in strict semi-supervised settings.
arXiv Detail & Related papers (2020-03-03T21:23:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.