Towards Real-World Prohibited Item Detection: A Large-Scale X-ray
Benchmark
- URL: http://arxiv.org/abs/2108.07020v1
- Date: Mon, 16 Aug 2021 11:14:16 GMT
- Title: Towards Real-World Prohibited Item Detection: A Large-Scale X-ray
Benchmark
- Authors: Boying Wang and Libo Zhang and Longyin Wen and Xianglong Liu and
Yanjun Wu
- Abstract summary: This paper presents a large-scale dataset, named as PIDray, which covers various cases in real-world scenarios for prohibited item detection.
With an intensive amount of effort, our dataset contains $12$ categories of prohibited items in $47,677$ X-ray images with high-quality annotated segmentation masks and bounding boxes.
The proposed method performs favorably against the state-of-the-art methods, especially for detecting the deliberately hidden items.
- Score: 53.9819155669618
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatic security inspection using computer vision technology is a
challenging task in real-world scenarios due to various factors, including
intra-class variance, class imbalance, and occlusion. Most of the previous
methods rarely solve the cases that the prohibited items are deliberately
hidden in messy objects due to the lack of large-scale datasets, restricted
their applications in real-world scenarios. Towards real-world prohibited item
detection, we collect a large-scale dataset, named as PIDray, which covers
various cases in real-world scenarios for prohibited item detection, especially
for deliberately hidden items. With an intensive amount of effort, our dataset
contains $12$ categories of prohibited items in $47,677$ X-ray images with
high-quality annotated segmentation masks and bounding boxes. To the best of
our knowledge, it is the largest prohibited items detection dataset to date.
Meanwhile, we design the selective dense attention network (SDANet) to
construct a strong baseline, which consists of the dense attention module and
the dependency refinement module. The dense attention module formed by the
spatial and channel-wise dense attentions, is designed to learn the
discriminative features to boost the performance. The dependency refinement
module is used to exploit the dependencies of multi-scale features. Extensive
experiments conducted on the collected PIDray dataset demonstrate that the
proposed method performs favorably against the state-of-the-art methods,
especially for detecting the deliberately hidden items.
Related papers
- Bayesian Detector Combination for Object Detection with Crowdsourced Annotations [49.43709660948812]
Acquiring fine-grained object detection annotations in unconstrained images is time-consuming, expensive, and prone to noise.
We propose a novel Bayesian Detector Combination (BDC) framework to more effectively train object detectors with noisy crowdsourced annotations.
BDC is model-agnostic, requires no prior knowledge of the annotators' skill level, and seamlessly integrates with existing object detection models.
arXiv Detail & Related papers (2024-07-10T18:00:54Z) - A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [52.228708947607636]
This paper introduces a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods.
The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics.
We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z) - Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection [57.646582245834324]
We propose a simple yet effective deepfake detector called LSDA.
It is based on a idea: representations with a wider variety of forgeries should be able to learn a more generalizable decision boundary.
We show that our proposed method is surprisingly effective and transcends state-of-the-art detectors across several widely used benchmarks.
arXiv Detail & Related papers (2023-11-19T09:41:10Z) - Towards End-to-End Unsupervised Saliency Detection with Self-Supervised
Top-Down Context [25.85453873366275]
We propose a self-supervised end-to-end salient object detection framework via top-down context.
We exploit the self-localization from the deepest feature to construct the location maps which are then leveraged to learn the most instructive segmentation guidance.
Our method achieves leading performance among the recent end-to-end methods and most of the multi-stage solutions.
arXiv Detail & Related papers (2023-10-14T08:43:22Z) - PIDray: A Large-scale X-ray Benchmark for Real-World Prohibited Item
Detection [21.055813365091662]
We present a large-scale dataset, named PIDray, which covers various cases in real-world scenarios for prohibited item detection.
In specific, PIDray collects 124,486 X-ray images for $12$ categories of prohibited items.
We propose a general divide-and-conquer pipeline to develop baseline algorithms on PIDray.
arXiv Detail & Related papers (2022-11-19T18:31:34Z) - Towards Real-world X-ray Security Inspection: A High-Quality Benchmark
and Lateral Inhibition Module for Prohibited Items Detection [37.66855218659698]
We first present a High-quality X-ray (HiXray) security inspection image dataset, which contains 102,928 common prohibited items of 8 categories.
For accurate prohibited item detection, we propose the Lateral Inhibition Module (LIM) inspired by the fact that humans recognize these items by ignoring irrelevant information.
arXiv Detail & Related papers (2021-08-23T03:59:23Z) - Over-sampling De-occlusion Attention Network for Prohibited Items
Detection in Noisy X-ray Images [35.35752470993847]
Security inspection is X-ray scanning for personal belongings in suitcases.
Traditional CNN-based models trained through common image recognition datasets fail to achieve satisfactory performance in this scenario.
We propose an over-sampling de-occlusion attention network (DOAM-O), which consists of a novel de-occlusion attention module and a new over-sampling training strategy.
arXiv Detail & Related papers (2021-03-01T07:17:37Z) - SoDA: Multi-Object Tracking with Soft Data Association [75.39833486073597]
Multi-object tracking (MOT) is a prerequisite for a safe deployment of self-driving cars.
We propose a novel approach to MOT that uses attention to compute track embeddings that encode dependencies between observed objects.
arXiv Detail & Related papers (2020-08-18T03:40:25Z) - Occluded Prohibited Items Detection: an X-ray Security Inspection
Benchmark and De-occlusion Attention Module [50.75589128518707]
We contribute the first high-quality object detection dataset for security inspection, named OPIXray.
OPIXray focused on the widely-occurred prohibited item "cutter", annotated manually by professional inspectors from the international airport.
We propose the De-occlusion Attention Module (DOAM), a plug-and-play module that can be easily inserted into and thus promote most popular detectors.
arXiv Detail & Related papers (2020-04-18T16:10:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.