Related papers: Enhancing Prohibited Item Detection through X-ray-Specific Augmentation and Contextual Feature Integration

Enhancing Prohibited Item Detection through X-ray-Specific Augmentation and Contextual Feature Integration

URL: http://arxiv.org/abs/2411.18078v2
Date: Tue, 11 Mar 2025 06:10:48 GMT
Title: Enhancing Prohibited Item Detection through X-ray-Specific Augmentation and Contextual Feature Integration
Authors: Renshuai Tao, Haoyu Wang, Wei Wang, Yunchao Wei, Yao Zhao,
Abstract summary: X-ray prohibited item detection faces challenges due to the long-tail distribution and unique characteristics of X-ray imaging.<n>Traditional data augmentation strategies, such as copy-paste and mixup, are ineffective at improving the detection of rare items.<n>We propose the X-ray Imaging-driven Detection Network (XIDNet) to address these challenges.
Score: 81.11400642272976
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: X-ray prohibited item detection faces challenges due to the long-tail distribution and unique characteristics of X-ray imaging. Traditional data augmentation strategies, such as copy-paste and mixup, are ineffective at improving the detection of rare items due to the complex interactions between overlapping objects. Furthermore, X-ray imaging removes easily distinguishable features like color and texture, making it difficult to differentiate between visually similar categories. To address these challenges, in this work, we propose the X-ray Imaging-driven Detection Network (XIDNet). Inspired by the unique characteristics of X-ray imaging, this network introduces two key innovations: a novel X-ray-specific augmentation strategy that generates more realistic training samples for rare items, thereby improving detection performance for categories with insufficient samples, and an contextual feature integration algorithm that captures the spatial and semantic interactions between objects and surroundings under X-ray imaging, enhancing the model's ability to distinguish between similar categories. Extensive experimental results show that XIDNet effectively leverages X-ray imaging characteristics to significantly improve detection performance, outperforming popular SoTA methods by up to 17.2% in tail categories.

Related papers

Self-Supervised Multiview Xray Matching [4.033064933995391]
Current methods often struggle to establish robust correspondences between different X-ray views.<n>We present a novel self-supervised pipeline that eliminates the need for manual annotation.<n>Our approach incorporates a transformer-based training phase to accurately predict correspondences across two or more X-ray views.
arXiv Detail & Related papers (2025-06-30T21:56:14Z)
Superpowering Open-Vocabulary Object Detectors for X-ray Vision [53.07098133237041]
Open-vocabulary object detection (OvOD) is set to revolutionize security screening by enabling systems to recognize any item in X-ray scans. We propose RAXO, a framework that repurposes off-the-shelf RGB OvOD detectors for robust X-ray detection. RAXO builds high-quality X-ray class descriptors using a dual-source retrieval strategy.
arXiv Detail & Related papers (2025-03-21T11:54:16Z)
AdverX-Ray: Ensuring X-Ray Integrity Through Frequency-Sensitive Adversarial VAEs [7.0477485974331895]
AdverX-Ray serves as an image-quality assessment layer. It is trained on patches from X-ray images of specific machine models. It can evaluate whether a scan matches the training distribution, or if a scan from the same machine is captured under different settings.
arXiv Detail & Related papers (2025-02-23T15:32:40Z)
BGM: Background Mixup for X-ray Prohibited Items Detection [75.58709178012502]
This paper introduces a novel data augmentation approach tailored for prohibited item detection, leveraging unique characteristics inherent to X-ray imagery. Our method is motivated by observations of physical properties including: 1) X-ray Transmission Imagery: Unlike reflected light images, transmitted X-ray pixels represent composite information from multiple materials along the imaging path. We propose a simple yet effective X-ray image augmentation technique, Background Mixup (BGM), for prohibited item detection in security screening contexts.
arXiv Detail & Related papers (2024-11-30T12:26:55Z)
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans? [78.26435264182763]
We introduce the Large-scale Dual-view X-ray (LDXray), which consists of 353,646 instances across 12 categories.<n>To emulate human intelligence in dual-view detection, we propose the Auxiliary-view Enhanced Network (AENet)<n>Experiments on the LDXray dataset demonstrate that the dual-view mechanism significantly enhances detection performance.
arXiv Detail & Related papers (2024-11-27T06:36:20Z)
Model X-ray:Detecting Backdoored Models via Decision Boundary [62.675297418960355]
Backdoor attacks pose a significant security vulnerability for deep neural networks (DNNs) We propose Model X-ray, a novel backdoor detection approach based on the analysis of illustrated two-dimensional (2D) decision boundaries. Our approach includes two strategies focused on the decision areas dominated by clean samples and the concentration of label distribution.
arXiv Detail & Related papers (2024-02-27T12:42:07Z)
Spatial-Frequency Discriminability for Revealing Adversarial Perturbations [53.279716307171604]
Vulnerability of deep neural networks to adversarial perturbations has been widely perceived in the computer vision community. Current algorithms typically detect adversarial patterns through discriminative decomposition for natural and adversarial data. We propose a discriminative detector relying on a spatial-frequency Krawtchouk decomposition.
arXiv Detail & Related papers (2023-05-18T10:18:59Z)
Illicit item detection in X-ray images for security applications [7.519872646378835]
Automated detection of contraband items in X-ray images can significantly increase public safety. Modern computer vision algorithms relying on Deep Neural Networks (DNNs) have proven capable of undertaking this task. This paper proposes a two-fold improvement of such algorithms for the X-ray analysis domain.
arXiv Detail & Related papers (2023-05-03T07:28:05Z)
Joint Sub-component Level Segmentation and Classification for Anomaly Detection within Dual-Energy X-Ray Security Imagery [14.785070524184649]
The performance is evaluated over a dataset of cluttered X-ray baggage security imagery. The proposed joint sub-component level segmentation and classification approach achieve 99% true positive and 5% false positive for anomaly detection task.
arXiv Detail & Related papers (2022-10-29T00:44:50Z)
Generative Residual Attention Network for Disease Detection [51.60842580044539]
We present a novel approach for disease generation in X-rays using a conditional generative adversarial learning. We generate a corresponding radiology image in a target domain while preserving the identity of the patient. We then use the generated X-ray image in the target domain to augment our training to improve the detection performance.
arXiv Detail & Related papers (2021-10-25T14:15:57Z)
On the impact of using X-ray energy response imagery for object detection via Convolutional Neural Networks [17.639472693362926]
We study the impact of variant X-ray imagery, i.e. X-ray energy response (high, low) and effective-z compared to geometries. We evaluate CNN architectures to explore the transferability of models trained with such 'raw' variant imagery.
arXiv Detail & Related papers (2021-08-27T21:28:28Z)
Towards Real-world X-ray Security Inspection: A High-Quality Benchmark and Lateral Inhibition Module for Prohibited Items Detection [37.66855218659698]
We first present a High-quality X-ray (HiXray) security inspection image dataset, which contains 102,928 common prohibited items of 8 categories. For accurate prohibited item detection, we propose the Lateral Inhibition Module (LIM) inspired by the fact that humans recognize these items by ignoring irrelevant information.
arXiv Detail & Related papers (2021-08-23T03:59:23Z)
Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark [53.9819155669618]
This paper presents a large-scale dataset, named as PIDray, which covers various cases in real-world scenarios for prohibited item detection. With an intensive amount of effort, our dataset contains $12$ categories of prohibited items in $47,677$ X-ray images with high-quality annotated segmentation masks and bounding boxes. The proposed method performs favorably against the state-of-the-art methods, especially for detecting the deliberately hidden items.
arXiv Detail & Related papers (2021-08-16T11:14:16Z)
Cross-Modal Contrastive Learning for Abnormality Classification and Localization in Chest X-rays with Radiomics using a Feedback Loop [63.81818077092879]
We propose an end-to-end semi-supervised cross-modal contrastive learning framework for medical images. We first apply an image encoder to classify the chest X-rays and to generate the image features. The radiomic features are then passed through another dedicated encoder to act as the positive sample for the image features generated from the same chest X-ray.
arXiv Detail & Related papers (2021-04-11T09:16:29Z)
Over-sampling De-occlusion Attention Network for Prohibited Items Detection in Noisy X-ray Images [35.35752470993847]
Security inspection is X-ray scanning for personal belongings in suitcases. Traditional CNN-based models trained through common image recognition datasets fail to achieve satisfactory performance in this scenario. We propose an over-sampling de-occlusion attention network (DOAM-O), which consists of a novel de-occlusion attention module and a new over-sampling training strategy.
arXiv Detail & Related papers (2021-03-01T07:17:37Z)
Occluded Prohibited Items Detection: an X-ray Security Inspection Benchmark and De-occlusion Attention Module [50.75589128518707]
We contribute the first high-quality object detection dataset for security inspection, named OPIXray. OPIXray focused on the widely-occurred prohibited item "cutter", annotated manually by professional inspectors from the international airport. We propose the De-occlusion Attention Module (DOAM), a plug-and-play module that can be easily inserted into and thus promote most popular detectors.
arXiv Detail & Related papers (2020-04-18T16:10:55Z)

This list is automatically generated from the titles and abstracts of the papers in this site.