AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
- URL: http://arxiv.org/abs/2403.04309v1
- Date: Thu, 7 Mar 2024 08:30:17 GMT
- Title: AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
- Authors: Mingyuan Li, Tong Jia, Hao Wang, Bowen Ma, Shuyang Lin, Da Cai, and
Dongyue Chen
- Abstract summary: We propose an Anti-Overlapping DETR (AO-DETR) based on one of the state-of-the-art general object detectors, DINO.
To address the feature coupling issue caused by overlapping phenomena, we introduce the Category-Specific One-to-One Assignment (CSA) strategy.
To address the edge blurring problem caused by overlapping phenomena, we propose the Look Forwardly scheme.
- Score: 6.603436370737025
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Prohibited item detection in X-ray images is one of the most essential and
highly effective methods widely employed in various security inspection
scenarios. Considering the significant overlapping phenomenon in X-ray
prohibited item images, we propose an Anti-Overlapping DETR (AO-DETR) based on
one of the state-of-the-art general object detectors, DINO. Specifically, to
address the feature coupling issue caused by overlapping phenomena, we
introduce the Category-Specific One-to-One Assignment (CSA) strategy to
constrain category-specific object queries in predicting prohibited items of
fixed categories, which can enhance their ability to extract features specific
to prohibited items of a particular category from the overlapping
foreground-background features. To address the edge blurring problem caused by
overlapping phenomena, we propose the Look Forward Densely (LFD) scheme, which
improves the localization accuracy of reference boxes in mid-to-high-level
decoder layers and enhances the ability to locate blurry edges of the final
layer. Similar to DINO, our AO-DETR provides two different versions with
distinct backbones, tailored to meet diverse application requirements.
Extensive experiments on the PIXray and OPIXray datasets demonstrate that the
proposed method surpasses the state-of-the-art object detectors, indicating its
potential applications in the field of prohibited item detection. The source
code will be released at https://github.com/Limingyuan001/AO-DETR-test.
Related papers
- DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object
Tracking [38.36872739816151]
Occlusion-Aware Attention (OAA) module in the detector highlights the object features while suppressing the occluded background regions.
OAA can serve as a modulator that enhances the detector for some potentially occluded objects.
We design a Re-ID embedding matching block based on the optimal transport problem.
arXiv Detail & Related papers (2023-08-30T06:56:53Z) - PIDray: A Large-scale X-ray Benchmark for Real-World Prohibited Item
Detection [21.055813365091662]
We present a large-scale dataset, named PIDray, which covers various cases in real-world scenarios for prohibited item detection.
In specific, PIDray collects 124,486 X-ray images for $12$ categories of prohibited items.
We propose a general divide-and-conquer pipeline to develop baseline algorithms on PIDray.
arXiv Detail & Related papers (2022-11-19T18:31:34Z) - Towards Real-world X-ray Security Inspection: A High-Quality Benchmark
and Lateral Inhibition Module for Prohibited Items Detection [37.66855218659698]
We first present a High-quality X-ray (HiXray) security inspection image dataset, which contains 102,928 common prohibited items of 8 categories.
For accurate prohibited item detection, we propose the Lateral Inhibition Module (LIM) inspired by the fact that humans recognize these items by ignoring irrelevant information.
arXiv Detail & Related papers (2021-08-23T03:59:23Z) - Towards Real-World Prohibited Item Detection: A Large-Scale X-ray
Benchmark [53.9819155669618]
This paper presents a large-scale dataset, named as PIDray, which covers various cases in real-world scenarios for prohibited item detection.
With an intensive amount of effort, our dataset contains $12$ categories of prohibited items in $47,677$ X-ray images with high-quality annotated segmentation masks and bounding boxes.
The proposed method performs favorably against the state-of-the-art methods, especially for detecting the deliberately hidden items.
arXiv Detail & Related papers (2021-08-16T11:14:16Z) - Reference-based Defect Detection Network [57.89399576743665]
The first issue is the texture shift which means a trained defect detector model will be easily affected by unseen texture.
The second issue is partial visual confusion which indicates that a partial defect box is visually similar with a complete box.
We propose a Reference-based Defect Detection Network (RDDN) to tackle these two problems.
arXiv Detail & Related papers (2021-08-10T05:44:23Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - Black-box Explanation of Object Detectors via Saliency Maps [66.745167677293]
We propose D-RISE, a method for generating visual explanations for the predictions of object detectors.
We show that D-RISE can be easily applied to different object detectors including one-stage detectors such as YOLOv3 and two-stage detectors such as Faster-RCNN.
arXiv Detail & Related papers (2020-06-05T02:13:35Z) - Occluded Prohibited Items Detection: an X-ray Security Inspection
Benchmark and De-occlusion Attention Module [50.75589128518707]
We contribute the first high-quality object detection dataset for security inspection, named OPIXray.
OPIXray focused on the widely-occurred prohibited item "cutter", annotated manually by professional inspectors from the international airport.
We propose the De-occlusion Attention Module (DOAM), a plug-and-play module that can be easily inserted into and thus promote most popular detectors.
arXiv Detail & Related papers (2020-04-18T16:10:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.