Concealed Object Detection for Passive Millimeter-Wave Security Imaging
Based on Task-Aligned Detection Transformer
- URL: http://arxiv.org/abs/2212.00313v2
- Date: Fri, 7 Jul 2023 11:34:41 GMT
- Title: Concealed Object Detection for Passive Millimeter-Wave Security Imaging
Based on Task-Aligned Detection Transformer
- Authors: Cheng Guo, Fei Hu, and Yan Hu
- Abstract summary: This paper proposes a Task-Aligned Detection Transformer network, named PMMW-DETR.
In the first stage, a Denoising Coarse-to-Fine Transformer (DCFT) backbone is designed to extract long- and short-range features in the different scales.
In the second stage, we propose the Query Selection module to introduce learned spatial features into the network as prior knowledge.
In the third stage, aiming to improve the classification performance, we perform a Task-Aligned Dual-Head block to decouple the classification and regression tasks.
- Score: 6.524763502003648
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Passive millimeter-wave (PMMW) is a significant potential technique for human
security screening. Several popular object detection networks have been used
for PMMW images. However, restricted by the low resolution and high noise of
PMMW images, PMMW hidden object detection based on deep learning usually
suffers from low accuracy and low classification confidence. To tackle the
above problems, this paper proposes a Task-Aligned Detection Transformer
network, named PMMW-DETR. In the first stage, a Denoising Coarse-to-Fine
Transformer (DCFT) backbone is designed to extract long- and short-range
features in the different scales. In the second stage, we propose the Query
Selection module to introduce learned spatial features into the network as
prior knowledge, which enhances the semantic perception capability of the
network. In the third stage, aiming to improve the classification performance,
we perform a Task-Aligned Dual-Head block to decouple the classification and
regression tasks. Based on our self-developed PMMW security screening dataset,
experimental results including comparison with State-Of-The-Art (SOTA) methods
and ablation study demonstrate that the PMMW-DETR obtains higher accuracy and
classification confidence than previous works, and exhibits robustness to the
PMMW images of low quality.
Related papers
- Looking for Tiny Defects via Forward-Backward Feature Transfer [12.442574943138794]
We introduce a novel benchmark that evaluates methods on the original, high-resolution image and ground-truth masks.
Our benchmark includes a metric that captures robustness with respect to defect size.
Our proposal features the highest robustness to defect size, runs at the fastest speed and yields state-of-the-art segmentation performance.
arXiv Detail & Related papers (2024-07-04T17:59:26Z) - MoE-FFD: Mixture of Experts for Generalized and Parameter-Efficient Face Forgery Detection [54.545054873239295]
Deepfakes have recently raised significant trust issues and security concerns among the public.
ViT-based methods take advantage of the expressivity of transformers, achieving superior detection performance.
This work introduces Mixture-of-Experts modules for Face Forgery Detection (MoE-FFD), a generalized yet parameter-efficient ViT-based approach.
arXiv Detail & Related papers (2024-04-12T13:02:08Z) - The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Massive Random Access [0.7655800373514546]
We present a novel early preamble detection scheme based on a maximum likelihood estimation (MLE) model.
A novel blind normalized Stein variational gradient descent (SVGD)-based detector is proposed to obtain an approximate solution to the MLE model.
The proposed block MHT layer outperforms other transform-based methods in terms of costs and denoising performance.
arXiv Detail & Related papers (2024-03-08T04:08:40Z) - Multidomain transformer-based deep learning for early detection of
network intrusion [15.99260480348544]
Timely response of Network Intrusion Detection Systems (NIDS) is constrained by the flow generation process.
We propose a novel feature extractor, Time Series Network Flow Meter (TS-NFM), that represents network flow as MTS with explainable features.
A new deep learning-based early detection model called Multi-Domain Transformer (MDT) is proposed, which incorporates the frequency domain into Transformer.
arXiv Detail & Related papers (2023-09-03T04:18:08Z) - Towards Large-scale Single-shot Millimeter-wave Imaging for Low-cost
Security Inspection [4.970957539542638]
Millimeter-wave (MMW) imaging is emerging as a promising technique for safe security inspection.
Despite of recent advance, the high cost of requisite large-scale antenna array hinders widespread adoption of MMW imaging in practice.
We report a large-scale single-shot MMW imaging framework using sparse antenna array, achieving low-cost but high-fidelity security inspection.
arXiv Detail & Related papers (2023-05-25T06:00:29Z) - Photovoltaic Panel Defect Detection Based on Ghost Convolution with
BottleneckCSP and Tiny Target Prediction Head Incorporating YOLOv5 [5.632384612137748]
This paper proposes an approach named Ghost convolution with BottleneckCSP and a tiny target prediction head for PV panel defect detection.
The BottleneckCSP module is introduced to add a prediction head for tiny target detection to alleviate tiny defect misses.
The proposed PV panel surface-defect detection network improves the mAP performance by at least 27.8%.
arXiv Detail & Related papers (2023-03-02T01:06:35Z) - Active Gaze Control for Foveal Scene Exploration [124.11737060344052]
We propose a methodology to emulate how humans and robots with foveal cameras would explore a scene.
The proposed method achieves an increase in detection F1-score of 2-3 percentage points for the same number of gaze shifts.
arXiv Detail & Related papers (2022-08-24T14:59:28Z) - Efficient Decoder-free Object Detection with Transformers [75.00499377197475]
Vision transformers (ViTs) are changing the landscape of object detection approaches.
We propose a decoder-free fully transformer-based (DFFT) object detector.
DFFT_SMALL achieves high efficiency in both training and inference stages.
arXiv Detail & Related papers (2022-06-14T13:22:19Z) - Integral Migrating Pre-trained Transformer Encoder-decoders for Visual
Object Detection [78.2325219839805]
imTED improves the state-of-the-art of few-shot object detection by up to 7.6% AP.
Experiments on MS COCO dataset demonstrate that imTED consistently outperforms its counterparts by 2.8%.
arXiv Detail & Related papers (2022-05-19T15:11:20Z) - M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection [74.19291916812921]
forged images generated by Deepfake techniques pose a serious threat to the trustworthiness of digital information.
In this paper, we aim to capture the subtle manipulation artifacts at different scales for Deepfake detection.
We introduce a high-quality Deepfake dataset, SR-DF, which consists of 4,000 DeepFake videos generated by state-of-the-art face swapping and facial reenactment methods.
arXiv Detail & Related papers (2021-04-20T05:43:44Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.