Handgun detection using combined human pose and weapon appearance
- URL: http://arxiv.org/abs/2010.13753v4
- Date: Fri, 23 Jul 2021 09:55:57 GMT
- Title: Handgun detection using combined human pose and weapon appearance
- Authors: Jesus Ruiz-Santaquiteria, Alberto Velasco-Mata, Noelia Vallez, Gloria
Bueno, Juan A. \'Alvarez-Garc\'ia, Oscar Deniz
- Abstract summary: In this work, a novel method is proposed to combine, in a single architecture, both weapon appearance and human pose information.
Results obtained show that the combined model improves the handgun detection state of the art, achieving from 4.23 to 18.9 AP points more than the best previous approach.
- Score: 0.4141979929350861
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Closed-circuit television (CCTV) systems are essential nowadays to prevent
security threats or dangerous situations, in which early detection is crucial.
Novel deep learning-based methods have allowed to develop automatic weapon
detectors with promising results. However, these approaches are mainly based on
visual weapon appearance only. For handguns, body pose may be a useful cue,
especially in cases where the gun is barely visible. In this work, a novel
method is proposed to combine, in a single architecture, both weapon appearance
and human pose information. First, pose keypoints are estimated to extract hand
regions and generate binary pose images, which are the model inputs. Then, each
input is processed in different subnetworks and combined to produce the handgun
bounding box. Results obtained show that the combined model improves the
handgun detection state of the art, achieving from 4.23 to 18.9 AP points more
than the best previous approach.
Related papers
- Real Time Deep Learning Weapon Detection Techniques for Mitigating Lone Wolf Attacks [0.0]
This research focuses on (You Look Only Once) family and Faster RCNN family for model validation and training.
Models achieve the highest score of 78% with an inference speed of 8.1ms.
However, Faster RCNN models achieve the highest AP 89%.
arXiv Detail & Related papers (2024-05-23T03:48:26Z) - Early Action Recognition with Action Prototypes [62.826125870298306]
We propose a novel model that learns a prototypical representation of the full action for each class.
We decompose the video into short clips, where a visual encoder extracts features from each clip independently.
Later, a decoder aggregates together in an online fashion features from all the clips for the final class prediction.
arXiv Detail & Related papers (2023-12-11T18:31:13Z) - Novel Deep Learning Pipeline for Automatic Weapon Detection [0.0]
Weapon and gun violence have recently become a pressing issue.
Real-time surveillance video is captured and recorded in almost all public forums and places.
This paper proposes a novel pipeline consisting of an ensemble of convolutional neural networks with distinct architectures.
arXiv Detail & Related papers (2023-09-28T17:55:14Z) - Detection and Localization of Firearm Carriers in Complex Scenes for
Improved Safety Measures [17.03422574286601]
We propose a novel approach that leverages human-firearm interaction information, which provides valuable clues for localizing firearm carriers.
Our approach incorporates an attention mechanism that effectively distinguishes humans and firearms from the background by focusing on relevant areas.
To handle inputs of varying sizes, we pass paired human-firearm instances with attention masks as channels through a deep network for feature computation.
arXiv Detail & Related papers (2023-09-17T10:50:46Z) - CCTV-Gun: Benchmarking Handgun Detection in CCTV Images [59.24281591714385]
Gun violence is a critical security problem, and it is imperative for the computer vision community to develop effective gun detection algorithms.
detecting guns in real-world CCTV images remains a challenging and under-explored task.
We present a benchmark, called textbfCCTV-Gun, which addresses the challenges of detecting handguns in real-world CCTV images.
arXiv Detail & Related papers (2023-03-19T16:17:35Z) - Are You Stealing My Model? Sample Correlation for Fingerprinting Deep
Neural Networks [86.55317144826179]
Previous methods always leverage the transferable adversarial examples as the model fingerprint.
We propose a novel yet simple model stealing detection method based on SAmple Correlation (SAC)
SAC successfully defends against various model stealing attacks, even including adversarial training or transfer learning.
arXiv Detail & Related papers (2022-10-21T02:07:50Z) - 3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal [85.30756038989057]
Estimating 3D interacting hand pose from a single RGB image is essential for understanding human actions.
We propose to decompose the challenging interacting hand pose estimation task and estimate the pose of each hand separately.
Experiments show that the proposed method significantly outperforms previous state-of-the-art interacting hand pose estimation approaches.
arXiv Detail & Related papers (2022-07-22T13:04:06Z) - Anomaly Detection-Based Unknown Face Presentation Attack Detection [74.4918294453537]
Anomaly detection-based spoof attack detection is a recent development in face Presentation Attack Detection.
In this paper, we present a deep-learning solution for anomaly detection-based spoof attack detection.
The proposed approach benefits from the representation learning power of the CNNs and learns better features for fPAD task.
arXiv Detail & Related papers (2020-07-11T21:20:55Z) - Localizing Firearm Carriers by Identifying Human-Object Pairs [12.425090880770977]
In a given image, human and firearms are separately detected. Each detected human is paired with each detected firearm, allowing us to create a paired bounding box that contains both object and the human.
A network is trained to classify these paired-bounding-boxes into human carrying the identified firearm or not.
arXiv Detail & Related papers (2020-05-19T09:50:23Z) - Human Motion Transfer from Poses in the Wild [61.6016458288803]
We tackle the problem of human motion transfer, where we synthesize novel motion video for a target person that imitates the movement from a reference video.
It is a video-to-video translation task in which the estimated poses are used to bridge two domains.
We introduce a novel pose-to-video translation framework for generating high-quality videos that are temporally coherent even for in-the-wild pose sequences unseen during training.
arXiv Detail & Related papers (2020-04-07T05:59:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.