CCTV-Gun: Benchmarking Handgun Detection in CCTV Images
- URL: http://arxiv.org/abs/2303.10703v3
- Date: Tue, 11 Jul 2023 15:33:09 GMT
- Title: CCTV-Gun: Benchmarking Handgun Detection in CCTV Images
- Authors: Srikar Yellapragada, Zhenghong Li, Kevin Bhadresh Doshi, Purva
Makarand Mhasakar, Heng Fan, Jie Wei, Erik Blasch, Bin Zhang, Haibin Ling
- Abstract summary: Gun violence is a critical security problem, and it is imperative for the computer vision community to develop effective gun detection algorithms.
detecting guns in real-world CCTV images remains a challenging and under-explored task.
We present a benchmark, called textbfCCTV-Gun, which addresses the challenges of detecting handguns in real-world CCTV images.
- Score: 59.24281591714385
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Gun violence is a critical security problem, and it is imperative for the
computer vision community to develop effective gun detection algorithms for
real-world scenarios, particularly in Closed Circuit Television (CCTV)
surveillance data. Despite significant progress in visual object detection,
detecting guns in real-world CCTV images remains a challenging and
under-explored task. Firearms, especially handguns, are typically very small in
size, non-salient in appearance, and often severely occluded or
indistinguishable from other small objects. Additionally, the lack of
principled benchmarks and difficulty collecting relevant datasets further
hinder algorithmic development. In this paper, we present a meticulously
crafted and annotated benchmark, called \textbf{CCTV-Gun}, which addresses the
challenges of detecting handguns in real-world CCTV images. Our contribution is
three-fold. Firstly, we carefully select and analyze real-world CCTV images
from three datasets, manually annotate handguns and their holders, and assign
each image with relevant challenge factors such as blur and occlusion.
Secondly, we propose a new cross-dataset evaluation protocol in addition to the
standard intra-dataset protocol, which is vital for gun detection in practical
settings. Finally, we comprehensively evaluate both classical and
state-of-the-art object detection algorithms, providing an in-depth analysis of
their generalizing abilities. The benchmark will facilitate further research
and development on this topic and ultimately enhance security. Code,
annotations, and trained models are available at
https://github.com/srikarym/CCTV-Gun.
Related papers
- SSIVD-Net: A Novel Salient Super Image Classification & Detection
Technique for Weaponized Violence [3.651114792588495]
Detection of violence and weaponized violence in CCTV footage requires a comprehensive approach.
We introduce the emphSmart-City CCTV Violence Detection (SCVD) dataset.
We propose a novel technique called emphSSIVD-Net (textbfSalient-textbfSuper-textbfImage for textbfViolence textbfDetection)
arXiv Detail & Related papers (2022-07-26T12:31:01Z) - E^2TAD: An Energy-Efficient Tracking-based Action Detector [78.90585878925545]
This paper presents a tracking-based solution to accurately and efficiently localize predefined key actions.
It won first place in the UAV-Video Track of 2021 Low-Power Computer Vision Challenge (LPCVC)
arXiv Detail & Related papers (2022-04-09T07:52:11Z) - YouTube-GDD: A challenging gun detection dataset with rich contextual
information [5.956046069509441]
This work presents a new challenging dataset called YouTube Gun Detection dataset (YouTube-GDD)
Our dataset is collected from 343 high-definition YouTube videos and contains 5000 well-chosen images, in which 16064 instances of gun and 9046 instances of person are annotated.
To build a baseline for gun detection, we evaluate YOLOv5 on YouTube-GDD and analyze the influence of additional related annotated information on gun detection.
arXiv Detail & Related papers (2022-03-08T14:55:10Z) - Real Time Action Recognition from Video Footage [0.5219568203653523]
Video surveillance cameras have added a new dimension to detect crime.
This research focuses on integrating state-of-the-art Deep Learning methods to ensure a robust pipeline for autonomous surveillance for detecting violent activities.
arXiv Detail & Related papers (2021-12-13T07:27:41Z) - Finding Facial Forgery Artifacts with Parts-Based Detectors [73.08584805913813]
We design a series of forgery detection systems that each focus on one individual part of the face.
We use these detectors to perform detailed empirical analysis on the FaceForensics++, Celeb-DF, and Facebook Deepfake Detection Challenge datasets.
arXiv Detail & Related papers (2021-09-21T16:18:45Z) - A Gun Detection Dataset and Searching for Embedded Device Solutions [1.2149550080095914]
We publish a dataset with 51K annotated gun images for gun detection and other 51K cropped gun chip images for gun classification.
To our knowledge, this is the largest dataset for the study of gun detection.
We also study to search for solutions for gun detection in embedded edge device (camera) and a gun/non-gun classification on a cloud server.
arXiv Detail & Related papers (2021-05-03T17:58:45Z) - VideoForensicsHQ: Detecting High-quality Manipulated Face Videos [77.60295082172098]
We show how the performance of forgery detectors depends on the presence of artefacts that the human eye can see.
We introduce a new benchmark dataset for face video forgery detection, of unprecedented quality.
arXiv Detail & Related papers (2020-05-20T21:17:43Z) - Knife and Threat Detectors [0.0]
We present three complementary methods for scoring automated threat identification using multiple knife image datasets.
To alert an observer to the knife-wielding threat, we test and deploy classification built around MobileNet.
A final model built on the PoseNet architecture assigns anatomical waypoints or skeletal features to narrow the threat characteristics.
arXiv Detail & Related papers (2020-04-04T12:41:28Z) - Firearm Detection and Segmentation Using an Ensemble of Semantic Neural
Networks [62.997667081978825]
We present a weapon detection system based on an ensemble of semantic Convolutional Neural Networks.
A set of simpler neural networks dedicated to specific tasks requires less computational resources and can be trained in parallel.
The overall output of the system given by the aggregation of the outputs of individual networks can be tuned by a user to trade-off false positives and false negatives.
arXiv Detail & Related papers (2020-02-11T13:58:16Z) - Training-free Monocular 3D Event Detection System for Traffic
Surveillance [93.65240041833319]
Existing event detection systems are mostly learning-based and have achieved convincing performance when a large amount of training data is available.
In real-world scenarios, collecting sufficient labeled training data is expensive and sometimes impossible.
We propose a training-free monocular 3D event detection system for traffic surveillance.
arXiv Detail & Related papers (2020-02-01T04:42:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.