SSIVD-Net: A Novel Salient Super Image Classification & Detection
Technique for Weaponized Violence
- URL: http://arxiv.org/abs/2207.12850v8
- Date: Tue, 7 Nov 2023 12:59:17 GMT
- Title: SSIVD-Net: A Novel Salient Super Image Classification & Detection
Technique for Weaponized Violence
- Authors: Toluwani Aremu, Li Zhiyuan, Reem Alameeri, Mustaqeem Khan,
Abdulmotaleb El Saddik
- Abstract summary: Detection of violence and weaponized violence in CCTV footage requires a comprehensive approach.
We introduce the emphSmart-City CCTV Violence Detection (SCVD) dataset.
We propose a novel technique called emphSSIVD-Net (textbfSalient-textbfSuper-textbfImage for textbfViolence textbfDetection)
- Score: 3.651114792588495
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Detection of violence and weaponized violence in closed-circuit television
(CCTV) footage requires a comprehensive approach. In this work, we introduce
the \emph{Smart-City CCTV Violence Detection (SCVD)} dataset, specifically
designed to facilitate the learning of weapon distribution in surveillance
videos. To tackle the complexities of analyzing 3D surveillance video for
violence recognition tasks, we propose a novel technique called
\emph{SSIVD-Net} (\textbf{S}alient-\textbf{S}uper-\textbf{I}mage for
\textbf{V}iolence \textbf{D}etection). Our method reduces 3D video data
complexity, dimensionality, and information loss while improving inference,
performance, and explainability through salient-super-Image representations.
Considering the scalability and sustainability requirements of futuristic smart
cities, the authors introduce the \emph{Salient-Classifier}, a novel
architecture combining a kernelized approach with a residual learning strategy.
We evaluate variations of SSIVD-Net and Salient Classifier on our SCVD dataset
and benchmark against state-of-the-art (SOTA) models commonly employed in
violence detection. Our approach exhibits significant improvements in detecting
both weaponized and non-weaponized violence instances. By advancing the SOTA in
violence detection, our work offers a practical and scalable solution suitable
for real-world applications. The proposed methodology not only addresses the
challenges of violence detection in CCTV footage but also contributes to the
understanding of weapon distribution in smart surveillance. Ultimately, our
research findings should enable smarter and more secure cities, as well as
enhance public safety measures.
Related papers
- AdvMono3D: Advanced Monocular 3D Object Detection with Depth-Aware
Robust Adversarial Training [64.14759275211115]
We propose a depth-aware robust adversarial training method for monocular 3D object detection, dubbed DART3D.
Our adversarial training approach capitalizes on the inherent uncertainty, enabling the model to significantly improve its robustness against adversarial attacks.
arXiv Detail & Related papers (2023-09-03T07:05:32Z) - Two-Stage Violence Detection Using ViTPose and Classification Models at
Smart Airports [9.53984191161849]
This study introduces an innovative violence detection framework tailored to the unique requirements of smart airports.
The framework harnesses the power of ViTPose for human pose estimation.
The solution underwent integrated testing to ensure robust performance in real world scenarios.
arXiv Detail & Related papers (2023-08-30T21:20:15Z) - CCTV-Gun: Benchmarking Handgun Detection in CCTV Images [59.24281591714385]
Gun violence is a critical security problem, and it is imperative for the computer vision community to develop effective gun detection algorithms.
detecting guns in real-world CCTV images remains a challenging and under-explored task.
We present a benchmark, called textbfCCTV-Gun, which addresses the challenges of detecting handguns in real-world CCTV images.
arXiv Detail & Related papers (2023-03-19T16:17:35Z) - A Comprehensive Study of the Robustness for LiDAR-based 3D Object
Detectors against Adversarial Attacks [84.10546708708554]
3D object detectors are increasingly crucial for security-critical tasks.
It is imperative to understand their robustness against adversarial attacks.
This paper presents the first comprehensive evaluation and analysis of the robustness of LiDAR-based 3D detectors under adversarial attacks.
arXiv Detail & Related papers (2022-12-20T13:09:58Z) - Video Vision Transformers for Violence Detection [0.0]
The proposed solution uses a novel end-to-end deep learning-based video vision transformer (ViViT) that can proficiently discern fights, hostile movements, and violent events in video sequences.
The evaluated results can be subsequently sent to local concerned authority, and the captured video can be analyzed.
arXiv Detail & Related papers (2022-09-08T04:44:01Z) - Detecting Violence in Video Based on Deep Features Fusion Technique [0.30458514384586394]
This work proposed a novel method to detect violence using a fusion tech-nique of two convolutional neural networks (CNNs)
The performance of the proposed method is evaluated using three standard benchmark datasets in terms of detection accuracy.
arXiv Detail & Related papers (2022-04-15T12:51:20Z) - Video Violence Recognition and Localization using a Semi-Supervised
Hard-Attention Model [0.0]
Violence monitoring and surveillance systems could keep communities safe and save lives.
The current state-of-the-art deep learning approaches to video violence recognition to higher levels of accuracy and performance could enable surveillance systems to be more reliable and scalable.
The main contribution of the proposed deep reinforcement learning method is to achieve state-of-the-art accuracy on RWF, Hockey, and Movies datasets.
arXiv Detail & Related papers (2022-02-04T16:15:26Z) - Real Time Action Recognition from Video Footage [0.5219568203653523]
Video surveillance cameras have added a new dimension to detect crime.
This research focuses on integrating state-of-the-art Deep Learning methods to ensure a robust pipeline for autonomous surveillance for detecting violent activities.
arXiv Detail & Related papers (2021-12-13T07:27:41Z) - Overcomplete Representations Against Adversarial Videos [72.04912755926524]
We propose a novel Over-and-Under complete restoration network for Defending against adversarial videos (OUDefend)
OUDefend is designed to balance local and global features by learning those two representations.
Experimental results show that the defenses focusing on images may be ineffective to videos, while OUDefend enhances robustness against different types of adversarial videos.
arXiv Detail & Related papers (2020-12-08T08:00:17Z) - Robust Unsupervised Video Anomaly Detection by Multi-Path Frame
Prediction [61.17654438176999]
We propose a novel and robust unsupervised video anomaly detection method by frame prediction with proper design.
Our proposed method obtains the frame-level AUROC score of 88.3% on the CUHK Avenue dataset.
arXiv Detail & Related papers (2020-11-05T11:34:12Z) - MixNet for Generalized Face Presentation Attack Detection [63.35297510471997]
We have proposed a deep learning-based network termed as textitMixNet to detect presentation attacks.
The proposed algorithm utilizes state-of-the-art convolutional neural network architectures and learns the feature mapping for each attack category.
arXiv Detail & Related papers (2020-10-25T23:01:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.