Hybrid Classifiers for Spatio-temporal Real-time Abnormal Behaviors
Detection, Tracking, and Recognition in Massive Hajj Crowds
- URL: http://arxiv.org/abs/2207.11931v1
- Date: Mon, 25 Jul 2022 06:52:55 GMT
- Title: Hybrid Classifiers for Spatio-temporal Real-time Abnormal Behaviors
Detection, Tracking, and Recognition in Massive Hajj Crowds
- Authors: Tarik Alafif, Anas Hadi, Manal Allahyani, Bander Alzahrani, Areej
Alhothali, Reem Alotaibi, Ahmed Barnawi
- Abstract summary: We introduce an annotated and labeled large-scale crowd abnormal behaviors Hajj dataset (HAJJv2).
We propose two methods of hybrid Convolutional Neural Networks (CNNs) and Random Forests (RFs) to detect and recognize Spatio-temporal abnormal behaviors in small and large-scales crowd videos.
- Score: 1.8186887490616164
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Individual abnormal behaviors vary depending on crowd sizes, contexts, and
scenes. Challenges such as partial occlusions, blurring, large-number abnormal
behavior, and camera viewing occur in large-scale crowds when detecting,
tracking, and recognizing individuals with abnormal behaviors. In this paper,
our contribution is twofold. First, we introduce an annotated and labeled
large-scale crowd abnormal behaviors Hajj dataset (HAJJv2). Second, we propose
two methods of hybrid Convolutional Neural Networks (CNNs) and Random Forests
(RFs) to detect and recognize Spatio-temporal abnormal behaviors in small and
large-scales crowd videos. In small-scale crowd videos, a ResNet-50 pre-trained
CNN model is fine-tuned to verify whether every frame is normal or abnormal in
the spatial domain. If anomalous behaviors are observed, a motion-based
individuals detection method based on the magnitudes and orientations of
Horn-Schunck optical flow is used to locate and track individuals with abnormal
behaviors. A Kalman filter is employed in large-scale crowd videos to predict
and track the detected individuals in the subsequent frames. Then, means,
variances, and standard deviations statistical features are computed and fed to
the RF to classify individuals with abnormal behaviors in the temporal domain.
In large-scale crowds, we fine-tune the ResNet-50 model using YOLOv2 object
detection technique to detect individuals with abnormal behaviors in the
spatial domain.
Related papers
- Frequency-Guided Multi-Level Human Action Anomaly Detection with Normalizing Flows [13.77542608443034]
We introduce the task of human action anomaly detection (HAAD), which aims to identify anomalous motions in an unsupervised manner.
Compared to prior human-related anomaly detection tasks which primarily focus on unusual events from videos, HAAD involves the learning of specific action labels to recognize semantically anomalous human behaviors.
arXiv Detail & Related papers (2024-04-26T12:56:16Z) - Semi-Supervised Learning for Anomaly Traffic Detection via Bidirectional Normalizing Flows [47.4772981101262]
We consider the problem of anomaly network traffic detection and propose a three-stage anomaly detection framework using only normal traffic.
Our framework can generate pseudo anomaly samples without prior knowledge of anomalies to achieve the detection of anomaly data.
arXiv Detail & Related papers (2024-03-13T02:10:32Z) - Dynamic Erasing Network Based on Multi-Scale Temporal Features for
Weakly Supervised Video Anomaly Detection [103.92970668001277]
We propose a Dynamic Erasing Network (DE-Net) for weakly supervised video anomaly detection.
We first propose a multi-scale temporal modeling module, capable of extracting features from segments of varying lengths.
Then, we design a dynamic erasing strategy, which dynamically assesses the completeness of the detected anomalies.
arXiv Detail & Related papers (2023-12-04T09:40:11Z) - Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video
Anomaly Detection [46.8584162860564]
We propose a novel generative model for video anomaly detection (VAD)
We consider skeletal representations and leverage state-of-the-art diffusion probabilistic models to generate multimodal future human poses.
We validate our model on 4 established benchmarks.
arXiv Detail & Related papers (2023-07-14T07:42:45Z) - Crowd-level Abnormal Behavior Detection via Multi-scale Motion
Consistency Learning [18.412026838387806]
Crowd-level abnormal behaviors (CABs) are proven to be the crucial causes of many crowd disasters.
We present a novel crowd motion learning framework, multi-scale motion consistency network (MSMC-Net)
MSMC-Net first captures the spatial and temporal crowd motion consistency information in a graph representation.
Then, it simultaneously trains multiple feature graphs constructed at different scales to capture rich crowd patterns.
arXiv Detail & Related papers (2022-12-01T13:52:32Z) - Audio-visual Representation Learning for Anomaly Events Detection in
Crowds [119.72951028190586]
This paper attempts to exploit multi-modal learning for modeling the audio and visual signals simultaneously.
We conduct the experiments on SHADE dataset, a synthetic audio-visual dataset in surveillance scenes.
We find introducing audio signals effectively improves the performance of anomaly events detection and outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2021-10-28T02:42:48Z) - Using a Neural Network to Detect Anomalies given an N-gram Profile [0.0]
Anomaly detection is designed to profile the normal runtime behavior of computer programs.
Normal but unobserved behavior can trigger false positives.
This paper presents our study on how to explain the presence of anomalies using a neural network.
arXiv Detail & Related papers (2021-04-12T15:40:43Z) - Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit
Latent Features [8.407188666535506]
Most existing methods use an autoencoder to learn to reconstruct normal videos.
We propose an implicit two-path AE (ITAE), a structure in which two encoders implicitly model appearance and motion features.
For the complex distribution of normal scenes, we suggest normal density estimation of ITAE features.
NF models intensify ITAE performance by learning normality through implicitly learned features.
arXiv Detail & Related papers (2020-10-15T05:02:02Z) - A Background-Agnostic Framework with Adversarial Training for Abnormal
Event Detection in Video [120.18562044084678]
Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years.
We propose a background-agnostic framework that learns from training videos containing only normal events.
arXiv Detail & Related papers (2020-08-27T18:39:24Z) - Self-trained Deep Ordinal Regression for End-to-End Video Anomaly
Detection [114.9714355807607]
We show that applying self-trained deep ordinal regression to video anomaly detection overcomes two key limitations of existing methods.
We devise an end-to-end trainable video anomaly detection approach that enables joint representation learning and anomaly scoring without manually labeled normal/abnormal data.
arXiv Detail & Related papers (2020-03-15T08:44:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.