Crowd-level Abnormal Behavior Detection via Multi-scale Motion
Consistency Learning
- URL: http://arxiv.org/abs/2212.00501v1
- Date: Thu, 1 Dec 2022 13:52:32 GMT
- Title: Crowd-level Abnormal Behavior Detection via Multi-scale Motion
Consistency Learning
- Authors: Linbo Luo, Yuanjing Li, Haiyan Yin, Shangwei Xie, Ruimin Hu, Wentong
Cai
- Abstract summary: Crowd-level abnormal behaviors (CABs) are proven to be the crucial causes of many crowd disasters.
We present a novel crowd motion learning framework, multi-scale motion consistency network (MSMC-Net)
MSMC-Net first captures the spatial and temporal crowd motion consistency information in a graph representation.
Then, it simultaneously trains multiple feature graphs constructed at different scales to capture rich crowd patterns.
- Score: 18.412026838387806
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Detecting abnormal crowd motion emerging from complex interactions of
individuals is paramount to ensure the safety of crowds. Crowd-level abnormal
behaviors (CABs), e.g., counter flow and crowd turbulence, are proven to be the
crucial causes of many crowd disasters. In the recent decade, video anomaly
detection (VAD) techniques have achieved remarkable success in detecting
individual-level abnormal behaviors (e.g., sudden running, fighting and
stealing), but research on VAD for CABs is rather limited. Unlike
individual-level anomaly, CABs usually do not exhibit salient difference from
the normal behaviors when observed locally, and the scale of CABs could vary
from one scenario to another. In this paper, we present a systematic study to
tackle the important problem of VAD for CABs with a novel crowd motion learning
framework, multi-scale motion consistency network (MSMC-Net). MSMC-Net first
captures the spatial and temporal crowd motion consistency information in a
graph representation. Then, it simultaneously trains multiple feature graphs
constructed at different scales to capture rich crowd patterns. An attention
network is used to adaptively fuse the multi-scale features for better CAB
detection. For the empirical study, we consider three large-scale crowd event
datasets, UMN, Hajj and Love Parade. Experimental results show that MSMC-Net
could substantially improve the state-of-the-art performance on all the
datasets.
Related papers
- Continuous Memory Representation for Anomaly Detection [24.58611060347548]
CRAD is a novel anomaly detection method for representing normal features within a "continuous" memory.
In an evaluation using the MVTec AD dataset, CRAD significantly outperforms the previous state-of-the-art method by reducing 65.0% of the error for multi-class unified anomaly detection.
arXiv Detail & Related papers (2024-02-28T12:38:44Z) - Dynamic Erasing Network Based on Multi-Scale Temporal Features for
Weakly Supervised Video Anomaly Detection [103.92970668001277]
We propose a Dynamic Erasing Network (DE-Net) for weakly supervised video anomaly detection.
We first propose a multi-scale temporal modeling module, capable of extracting features from segments of varying lengths.
Then, we design a dynamic erasing strategy, which dynamically assesses the completeness of the detected anomalies.
arXiv Detail & Related papers (2023-12-04T09:40:11Z) - Improving Vision Anomaly Detection with the Guidance of Language
Modality [64.53005837237754]
This paper tackles the challenges for vision modality from a multimodal point of view.
We propose Cross-modal Guidance (CMG) to tackle the redundant information issue and sparse space issue.
To learn a more compact latent space for the vision anomaly detector, CMLE learns a correlation structure matrix from the language modality.
arXiv Detail & Related papers (2023-10-04T13:44:56Z) - Abnormal Event Detection via Hypergraph Contrastive Learning [54.80429341415227]
Abnormal event detection plays an important role in many real applications.
In this paper, we study the unsupervised abnormal event detection problem in Attributed Heterogeneous Information Network.
A novel hypergraph contrastive learning method, named AEHCL, is proposed to fully capture abnormal event patterns.
arXiv Detail & Related papers (2023-04-02T08:23:20Z) - Towards Continual Egocentric Activity Recognition: A Multi-modal
Egocentric Activity Dataset for Continual Learning [21.68009790164824]
We present a multi-modal egocentric activity dataset for continual learning named UESTC-MMEA-CL.
It contains synchronized data of videos, accelerometers, and gyroscopes, for 32 types of daily activities, performed by 10 participants.
Results of egocentric activity recognition are reported when using separately, and jointly, three modalities: RGB, acceleration, and gyroscope.
arXiv Detail & Related papers (2023-01-26T04:32:00Z) - Coarse-to-Fine Knowledge-Enhanced Multi-Interest Learning Framework for
Multi-Behavior Recommendation [52.89816309759537]
Multi-types of behaviors (e.g., clicking, adding to cart, purchasing, etc.) widely exist in most real-world recommendation scenarios.
The state-of-the-art multi-behavior models learn behavior dependencies indistinguishably with all historical interactions as input.
We propose a novel Coarse-to-fine Knowledge-enhanced Multi-interest Learning framework to learn shared and behavior-specific interests for different behaviors.
arXiv Detail & Related papers (2022-08-03T05:28:14Z) - Hybrid Classifiers for Spatio-temporal Real-time Abnormal Behaviors
Detection, Tracking, and Recognition in Massive Hajj Crowds [1.8186887490616164]
We introduce an annotated and labeled large-scale crowd abnormal behaviors Hajj dataset (HAJJv2).
We propose two methods of hybrid Convolutional Neural Networks (CNNs) and Random Forests (RFs) to detect and recognize Spatio-temporal abnormal behaviors in small and large-scales crowd videos.
arXiv Detail & Related papers (2022-07-25T06:52:55Z) - Federated Variational Learning for Anomaly Detection in Multivariate
Time Series [13.328883578980237]
We propose an unsupervised time series anomaly detection framework in a federated fashion.
We leave the training data distributed at the edge to learn a shared Variational Autoencoder (VAE) based on Convolutional Gated Recurrent Unit (ConvGRU) model.
Experiments on three real-world networked sensor datasets illustrate the advantage of our approach over other state-of-the-art models.
arXiv Detail & Related papers (2021-08-18T22:23:15Z) - F-FADE: Frequency Factorization for Anomaly Detection in Edge Streams [53.70940420595329]
We propose F-FADE, a new approach for detection of anomalies in edge streams.
It uses a novel frequency-factorization technique to efficiently model the time-evolving distributions of frequencies of interactions between node-pairs.
F-FADE is able to handle in an online streaming setting a broad variety of anomalies with temporal and structural changes, while requiring only constant memory.
arXiv Detail & Related papers (2020-11-09T19:55:40Z) - DIRV: Dense Interaction Region Voting for End-to-End Human-Object
Interaction Detection [53.40028068801092]
We propose a novel one-stage HOI detection approach based on a new concept called interaction region for the HOI problem.
Unlike previous methods, our approach concentrates on the densely sampled interaction regions across different scales for each human-object pair.
In order to compensate for the detection flaws of a single interaction region, we introduce a novel voting strategy.
arXiv Detail & Related papers (2020-10-02T13:57:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.