Shopformer: Transformer-Based Framework for Detecting Shoplifting via Human Pose
- URL: http://arxiv.org/abs/2504.19970v1
- Date: Mon, 28 Apr 2025 16:43:01 GMT
- Title: Shopformer: Transformer-Based Framework for Detecting Shoplifting via Human Pose
- Authors: Narges Rashvand, Ghazal Alinezhad Noghre, Armin Danesh Pazho, Babak Rahimi Ardabili, Hamed Tabkhi,
- Abstract summary: Shoplifting remains a costly issue for the retail sector, with only about 2% of shoplifters being arrested.<n>Existing AI-based approaches rely on pixel-level video analysis which raises privacy concerns.<n>We introduce Shopformer, a novel transformer-based model that detects shoplifting by analyzing pose sequences.
- Score: 1.8802008255570537
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Shoplifting remains a costly issue for the retail sector, but traditional surveillance systems, which are mostly based on human monitoring, are still largely ineffective, with only about 2% of shoplifters being arrested. Existing AI-based approaches rely on pixel-level video analysis which raises privacy concerns, is sensitive to environmental variations, and demands significant computational resources. To address these limitations, we introduce Shopformer, a novel transformer-based model that detects shoplifting by analyzing pose sequences rather than raw video. We propose a custom tokenization strategy that converts pose sequences into compact embeddings for efficient transformer processing. To the best of our knowledge, this is the first pose-sequence-based transformer model for shoplifting detection. Evaluated on real-world pose data, our method outperforms state-of-the-art anomaly detection models, offering a privacy-preserving, and scalable solution for real-time retail surveillance. The code base for this work is available at https://github.com/TeCSAR-UNCC/Shopformer.
Related papers
- PersGuard: Preventing Malicious Personalization via Backdoor Attacks on Pre-trained Text-to-Image Diffusion Models [51.458089902581456]
We introduce PersGuard, a novel backdoor-based approach that prevents malicious personalization of specific images.<n>Our method significantly outperforms existing techniques, offering a more robust solution for privacy and copyright protection.
arXiv Detail & Related papers (2025-02-22T09:47:55Z) - Exploring Pose-Based Anomaly Detection for Retail Security: A Real-World Shoplifting Dataset and Benchmark [1.8802008255570537]
Shoplifting poses a significant challenge for retailers, resulting in billions of dollars in annual losses.<n>This paper frames shoplifting detection as an anomaly detection problem, focusing on the identification of deviations from typical shopping patterns.<n>We introduce PoseLift, a privacy-preserving dataset specifically designed for shoplifting detection.
arXiv Detail & Related papers (2025-01-11T17:19:53Z) - A Lean Transformer Model for Dynamic Malware Analysis and Detection [0.0]
Malware is a fast-growing threat to the modern computing world and existing lines of defense are not efficient enough to address this issue.
Previous works have shown some success leveraging Neural Networks and API calls sequences extracted from execution reports.
In this paper, we design an emulation-Only model, based on the Transformers architecture, to detect malicious files.
arXiv Detail & Related papers (2024-08-05T08:46:46Z) - Concept-based Anomaly Detection in Retail Stores for Automatic
Correction using Mobile Robots [3.989104441591223]
Co-AD is a Concept-based Anomaly Detection approach using a Vision Transformer (ViT)
It is able to flag misplaced objects without using a prior knowledge base such as a planogram.
It has a peak success rate of 89.90% on anomaly detection image sets of retail objects.
arXiv Detail & Related papers (2023-10-21T16:49:23Z) - Exploring Decision-based Black-box Attacks on Face Forgery Detection [53.181920529225906]
Face forgery generation technologies generate vivid faces, which have raised public concerns about security and privacy.
Although face forgery detection has successfully distinguished fake faces, recent studies have demonstrated that face forgery detectors are very vulnerable to adversarial examples.
arXiv Detail & Related papers (2023-10-18T14:49:54Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - Initiative Defense against Facial Manipulation [82.96864888025797]
We propose a novel framework of initiative defense to degrade the performance of facial manipulation models controlled by malicious users.
We first imitate the target manipulation model with a surrogate model, and then devise a poison perturbation generator to obtain the desired venom.
arXiv Detail & Related papers (2021-12-19T09:42:28Z) - TransForensics: Image Forgery Localization with Dense Self-Attention [37.2172540238706]
We introduce TransForensics, a novel image forgery localization method inspired by Transformers.
The two major components in our framework are dense self-attention encoders and dense correction modules.
By conducting experiments on main benchmarks, we show that TransForensics outperforms the stateof-the-art methods by a large margin.
arXiv Detail & Related papers (2021-08-09T08:43:26Z) - End-to-End Trainable Multi-Instance Pose Estimation with Transformers [68.93512627479197]
We propose a new end-to-end trainable approach for multi-instance pose estimation by combining a convolutional neural network with a transformer.
Inspired by recent work on end-to-end trainable object detection with transformers, we use a transformer encoder-decoder architecture together with a bipartite matching scheme to directly regress the pose of all individuals in a given image.
Our model, called POse Estimation Transformer (POET), is trained using a novel set-based global loss that consists of a keypoint loss, a keypoint visibility loss, a center loss and a class loss.
arXiv Detail & Related papers (2021-03-22T18:19:22Z) - VideoForensicsHQ: Detecting High-quality Manipulated Face Videos [77.60295082172098]
We show how the performance of forgery detectors depends on the presence of artefacts that the human eye can see.
We introduce a new benchmark dataset for face video forgery detection, of unprecedented quality.
arXiv Detail & Related papers (2020-05-20T21:17:43Z) - Suspicious Behavior Detection on Shoplifting Cases for Crime Prevention
by Using 3D Convolutional Neural Networks [0.0]
We implement a 3DCNN model as a video feature extractor and tested its performance on a dataset composed of daily-action and shoplifting samples.
The results are encouraging since it correctly identifies 75% of the cases where a crime is about to happen.
arXiv Detail & Related papers (2020-04-30T22:06:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.