Preventing Errors in Person Detection: A Part-Based Self-Monitoring
Framework
- URL: http://arxiv.org/abs/2307.04533v1
- Date: Mon, 10 Jul 2023 12:59:30 GMT
- Title: Preventing Errors in Person Detection: A Part-Based Self-Monitoring
Framework
- Authors: Franziska Schwaiger, Andrea Matic, Karsten Roscher, Stephan
G\"unnemann
- Abstract summary: We propose a self-monitoring framework that allows for the perception system to perform plausibility checks at runtime.
We show that by incorporating an additional component for detecting human body parts, we are able to significantly reduce the number of missed human detections.
- Score: 7.6849475214826315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The ability to detect learned objects regardless of their appearance is
crucial for autonomous systems in real-world applications. Especially for
detecting humans, which is often a fundamental task in safety-critical
applications, it is vital to prevent errors. To address this challenge, we
propose a self-monitoring framework that allows for the perception system to
perform plausibility checks at runtime. We show that by incorporating an
additional component for detecting human body parts, we are able to
significantly reduce the number of missed human detections by factors of up to
9 when compared to a baseline setup, which was trained only on holistic person
objects. Additionally, we found that training a model jointly on humans and
their body parts leads to a substantial reduction in false positive detections
by up to 50% compared to training on humans alone. We performed comprehensive
experiments on the publicly available datasets DensePose and Pascal VOC in
order to demonstrate the effectiveness of our framework. Code is available at
https://github.com/ FraunhoferIKS/smf-object-detection.
Related papers
- Uncertainty Estimation for 3D Object Detection via Evidential Learning [63.61283174146648]
We introduce a framework for quantifying uncertainty in 3D object detection by leveraging an evidential learning loss on Bird's Eye View representations in the 3D detector.
We demonstrate both the efficacy and importance of these uncertainty estimates on identifying out-of-distribution scenes, poorly localized objects, and missing (false negative) detections.
arXiv Detail & Related papers (2024-10-31T13:13:32Z) - Private Data Leakage in Federated Human Activity Recognition for Wearable Healthcare Devices [6.422056036165425]
We investigate privacy leakage issues within federated user behavior recognition modeling across multiple wearable devices.
Our proposed system entails a federated learning architecture comprising $N$ wearable device users and a parameter server.
Experimentation conducted on five publicly available HAR datasets demonstrates an accuracy rate of 92% for malicious server-based membership inference.
arXiv Detail & Related papers (2024-05-14T07:43:22Z) - A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images [0.0]
We propose a method that alleviates the need for manual labeling of laboratory rats.
The final system is capable of instance segmentation, keypoint detection, and body part segmentation even when the objects are heavily occluded.
arXiv Detail & Related papers (2024-05-07T20:11:07Z) - Improving Multi-Person Pose Tracking with A Confidence Network [37.84514614455588]
We develop a novel keypoint confidence network and a tracking pipeline to improve human detection and pose estimation.
Specifically, the keypoint confidence network is designed to determine whether each keypoint is occluded.
In the tracking pipeline, we propose the Bbox-revision module to reduce missing detection and the ID-retrieve module to correct lost trajectories.
arXiv Detail & Related papers (2023-10-29T06:36:27Z) - FROD: Robust Object Detection for Free [1.8139771201780368]
State-of-the-art object detectors are susceptible to small adversarial perturbations.
We propose modifications to the classification-based backbone to instill robustness in object detection.
arXiv Detail & Related papers (2023-08-03T17:31:22Z) - Affect-Aware Deep Belief Network Representations for Multimodal
Unsupervised Deception Detection [3.04585143845864]
unsupervised approach for detecting real-world, high-stakes deception in videos without requiring labels.
This paper presents our novel approach for affect-aware unsupervised Deep Belief Networks (DBN)
In addition to using facial affect as a feature on which DBN models are trained, we also introduce a DBN training procedure that uses facial affect as an aligner of audio-visual representations.
arXiv Detail & Related papers (2021-08-17T22:07:26Z) - TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild [77.59069361196404]
TRiPOD is a novel method for predicting body dynamics based on graph attentional networks.
To incorporate a real-world challenge, we learn an indicator representing whether an estimated body joint is visible/invisible at each frame.
Our evaluation shows that TRiPOD outperforms all prior work and state-of-the-art specifically designed for each of the trajectory and pose forecasting tasks.
arXiv Detail & Related papers (2021-04-08T20:01:00Z) - Detecting Human-Object Interaction via Fabricated Compositional Learning [106.37536031160282]
Human-Object Interaction (HOI) detection is a fundamental task for high-level scene understanding.
Human has extremely powerful compositional perception ability to cognize rare or unseen HOI samples.
We propose Fabricated Compositional Learning (FCL) to address the problem of open long-tailed HOI detection.
arXiv Detail & Related papers (2021-03-15T08:52:56Z) - Detecting Invisible People [58.49425715635312]
We re-purpose tracking benchmarks and propose new metrics for the task of detecting invisible objects.
We demonstrate that current detection and tracking systems perform dramatically worse on this task.
Second, we build dynamic models that explicitly reason in 3D, making use of observations produced by state-of-the-art monocular depth estimation networks.
arXiv Detail & Related papers (2020-12-15T16:54:45Z) - Perceiving Humans: from Monocular 3D Localization to Social Distancing [93.03056743850141]
We present a new cost-effective vision-based method that perceives humans' locations in 3D and their body orientation from a single image.
We show that it is possible to rethink the concept of "social distancing" as a form of social interaction in contrast to a simple location-based rule.
arXiv Detail & Related papers (2020-09-01T10:12:30Z) - Firearm Detection and Segmentation Using an Ensemble of Semantic Neural
Networks [62.997667081978825]
We present a weapon detection system based on an ensemble of semantic Convolutional Neural Networks.
A set of simpler neural networks dedicated to specific tasks requires less computational resources and can be trained in parallel.
The overall output of the system given by the aggregation of the outputs of individual networks can be tuned by a user to trade-off false positives and false negatives.
arXiv Detail & Related papers (2020-02-11T13:58:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.