Single Run Action Detector over Video Stream -- A Privacy Preserving
Approach
- URL: http://arxiv.org/abs/2102.03391v1
- Date: Fri, 5 Feb 2021 19:27:38 GMT
- Title: Single Run Action Detector over Video Stream -- A Privacy Preserving
Approach
- Authors: Anbumalar Saravanan, Justin Sanchez, Hassan Ghasemzadeh, Aurelia
Macabasco-O'Connell and Hamed Tabkhi
- Abstract summary: This paper presents Single Run Action Detector (S-RAD) which is a real-time privacy-preserving action detector.
Results on UCF-Sports and UR Fall dataset present comparable accuracy to State-of-the-Art approaches.
- Score: 13.247009439182769
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This paper takes initial strides at designing and evaluating a vision-based
system for privacy ensured activity monitoring. The proposed technology
utilizing Artificial Intelligence (AI)-empowered proactive systems offering
continuous monitoring, behavioral analysis, and modeling of human activities.
To this end, this paper presents Single Run Action Detector (S-RAD) which is a
real-time privacy-preserving action detector that performs end-to-end action
localization and classification. It is based on Faster-RCNN combined with
temporal shift modeling and segment based sampling to capture the human
actions. Results on UCF-Sports and UR Fall dataset present comparable accuracy
to State-of-the-Art approaches with significantly lower model size and
computation demand and the ability for real-time execution on edge embedded
device (e.g. Nvidia Jetson Xavier).
Related papers
- Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection [4.938957922033169]
Out-of-distribution (OOD) detection targets to detect and reject test samples with semantic shifts.
We propose a novel Uncertainty-Guided Appearance-Motion Association Network (UAAN)
We show that UAAN beats state-of-the-art methods by a significant margin, illustrating its effectiveness.
arXiv Detail & Related papers (2024-09-16T02:53:49Z) - Implicit Occupancy Flow Fields for Perception and Prediction in
Self-Driving [68.95178518732965]
A self-driving vehicle (SDV) must be able to perceive its surroundings and predict the future behavior of other traffic participants.
Existing works either perform object detection followed by trajectory of the detected objects, or predict dense occupancy and flow grids for the whole scene.
This motivates our unified approach to perception and future prediction that implicitly represents occupancy and flow over time with a single neural network.
arXiv Detail & Related papers (2023-08-02T23:39:24Z) - H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding
Object Articulations from Interactions [62.510951695174604]
"Hypothesize, Simulate, Act, Update, and Repeat" (H-SAUR) is a probabilistic generative framework that generates hypotheses about how objects articulate given input observations.
We show that the proposed model significantly outperforms the current state-of-the-art articulated object manipulation framework.
We further improve the test-time efficiency of H-SAUR by integrating a learned prior from learning-based vision models.
arXiv Detail & Related papers (2022-10-22T18:39:33Z) - E^2TAD: An Energy-Efficient Tracking-based Action Detector [78.90585878925545]
This paper presents a tracking-based solution to accurately and efficiently localize predefined key actions.
It won first place in the UAV-Video Track of 2021 Low-Power Computer Vision Challenge (LPCVC)
arXiv Detail & Related papers (2022-04-09T07:52:11Z) - Cut and Continuous Paste towards Real-time Deep Fall Detection [12.15584530151789]
We propose a simple and efficient framework to detect falls through a single and small-sized convolutional neural network.
We first introduce a new image synthesis method that represents human motion in a single frame.
At the inference step, we also represent real human motion in a single image by estimating mean of input frames.
arXiv Detail & Related papers (2022-02-22T06:07:16Z) - Argus++: Robust Real-time Activity Detection for Unconstrained Video
Streams with Overlapping Cube Proposals [85.76513755331318]
Argus++ is a robust real-time activity detection system for analyzing unconstrained video streams.
The overall system is optimized for real-time processing on standalone consumer-level hardware.
arXiv Detail & Related papers (2022-01-14T03:35:22Z) - Improving Variational Autoencoder based Out-of-Distribution Detection
for Embedded Real-time Applications [2.9327503320877457]
Out-of-distribution (OD) detection is an emerging approach to address the challenge of detecting out-of-distribution in real-time.
In this paper, we show how we can robustly detect hazardous motion around autonomous driving agents.
Our methods significantly improve detection capabilities of OoD factors to unique driving scenarios, 42% better than state-of-the-art approaches.
Our model also generalized near-perfectly, 97% better than the state-of-the-art across the real-world and simulation driving data sets experimented.
arXiv Detail & Related papers (2021-07-25T07:52:53Z) - TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild [77.59069361196404]
TRiPOD is a novel method for predicting body dynamics based on graph attentional networks.
To incorporate a real-world challenge, we learn an indicator representing whether an estimated body joint is visible/invisible at each frame.
Our evaluation shows that TRiPOD outperforms all prior work and state-of-the-art specifically designed for each of the trajectory and pose forecasting tasks.
arXiv Detail & Related papers (2021-04-08T20:01:00Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z) - Weighted Average Precision: Adversarial Example Detection in the Visual
Perception of Autonomous Vehicles [10.72357267154474]
We propose a novel distance metric for practical autonomous driving object detection outputs.
We show how our approach outperforms existing single-frame-mAP based AE detections by increasing 17.76% accuracy of performance.
arXiv Detail & Related papers (2020-01-25T23:59:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.