Application-Driven AI Paradigm for Hand-Held Action Detection
- URL: http://arxiv.org/abs/2210.06682v1
- Date: Thu, 13 Oct 2022 02:30:23 GMT
- Title: Application-Driven AI Paradigm for Hand-Held Action Detection
- Authors: Kohou Wang, Zhaoxiang Liu and Shiguo Lian
- Abstract summary: We propose an application-driven AI paradigm for hand-held action detection based on hierarchical object detection.
The proposed framework achieve higher detection rate with good adaptation and robustness in complex environments.
- Score: 1.8531114735719274
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In practical applications especially with safety requirement, some hand-held
actions need to be monitored closely, including smoking cigarettes, dialing,
eating, etc. Taking smoking cigarettes as example, existing smoke detection
algorithms usually detect the cigarette or cigarette with hand as the target
object only, which leads to low accuracy. In this paper, we propose an
application-driven AI paradigm for hand-held action detection based on
hierarchical object detection. It is a coarse-to-fine hierarchical detection
framework composed of two modules. The first one is a coarse detection module
with the human pose consisting of the whole hand, cigarette and head as target
object. The followed second one is a fine detection module with the fingers
holding cigarette, mouth area and the whole cigarette as target. Some
experiments are done with the dataset collected from real-world scenarios, and
the results show that the proposed framework achieve higher detection rate with
good adaptation and robustness in complex environments.
Related papers
- Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
Autonomous Driving [91.39625612027386]
We propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.
Specifically, we analyze in-depth differences between images and point clouds, and then present a practical principle for the few-shot setting in the 3D LiDAR dataset.
To solve this task, we propose an incremental fine-tuning method to extend existing 3D detection models to recognize both common and rare objects.
arXiv Detail & Related papers (2023-02-08T07:11:36Z) - Distinctive Self-Similar Object Detection [0.8488455943441636]
We propose that the distinctive fractal feature of self-similar in fire and smoke can relieve us from struggling with their various shapes.
We design a semi-supervised method that use Hausdorff distance to describe the resemblance between instances.
Our experiments have been conducted on publicly available fire and smoke detection datasets.
arXiv Detail & Related papers (2022-11-20T15:15:57Z) - A Tri-Layer Plugin to Improve Occluded Detection [100.99802831241583]
We propose a simple '' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects.
The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object.
We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects.
arXiv Detail & Related papers (2022-10-18T17:59:51Z) - A Multi-task Framework for Infrared Small Target Detection and
Segmentation [9.033048310220346]
We propose a novel end-to-end framework for infrared small target detection and segmentation.
We use UNet as the backbone to maintain resolution and semantic information.
We develop a multi-task framework for infrared small target detection and segmentation.
arXiv Detail & Related papers (2022-06-14T15:43:34Z) - SHOP: A Deep Learning Based Pipeline for near Real-Time Detection of
Small Handheld Objects Present in Blurry Video [0.0]
We present SHOP (Small Handheld Object Pipeline), a pipeline that reliably interprets blurry images containing handheld objects.
The specific models used in each stage of the pipeline are flexible and can be changed based on performance requirements.
We also present a subset of MS COCO consisting solely of handheld objects that can be used to continue the development of handheld object detection methods.
arXiv Detail & Related papers (2022-03-29T04:31:30Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - A Bottom-up method Towards the Automatic and Objective Monitoring of
Smoking Behavior In-the-wild using Wrist-mounted Inertial Sensors [6.955421797534318]
Tobacco consumption has reached global epidemic proportions and is characterized as the leading cause of death and illness.
We present a two-step, bottom-up algorithm towards the automatic and objective monitoring of cigarette-based, smoking behavior during the day.
In the first step, our algorithm performs the detection of individual smoking gestures (i.e., puffs) using an artificial neural network with both convolutional and recurrent layers.
In the second step, we make use of the detected puff density to achieve the temporal localization of smoking sessions that occur throughout the day.
arXiv Detail & Related papers (2021-09-08T07:50:47Z) - Mining the Benefits of Two-stage and One-stage HOI Detection [26.919979955155664]
Two-stage methods have dominated Human-Object Interaction (HOI) detection for several years.
One-stage methods are challenging to make an appropriate trade-off on multi-task learning, i.e., object detection, and interaction classification.
We propose a novel one-stage framework with disentangling human-object detection and interaction classification in a cascade manner.
arXiv Detail & Related papers (2021-08-11T07:38:09Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.