In-Bed Pose Estimation: A Review
- URL: http://arxiv.org/abs/2402.00700v1
- Date: Thu, 1 Feb 2024 15:57:11 GMT
- Title: In-Bed Pose Estimation: A Review
- Authors: Ziya Ata Yaz{\i}c{\i}, Sara Colantonio, Haz{\i}m Kemal Ekenel
- Abstract summary: In-bed pose estimation can be used to monitor a person's sleep behavior and detect symptoms early for potential disease diagnosis.
Several studies have utilized unimodal and multimodal methods to estimate in-bed human poses.
Our objectives are to show the limitations of the previous studies, current challenges, and provide insights for future works on the in-bed human pose estimation field.
- Score: 8.707107668375906
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Human pose estimation, the process of identifying joint positions in a
person's body from images or videos, represents a widely utilized technology
across diverse fields, including healthcare. One such healthcare application
involves in-bed pose estimation, where the body pose of an individual lying
under a blanket is analyzed. This task, for instance, can be used to monitor a
person's sleep behavior and detect symptoms early for potential disease
diagnosis in homes and hospitals. Several studies have utilized unimodal and
multimodal methods to estimate in-bed human poses. The unimodal studies
generally employ RGB images, whereas the multimodal studies use modalities
including RGB, long-wavelength infrared, pressure map, and depth map.
Multimodal studies have the advantage of using modalities in addition to RGB
that might capture information useful to cope with occlusions. Moreover, some
multimodal studies exclude RGB and, this way, better suit privacy preservation.
To expedite advancements in this domain, we conduct a review of existing
datasets and approaches. Our objectives are to show the limitations of the
previous studies, current challenges, and provide insights for future works on
the in-bed human pose estimation field.
Related papers
- A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis [48.84443450990355]
Deep networks have achieved broad success in analyzing natural images, when applied to medical scans, they often fail in unexcepted situations.
We investigate this challenge and focus on model sensitivity to domain shifts, such as data sampled from different hospitals or data confounded by demographic variables such as sex, race, etc, in the context of chest X-rays and skin lesion images.
Taking inspiration from medical training, we propose giving deep networks a prior grounded in explicit medical knowledge communicated in natural language.
arXiv Detail & Related papers (2024-05-23T17:55:02Z) - Data Augmentation in Human-Centric Vision [54.97327269866757]
This survey presents a comprehensive analysis of data augmentation techniques in human-centric vision tasks.
It delves into a wide range of research areas including person ReID, human parsing, human pose estimation, and pedestrian detection.
Our work categorizes data augmentation methods into two main types: data generation and data perturbation.
arXiv Detail & Related papers (2024-03-13T16:05:18Z) - Deep learning for 3D human pose estimation and mesh recovery: A survey [6.535833206786788]
We present a review of recent progress over the past five years in deep learning methods for 3D human pose estimation.
To the best of our knowledge, this survey is arguably the first to comprehensively cover deep learning methods for 3D human pose estimation.
arXiv Detail & Related papers (2024-02-29T04:30:39Z) - Privacy-Preserving In-Bed Pose Monitoring: A Fusion and Reconstruction
Study [9.474452908573111]
We explore the effective use of images from multiple non-visual and privacy-preserving modalities for the task of in-bed pose estimation.
First, we explore the effective fusion of information from different imaging modalities for better pose estimation.
Secondly, we propose a framework that can estimate in-bed pose estimation when visible images are unavailable.
arXiv Detail & Related papers (2022-02-22T07:24:21Z) - In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image
Domains [22.92165116962952]
In-bed human posture estimation provides important health-related metrics with potential value in medical condition assessments.
We propose a multi-modal conditional variational autoencoder (MC-VAE) capable of reconstructing features from missing modalities seen during training.
We demonstrate that body positions can be effectively recognized from the available modality, achieving on par results with baseline models.
arXiv Detail & Related papers (2021-11-30T04:56:16Z) - Towards Accurate Cross-Domain In-Bed Human Pose Estimation [3.685548851716087]
Long-wavelength infrared (LWIR) modality based pose estimation algorithms overcome the aforementioned challenges.
We propose a novel learning strategy comprises of two-fold data augmentation to reduce the cross-domain discrepancy.
Our experiments and analysis show the effectiveness of our approach over multiple standard human pose estimation baselines.
arXiv Detail & Related papers (2021-10-07T15:54:46Z) - Deep Co-Attention Network for Multi-View Subspace Learning [73.3450258002607]
We propose a deep co-attention network for multi-view subspace learning.
It aims to extract both the common information and the complementary information in an adversarial setting.
In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation.
arXiv Detail & Related papers (2021-02-15T18:46:44Z) - Deep Learning-Based Human Pose Estimation: A Survey [66.01917727294163]
Human pose estimation has drawn increasing attention during the past decade.
It has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality.
Recent deep learning-based solutions have achieved high performance in human pose estimation.
arXiv Detail & Related papers (2020-12-24T18:49:06Z) - Multimodal In-bed Pose and Shape Estimation under the Blankets [77.12439296395733]
We propose a pyramid scheme to fuse different modalities in a way that best leverages the knowledge captured by the multimodal sensors.
We employ an attention-based reconstruction module to generate uncovered modalities, which are further fused to update current estimation.
arXiv Detail & Related papers (2020-12-12T05:35:23Z) - Multi-view Human Pose and Shape Estimation Using Learnable Volumetric
Aggregation [0.0]
We propose a learnable aggregation approach to reconstruct 3D human body pose and shape from calibrated multi-view images.
Compared to previous approaches, our framework shows higher accuracy and greater promise for real-time prediction, given its cost efficiency.
arXiv Detail & Related papers (2020-11-26T18:33:35Z) - RGB-D Salient Object Detection: A Survey [195.83586883670358]
We provide a comprehensive survey of RGB-D based SOD models from various perspectives.
We also review SOD models and popular benchmark datasets from this domain.
We discuss several challenges and open directions of RGB-D based SOD for future research.
arXiv Detail & Related papers (2020-08-01T10:01:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.