Privacy-Preserving In-Bed Pose Monitoring: A Fusion and Reconstruction
Study
- URL: http://arxiv.org/abs/2202.10704v1
- Date: Tue, 22 Feb 2022 07:24:21 GMT
- Title: Privacy-Preserving In-Bed Pose Monitoring: A Fusion and Reconstruction
Study
- Authors: Thisun Dayarathna, Thamidu Muthukumarana, Yasiru Rathnayaka, Simon
Denman, Chathura de Silva, Akila Pemasiri, David Ahmedt-Aristizabal
- Abstract summary: We explore the effective use of images from multiple non-visual and privacy-preserving modalities for the task of in-bed pose estimation.
First, we explore the effective fusion of information from different imaging modalities for better pose estimation.
Secondly, we propose a framework that can estimate in-bed pose estimation when visible images are unavailable.
- Score: 9.474452908573111
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, in-bed human pose estimation has attracted the interest of
researchers due to its relevance to a wide range of healthcare applications.
Compared to the general problem of human pose estimation, in-bed pose
estimation has several inherent challenges, the most prominent being frequent
and severe occlusions caused by bedding. In this paper we explore the effective
use of images from multiple non-visual and privacy-preserving modalities such
as depth, long-wave infrared (LWIR) and pressure maps for the task of in-bed
pose estimation in two settings. First, we explore the effective fusion of
information from different imaging modalities for better pose estimation.
Secondly, we propose a framework that can estimate in-bed pose estimation when
visible images are unavailable, and demonstrate the applicability of fusion
methods to scenarios where only LWIR images are available. We analyze and
demonstrate the effect of fusing features from multiple modalities. For this
purpose, we consider four different techniques: 1) Addition, 2) Concatenation,
3) Fusion via learned modal weights, and 4) End-to-end fully trainable
approach; with a state-of-the-art pose estimation model. We also evaluate the
effect of reconstructing a data-rich modality (i.e., visible modality) from a
privacy-preserving modality with data scarcity (i.e., long-wavelength infrared)
for in-bed human pose estimation. For reconstruction, we use a conditional
generative adversarial network. We conduct ablative studies across different
design decisions of our framework. This includes selecting features with
different levels of granularity, using different fusion techniques, and varying
model parameters. Through extensive evaluations, we demonstrate that our method
produces on par or better results compared to the state-of-the-art.
Related papers
- A Simple Strategy for Body Estimation from Partial-View Images [8.05538560322898]
Virtual try-on and product personalization have become increasingly important in modern online shopping, highlighting the need for accurate body measurement estimation.
Previous research has advanced in estimating 3D body shapes from RGB images, but the task is inherently ambiguous as the observed scale of human subjects in the images depends on two unknown factors: capture distance and body dimensions.
We propose a modular and simple height normalization solution, which relocates the subject skeleton to the desired position, normalizing the scale and disentangling the relationship between the two variables.
arXiv Detail & Related papers (2024-04-14T16:55:23Z) - DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses [59.51874686414509]
Current approaches approximate the continuous pose representation with a large number of discrete pose hypotheses.
We present a Deep Voxel Matching Network (DVMNet) that eliminates the need for pose hypotheses and computes the relative object pose in a single pass.
Our method delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods.
arXiv Detail & Related papers (2024-03-20T15:41:32Z) - DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose
Estimation [16.32910684198013]
We present DiffPose, a novel diffusion architecture that formulates video-based human pose estimation as a conditional heatmap generation problem.
We show two unique characteristics from DiffPose on pose estimation task: (i) the ability to combine multiple sets of pose estimates to improve prediction accuracy, particularly for challenging joints, and (ii) the ability to adjust the number of iterative steps for feature refinement without retraining the model.
arXiv Detail & Related papers (2023-07-31T14:00:23Z) - Two Approaches to Supervised Image Segmentation [55.616364225463066]
The present work develops comparison experiments between deep learning and multiset neurons approaches.
The deep learning approach confirmed its potential for performing image segmentation.
The alternative multiset methodology allowed for enhanced accuracy while requiring little computational resources.
arXiv Detail & Related papers (2023-07-19T16:42:52Z) - A survey of top-down approaches for human pose estimation [0.0]
State-of-the-art methods implemented with Deep Learning have brought remarkable results in the field of human pose estimation.
This paper aims to provide newcomers with an extensive review of deep learning methods-based 2D images for recognizing the pose of people.
arXiv Detail & Related papers (2022-02-05T23:27:46Z) - Deep Co-Attention Network for Multi-View Subspace Learning [73.3450258002607]
We propose a deep co-attention network for multi-view subspace learning.
It aims to extract both the common information and the complementary information in an adversarial setting.
In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation.
arXiv Detail & Related papers (2021-02-15T18:46:44Z) - Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose
Estimation [74.76155168705975]
Deep Bingham Networks (DBN) can handle pose-related uncertainties and ambiguities arising in almost all real life applications concerning 3D data.
DBN extends the state of the art direct pose regression networks by (i) a multi-hypotheses prediction head which can yield different distribution modes.
We propose new training strategies so as to avoid mode or posterior collapse during training and to improve numerical stability.
arXiv Detail & Related papers (2020-12-20T19:20:26Z) - AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in
the Wild [77.43884383743872]
We present AdaFuse, an adaptive multiview fusion method to enhance the features in occluded views.
We extensively evaluate the approach on three public datasets including Human3.6M, Total Capture and CMU Panoptic.
We also create a large scale synthetic dataset Occlusion-Person, which allows us to perform numerical evaluation on the occluded joints.
arXiv Detail & Related papers (2020-10-26T03:19:46Z) - Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View
Geometry [62.29762409558553]
Epipolar constraints are at the core of feature matching and depth estimation in multi-person 3D human pose estimation methods.
Despite the satisfactory performance of this formulation in sparser crowd scenes, its effectiveness is frequently challenged under denser crowd circumstances.
In this paper, we depart from the multi-person 3D pose estimation formulation, and instead reformulate it as crowd pose estimation.
arXiv Detail & Related papers (2020-07-21T17:59:36Z) - MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation
from Human Images [42.27703025887059]
The main problems with the standard supervised approach are that it often yields anatomically implausible poses.
We propose a semi-supervised method that can make effective use of images with and without pose annotations.
The results of experiments show that the proposed reflective architecture makes estimated poses anatomically plausible.
arXiv Detail & Related papers (2020-04-08T05:02:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.