ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with
Uncertainty
- URL: http://arxiv.org/abs/2303.10042v2
- Date: Tue, 12 Sep 2023 13:08:53 GMT
- Title: ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with
Uncertainty
- Authors: Vanessa Wirth, Anna-Maria Liphardt, Birte Coppers, Johanna Br\"aunig,
Simon Heinrich, Sigrid Leyendecker, Arnd Kleyer, Georg Schett, Martin
Vossiek, Bernhard Egger, Marc Stamminger
- Abstract summary: We propose ShaRPy, the first RGB-D Shape Reconstruction and hand Pose tracking system.
ShaRPy approximates a personalized hand shape, promoting a more realistic and intuitive understanding of its digital twin.
We evaluate ShaRPy on a keypoint detection benchmark and show qualitative results of hand function assessments for activity monitoring of musculoskeletal diseases.
- Score: 6.559796851992517
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Despite their potential, markerless hand tracking technologies are not yet
applied in practice to the diagnosis or monitoring of the activity in
inflammatory musculoskeletal diseases. One reason is that the focus of most
methods lies in the reconstruction of coarse, plausible poses, whereas in the
clinical context, accurate, interpretable, and reliable results are required.
Therefore, we propose ShaRPy, the first RGB-D Shape Reconstruction and hand
Pose tracking system, which provides uncertainty estimates of the computed
pose, e.g., when a finger is hidden or its estimate is inconsistent with the
observations in the input, to guide clinical decision-making. Besides pose,
ShaRPy approximates a personalized hand shape, promoting a more realistic and
intuitive understanding of its digital twin. Our method requires only a
light-weight setup with a single consumer-level RGB-D camera yet it is able to
distinguish similar poses with only small joint angle deviations in a
metrically accurate space. This is achieved by combining a data-driven dense
correspondence predictor with traditional energy minimization. To bridge the
gap between interactive visualization and biomedical simulation we leverage a
parametric hand model in which we incorporate biomedical constraints and
optimize for both, its pose and hand shape. We evaluate ShaRPy on a keypoint
detection benchmark and show qualitative results of hand function assessments
for activity monitoring of musculoskeletal diseases.
Related papers
- MS-MANO: Enabling Hand Pose Tracking with Biomechanical Constraints [50.61346764110482]
We integrate a musculoskeletal system with a learnable parametric hand model, MANO, to create MS-MANO.
This model emulates the dynamics of muscles and tendons to drive the skeletal system, imposing physiologically realistic constraints on the resulting torque trajectories.
We also propose a simulation-in-the-loop pose refinement framework, BioPR, that refines the initial estimated pose through a multi-layer perceptron network.
arXiv Detail & Related papers (2024-04-16T02:18:18Z) - Analyzing Participants' Engagement during Online Meetings Using Unsupervised Remote Photoplethysmography with Behavioral Features [50.82725748981231]
Engagement measurement finds application in healthcare, education, services.
Use of physiological and behavioral features is viable, but impracticality of traditional physiological measurement arises due to the need for contact sensors.
We demonstrate the feasibility of the unsupervised photoplethysmography (rmography) as an alternative for contact sensors.
arXiv Detail & Related papers (2024-04-05T20:39:16Z) - KIDS: kinematics-based (in)activity detection and segmentation in a
sleep case study [5.707737640557724]
Sleep behaviour and in-bed movements contain rich information on the neurophysiological health of people.
This paper proposes an online Bayesian probabilistic framework for objective (in)activity detection and segmentation based on clinically meaningful joint kinematics.
arXiv Detail & Related papers (2023-01-04T16:24:01Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image
Domains [22.92165116962952]
In-bed human posture estimation provides important health-related metrics with potential value in medical condition assessments.
We propose a multi-modal conditional variational autoencoder (MC-VAE) capable of reconstructing features from missing modalities seen during training.
We demonstrate that body positions can be effectively recognized from the available modality, achieving on par results with baseline models.
arXiv Detail & Related papers (2021-11-30T04:56:16Z) - Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network [51.44506007844284]
We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery.
We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks.
We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
arXiv Detail & Related papers (2021-11-08T07:57:30Z) - Data-driven generation of plausible tissue geometries for realistic
photoacoustic image synthesis [53.65837038435433]
Photoacoustic tomography (PAT) has the potential to recover morphological and functional tissue properties.
We propose a novel approach to PAT data simulation, which we refer to as "learning to simulate"
We leverage the concept of Generative Adversarial Networks (GANs) trained on semantically annotated medical imaging data to generate plausible tissue geometries.
arXiv Detail & Related papers (2021-03-29T11:30:18Z) - Multi-view Human Pose and Shape Estimation Using Learnable Volumetric
Aggregation [0.0]
We propose a learnable aggregation approach to reconstruct 3D human body pose and shape from calibrated multi-view images.
Compared to previous approaches, our framework shows higher accuracy and greater promise for real-time prediction, given its cost efficiency.
arXiv Detail & Related papers (2020-11-26T18:33:35Z) - Appearance Learning for Image-based Motion Estimation in Tomography [60.980769164955454]
In tomographic imaging, anatomical structures are reconstructed by applying a pseudo-inverse forward model to acquired signals.
Patient motion corrupts the geometry alignment in the reconstruction process resulting in motion artifacts.
We propose an appearance learning approach recognizing the structures of rigid motion independently from the scanned object.
arXiv Detail & Related papers (2020-06-18T09:49:11Z) - Patient Specific Biomechanics Are Clinically Significant In Accurate
Computer Aided Surgical Image Guidance [3.5760618920650398]
Augmented Reality is used in Image Guided surgery (AR IG) to fuse surgical landmarks from preoperative images into a video overlay.
Physical simulation is essential to maintaining accurate position of the landmarks as surgery progresses.
In liver procedures, AR IG simulation accuracy is hampered by an inability to model stiffness variations to the patients disease.
We introduce a novel method to account for patient specific stiffness variation based on Magnetic Resonance Elastography (MRE) data.
arXiv Detail & Related papers (2020-01-29T08:11:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.