A Mixed Reality System for Robust Manikin Localization in Childbirth Training
- URL: http://arxiv.org/abs/2602.05588v1
- Date: Thu, 05 Feb 2026 12:17:05 GMT
- Title: A Mixed Reality System for Robust Manikin Localization in Childbirth Training
- Authors: Haojie Cheng, Chang Liu, Abhiram Kanneganti, Mahesh Arjandas Choolani, Arundhati Tushar Gosavi, Eng Tat Khoo,
- Abstract summary: We introduce a mixed reality (MR) system for childbirth training that combines virtual guidance with tactile manikin interaction.<n>The system extends the passthrough capability of commercial head-mounted displays (HMDs) by spatially calibrating an external RGB-D camera.<n>A large-scale user study involving 83 fourth-year medical students was subsequently conducted to compare MR-based and virtual reality (VR)-based childbirth training.
- Score: 3.6037109860836556
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Opportunities for medical students to gain practical experience in vaginal births are increasingly constrained by shortened clinical rotations, patient reluctance, and the unpredictable nature of labour. To alleviate clinicians' instructional burden and enhance trainees' learning efficiency, we introduce a mixed reality (MR) system for childbirth training that combines virtual guidance with tactile manikin interaction, thereby preserving authentic haptic feedback while enabling independent practice without continuous on-site expert supervision. The system extends the passthrough capability of commercial head-mounted displays (HMDs) by spatially calibrating an external RGB-D camera, allowing real-time visual integration of physical training objects. Building on this capability, we implement a coarse-to-fine localization pipeline that first aligns the maternal manikin with fiducial markers to define a delivery region and then registers the pre-scanned neonatal head within this area. This process enables spatially accurate overlay of virtual guiding hands near the manikin, allowing trainees to follow expert trajectories reinforced by haptic interaction. Experimental evaluations demonstrate that the system achieves accurate and stable manikin localization on a standalone headset, ensuring practical deployment without external computing resources. A large-scale user study involving 83 fourth-year medical students was subsequently conducted to compare MR-based and virtual reality (VR)-based childbirth training. Four senior obstetricians independently assessed performance using standardized criteria. Results showed that MR training achieved significantly higher scores in delivery, post-delivery, and overall task performance, and was consistently preferred by trainees over VR training.
Related papers
- A Voice-Enabled Virtual Patient System for Interactive Training in Standardized Clinical Assessment [0.0]
We introduce a voice-enabled virtual patient simulation system powered by a large language model (LLM)<n>This study describes the system's development and validates its ability to generate virtual patients who adhere to pre-defined clinical profiles.<n>Our findings suggest that LLM-powered virtual patient simulations are a viable and scalable tool for training clinicians.
arXiv Detail & Related papers (2025-11-01T21:18:08Z) - CLiVR: Conversational Learning System in Virtual Reality with AI-Powered Patients [0.0]
CLiVR is a Conversational Learning system in Virtual Reality that integrates large language models, speech processing, and 3D avatars.<n>Developed in Unity and deployed on the Meta Quest 3 platform, CLiVR enables trainees to engage in natural dialogue with virtual patients.
arXiv Detail & Related papers (2025-10-21T19:19:55Z) - Setup-Invariant Augmented Reality for Teaching by Demonstration with Surgical Robots [49.26692555627371]
We present dV-STEAR, an open-source system that plays back task-aligned expert demonstrations without assuming identical setup joint positions between expert and novice.<n>In a user study, dV-STEAR significantly improved novice performance on tasks from the Fundamentals of Laparoscopic Surgery.<n>Across both tasks, participants using dV-STEAR exhibited significantly more balanced hand use and reported lower frustration levels.
arXiv Detail & Related papers (2025-04-09T08:34:25Z) - Analyzing Participants' Engagement during Online Meetings Using Unsupervised Remote Photoplethysmography with Behavioral Features [50.82725748981231]
Engagement measurement finds application in healthcare, education, services.
Use of physiological and behavioral features is viable, but impracticality of traditional physiological measurement arises due to the need for contact sensors.
We demonstrate the feasibility of the unsupervised photoplethysmography (rmography) as an alternative for contact sensors.
arXiv Detail & Related papers (2024-04-05T20:39:16Z) - Assessment and treatment of visuospatial neglect using active learning
with Gaussian processes regression [0.3262230127283452]
Visuospatial neglect is a disorder characterised by impaired awareness for visual stimuli located in regions of space and frames of reference.
We present an artificial intelligence solution designed to accurately assess a patient's visuospatial neglect in a three-dimensional setting.
arXiv Detail & Related papers (2023-09-29T09:18:32Z) - A Survey of the Impact of Self-Supervised Pretraining for Diagnostic
Tasks with Radiological Images [71.26717896083433]
Self-supervised pretraining has been observed to be effective at improving feature representations for transfer learning.
This review summarizes recent research into its usage in X-ray, computed tomography, magnetic resonance, and ultrasound imaging.
arXiv Detail & Related papers (2023-09-05T19:45:09Z) - RL-Based Guidance in Outpatient Hysteroscopy Training: A Feasibility
Study [4.614579113754949]
This work presents an RL-based agent for outpatient hysteroscopy training.
Recent advancements enabled performing this type of intervention in the outpatient setup without anaesthesia.
While being beneficial to the patient, this approach introduces new challenges for clinicians, who should take additional measures to maintain the level of patient comfort and prevent tissue damage.
arXiv Detail & Related papers (2022-11-26T11:16:17Z) - Automated Fidelity Assessment for Strategy Training in Inpatient
Rehabilitation using Natural Language Processing [53.096237570992294]
Strategy training is a rehabilitation approach that teaches skills to reduce disability among those with cognitive impairments following a stroke.
Standardized fidelity assessment is used to measure adherence to treatment principles.
We developed a rule-based NLP algorithm, a long-short term memory (LSTM) model, and a bidirectional encoder representation from transformers (BERT) model for this task.
arXiv Detail & Related papers (2022-09-14T15:33:30Z) - Learning Personal Representations from fMRIby Predicting Neurofeedback
Performance [52.77024349608834]
We present a deep neural network method for learning a personal representation for individuals performing a self neuromodulation task, guided by functional MRI (fMRI)
The representation is learned by a self-supervised recurrent neural network, that predicts the Amygdala activity in the next fMRI frame given recent fMRI frames and is conditioned on the learned individual representation.
arXiv Detail & Related papers (2021-12-06T10:16:54Z) - Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network [51.44506007844284]
We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery.
We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks.
We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
arXiv Detail & Related papers (2021-11-08T07:57:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.