Two-Stream Thermal Imaging Fusion for Enhanced Time of Birth Detection in Neonatal Care
- URL: http://arxiv.org/abs/2503.03244v1
- Date: Wed, 05 Mar 2025 07:52:52 GMT
- Title: Two-Stream Thermal Imaging Fusion for Enhanced Time of Birth Detection in Neonatal Care
- Authors: Jorge García-Torres, Øyvind Meinich-Bache, Sara Brunner, Siren Rettedal, Vilde Kolstad, Kjersti Engan,
- Abstract summary: We present a two-stream fusion system that combines the power of image and video analysis to accurately detect the Time of Birth (ToB)<n>Our system achieves 95.7% precision and 84.8% recall in detecting birth within short video clips.
- Score: 1.101731711817642
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Around 10% of newborns require some help to initiate breathing, and 5\% need ventilation assistance. Accurate Time of Birth (ToB) documentation is essential for optimizing neonatal care, as timely interventions are vital for proper resuscitation. However, current clinical methods for recording ToB often rely on manual processes, which can be prone to inaccuracies. In this study, we present a novel two-stream fusion system that combines the power of image and video analysis to accurately detect the ToB from thermal recordings in the delivery room and operating theater. By integrating static and dynamic streams, our approach captures richer birth-related spatiotemporal features, leading to more robust and precise ToB estimation. We demonstrate that this synergy between data modalities enhances performance over single-stream approaches. Our system achieves 95.7% precision and 84.8% recall in detecting birth within short video clips. Additionally, with the help of a score aggregation module, it successfully identifies ToB in 100% of test cases, with a median absolute error of 2 seconds and an absolute mean deviation of 4.5 seconds compared to manual annotations.
Related papers
- Synergistic Bleeding Region and Point Detection in Surgical Videos [58.52416525595543]
Intraoperative bleeding in laparoscopic surgery causes rapid obscuration of the operative field to hinder the surgical process.
In this study, we first construct a real-world surgical bleeding detection dataset, named SurgBlood, comprising 5,330 frames from 95 surgical video clips with bleeding region and point annotations.
We develop a dual-task synergistic online detector called BlooDet, designed to perform simultaneous detection of bleeding regions and points in surgical videos.
arXiv Detail & Related papers (2025-03-28T06:27:55Z) - Finetuning and Quantization of EEG-Based Foundational BioSignal Models on ECG and PPG Data for Blood Pressure Estimation [53.2981100111204]
Photoplethysmography and electrocardiography can potentially enable continuous blood pressure (BP) monitoring.<n>Yet accurate and robust machine learning (ML) models remains challenging due to variability in data quality and patient-specific factors.<n>In this work, we investigate whether a model pre-trained on one modality can effectively be exploited to improve the accuracy of a different signal type.<n>Our approach achieves near state-of-the-art accuracy for diastolic BP and surpasses by 1.5x the accuracy of prior works for systolic BP.
arXiv Detail & Related papers (2025-02-10T13:33:12Z) - AI-Based Thermal Video Analysis in Privacy-Preserving Healthcare: A Case Study on Detecting Time of Birth [1.2524536193679123]
We present an AI-driven, video-based system for automated ToB detection using thermal imaging.<n>Our system achieves 91.4% precision and 97.4% recall in detecting ToB within thermal video clips during performance evaluation.
arXiv Detail & Related papers (2025-02-05T07:01:49Z) - Advancing Newborn Care: Precise Birth Time Detection Using AI-Driven Thermal Imaging with Adaptive Normalization [1.101731711817642]
We investigate the fusion of Artificial Intelligence (AI) and thermal imaging to develop the first AI-driven Time of Birth detector.
Our methodology involves a three-step process: first, we propose an adaptive normalization method based on Gaussian mixture models (GMM) to mitigate issues related to temperature variations.
A precision of 88.1% and a recall of 89.3% are reported in the detection of the newborn within thermal frames during performance evaluation.
arXiv Detail & Related papers (2024-10-14T13:20:51Z) - Automatic Infant Respiration Estimation from Video: A Deep Flow-based
Algorithm and a Novel Public Benchmark [10.097634735211654]
We develop a deep-learning method for estimating respiratory rate and waveform from plain video footage in natural settings.
Our model supports the first public annotated infant respiration dataset with 125 videos.
When trained and tested on the AIR-125 infant data, our method significantly outperforms other state-of-the-art methods in respiratory rate estimation.
arXiv Detail & Related papers (2023-07-24T19:59:15Z) - Domain Transfer Through Image-to-Image Translation for Uncertainty-Aware Prostate Cancer Classification [42.75911994044675]
We present a novel approach for unpaired image-to-image translation of prostate MRIs and an uncertainty-aware training approach for classifying clinically significant PCa.
Our approach involves a novel pipeline for translating unpaired 3.0T multi-parametric prostate MRIs to 1.5T, thereby augmenting the available training data.
Our experiments demonstrate that the proposed method significantly improves the Area Under ROC Curve (AUC) by over 20% compared to the previous work.
arXiv Detail & Related papers (2023-07-02T05:26:54Z) - Object Detection During Newborn Resuscitation Activities [3.5661795505491445]
We propose a two-step process in order to detect activities possibly overlapping in time.
The first step is to detect and track the relevant objects, like bag-mask resuscitator, heart rate sensors etc.
The performance of the object detection during activities were 96.97 % (ventilations), 100 % (attaching/removing heart rate sensor) and 75 % (suction) on a test set of 20 videos.
arXiv Detail & Related papers (2023-03-14T11:04:50Z) - Activity Recognition From Newborn Resuscitation Videos [3.54339629317433]
A dataset of recorded signals during newborn resuscitation, including videos, has been collected in Haydom, Tanzania.
We propose a two-step deep neural network system, ORAA-net, to do activity recognition during newborn resuscitation.
The system recognized the activities newborn uncovered, stimulation, ventilation and suction with a mean precision of 77.67 %, a mean recall of 77,64 %, and a mean accuracy of 92.40 %.
arXiv Detail & Related papers (2023-03-14T11:04:32Z) - TempNet: Temporal Attention Towards the Detection of Animal Behaviour in
Videos [63.85815474157357]
We propose an efficient computer vision- and deep learning-based method for the detection of biological behaviours in videos.
TempNet uses an encoder bridge and residual blocks to maintain model performance with a two-staged, spatial, then temporal, encoder.
We demonstrate its application to the detection of sablefish (Anoplopoma fimbria) startle events.
arXiv Detail & Related papers (2022-11-17T23:55:12Z) - EMT-NET: Efficient multitask network for computer-aided diagnosis of
breast cancer [58.720142291102135]
We propose an efficient and light-weighted learning architecture to classify and segment breast tumors simultaneously.
We incorporate a segmentation task into a tumor classification network, which makes the backbone network learn representations focused on tumor regions.
The accuracy, sensitivity, and specificity of tumor classification is 88.6%, 94.1%, and 85.3%, respectively.
arXiv Detail & Related papers (2022-01-13T05:24:40Z) - A Deep Learning Approach to Predicting Collateral Flow in Stroke
Patients Using Radiomic Features from Perfusion Images [58.17507437526425]
Collateral circulation results from specialized anastomotic channels which provide oxygenated blood to regions with compromised blood flow.
The actual grading is mostly done through manual inspection of the acquired images.
We present a deep learning approach to predicting collateral flow grading in stroke patients based on radiomic features extracted from MR perfusion data.
arXiv Detail & Related papers (2021-10-24T18:58:40Z) - Finding Action Tubes with a Sparse-to-Dense Framework [62.60742627484788]
We propose a framework that generates action tube proposals from video streams with a single forward pass in a sparse-to-dense manner.
We evaluate the efficacy of our model on the UCF101-24, JHMDB-21 and UCFSports benchmark datasets.
arXiv Detail & Related papers (2020-08-30T15:38:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.