PACMAN: a framework for pulse oximeter digit detection and reading in a
low-resource setting
- URL: http://arxiv.org/abs/2212.04964v1
- Date: Fri, 9 Dec 2022 16:22:28 GMT
- Title: PACMAN: a framework for pulse oximeter digit detection and reading in a
low-resource setting
- Authors: Chiraphat Boonnag, Wanumaidah Saengmolee, Narongrid Seesawad, Amrest
Chinkamol, Saendee Rattanasomrerk, Kanyakorn Veerakanjana, Kamonwan
Thanontip, Warissara Limpornchitwilai, Piyalitt Ittichaiwong, and Theerawit
Wilaiprasitporn
- Abstract summary: In light of the COVID-19 pandemic, patients were required to manually input their daily oxygen saturation (SpO2) and pulse rate (PR) values into a health monitoring system.
Several studies attempted to detect the physiological value from the captured image using optical character recognition (OCR)
This study aimed to propose a novel framework called PACMAN with a low-resource deep learning-based computer vision.
- Score: 0.42897826548373363
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In light of the COVID-19 pandemic, patients were required to manually input
their daily oxygen saturation (SpO2) and pulse rate (PR) values into a health
monitoring system-unfortunately, such a process trend to be an error in typing.
Several studies attempted to detect the physiological value from the captured
image using optical character recognition (OCR). However, the technology has
limited availability with high cost. Thus, this study aimed to propose a novel
framework called PACMAN (Pandemic Accelerated Human-Machine Collaboration) with
a low-resource deep learning-based computer vision. We compared
state-of-the-art object detection algorithms (scaled YOLOv4, YOLOv5, and
YOLOR), including the commercial OCR tools for digit recognition on the
captured images from pulse oximeter display. All images were derived from
crowdsourced data collection with varying quality and alignment. YOLOv5 was the
best-performing model against the given model comparison across all datasets,
notably the correctly orientated image dataset. We further improved the model
performance with the digits auto-orientation algorithm and applied a clustering
algorithm to extract SpO2 and PR values. The accuracy performance of YOLOv5
with the implementations was approximately 81.0-89.5%, which was enhanced
compared to without any additional implementation. Accordingly, this study
highlighted the completion of PACMAN framework to detect and read digits in
real-world datasets. The proposed framework has been currently integrated into
the patient monitoring system utilized by hospitals nationwide.
Related papers
- A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data [0.0]
We present a novel framework for synthesizing patient data with longitudinal observations.
Our approach introduces controlled association in latent spaces generating each data modality.
We demonstrate our framework using optical coherence tomography ( OCT) scans.
arXiv Detail & Related papers (2024-10-21T16:43:16Z) - Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading [0.0]
This research aims to present a novel hybrid learning model using self-supervised learning and knowledge distillation.
In our algorithm, for the first time among all self-supervised learning and knowledge distillation models, the test dataset is 50% larger than the training dataset.
Compared to a similar state-of-the-art model, our results achieved higher accuracy and more effective representation spaces.
arXiv Detail & Related papers (2024-10-01T15:19:16Z) - REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates [54.96885726053036]
This paper introduces a novel graph-based residual state update mechanism (REST) for real-time EEG signal analysis.
By leveraging a combination of graph neural networks and recurrent structures, REST efficiently captures both non-Euclidean geometry and temporal dependencies within EEG data.
Our model demonstrates high accuracy in both seizure detection and classification tasks.
arXiv Detail & Related papers (2024-06-03T16:30:19Z) - A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic [0.0]
This work assesses the most fundamental object detection models on the Common Objects in Context (COCO) dataset.
We select a highly efficient model called YOLOv5 to train on the topical and unexplored dataset of human faces with medical masks.
We propose an optimized model based on YOLOv5 using transfer learning for the detection of correctly and incorrectly worn medical masks.
arXiv Detail & Related papers (2024-05-28T17:27:24Z) - VAE-IF: Deep feature extraction with averaging for fully unsupervised artifact detection in routinely acquired ICU time-series [1.9665926763554147]
We propose a novel fully unsupervised approach to detect artifacts in minute-by-minute resolution ICU data without prior labeling or signal-specific knowledge.
Our approach combines a variational autoencoder (VAE) and an isolation forest (IF) into a hybrid model to learn features and identify anomalies.
We show that our unsupervised approach achieves comparable sensitivity to fully supervised methods and generalizes well to an external dataset.
arXiv Detail & Related papers (2023-12-10T18:03:40Z) - Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation
of rPPG [2.82697733014759]
r (pg photoplethysmography) is a technology that measures and analyzes BVP (Blood Volume Pulse) by using the light absorption characteristics of hemoglobin captured through a camera.
This study is to provide a framework to evaluate various r benchmarking techniques across a wide range of datasets for fair evaluation and comparison.
arXiv Detail & Related papers (2023-07-24T09:35:47Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Intelligent Masking: Deep Q-Learning for Context Encoding in Medical
Image Analysis [48.02011627390706]
We develop a novel self-supervised approach that occludes targeted regions to improve the pre-training procedure.
We show that training the agent against the prediction model can significantly improve the semantic features extracted for downstream classification tasks.
arXiv Detail & Related papers (2022-03-25T19:05:06Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.