PACMAN: a framework for pulse oximeter digit detection and reading in a
low-resource setting
- URL: http://arxiv.org/abs/2212.04964v1
- Date: Fri, 9 Dec 2022 16:22:28 GMT
- Title: PACMAN: a framework for pulse oximeter digit detection and reading in a
low-resource setting
- Authors: Chiraphat Boonnag, Wanumaidah Saengmolee, Narongrid Seesawad, Amrest
Chinkamol, Saendee Rattanasomrerk, Kanyakorn Veerakanjana, Kamonwan
Thanontip, Warissara Limpornchitwilai, Piyalitt Ittichaiwong, and Theerawit
Wilaiprasitporn
- Abstract summary: In light of the COVID-19 pandemic, patients were required to manually input their daily oxygen saturation (SpO2) and pulse rate (PR) values into a health monitoring system.
Several studies attempted to detect the physiological value from the captured image using optical character recognition (OCR).
This study proposes a novel framework called PACMAN, based on low-resource deep learning-based computer vision.
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In light of the COVID-19 pandemic, patients were required to manually input
their daily oxygen saturation (SpO2) and pulse rate (PR) values into a health
monitoring system. Unfortunately, this process is prone to typing errors.
Several studies attempted to detect the physiological value from the captured
image using optical character recognition (OCR). However, the technology has
limited availability and high cost. This study therefore proposes a novel
framework called PACMAN (Pandemic Accelerated Human-Machine Collaboration) based
on low-resource deep learning-based computer vision. We compared
state-of-the-art object detection algorithms (scaled YOLOv4, YOLOv5, and
YOLOR), as well as commercial OCR tools, for digit recognition on images
captured from pulse oximeter displays. All images were derived from
crowdsourced data collection with varying quality and alignment. YOLOv5 was the
best-performing model in this comparison across all datasets, notably on the
correctly oriented image dataset. We further improved model performance with a
digit auto-orientation algorithm and applied a clustering algorithm to extract
the SpO2 and PR values. With these additions, YOLOv5 reached an accuracy of
approximately 81.0-89.5%, higher than without them. Accordingly, this study
demonstrates that the completed PACMAN framework can detect and read digits in
real-world datasets. The proposed framework is currently integrated into the
patient monitoring system used by hospitals nationwide.
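The abstract says that detected digits are grouped by a clustering algorithm to recover the SpO2 and PR values, but it does not say which algorithm is used. The following is a minimal sketch of one possible grouping step, not the authors' method. It assumes the image is already upright (that is, after auto-orientation), that the display shows exactly two readings stacked vertically, and that each detection carries a digit label and a box center. The names Digit, split_rows, read_number, and read_display are hypothetical. Which row is SpO2 and which is PR depends on the device, so the sketch only returns the top and bottom readings.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Digit:
    """One detected digit (hypothetical structure, not the paper's format)."""
    value: int   # digit class 0-9 predicted by the detector
    cx: float    # horizontal center of the bounding box, in pixels
    cy: float    # vertical center of the bounding box, in pixels


def split_rows(digits: List[Digit]) -> Tuple[List[Digit], List[Digit]]:
    """Split digits into two rows at the largest gap between vertical centers.

    This is a simple 1-D two-cluster split, used here as a stand-in for
    whatever clustering algorithm the framework actually applies.
    """
    if len(digits) < 2:
        raise ValueError("need at least two digits to form two readings")
    ordered = sorted(digits, key=lambda d: d.cy)
    gaps = [ordered[i + 1].cy - ordered[i].cy for i in range(len(ordered) - 1)]
    cut = gaps.index(max(gaps)) + 1
    return ordered[:cut], ordered[cut:]


def read_number(row: List[Digit]) -> int:
    """Read one row of digits from left to right as a decimal number."""
    left_to_right = sorted(row, key=lambda d: d.cx)
    return int("".join(str(d.value) for d in left_to_right))


def read_display(digits: List[Digit]) -> Tuple[int, int]:
    """Return (top reading, bottom reading) for an upright display image."""
    top, bottom = split_rows(digits)
    return read_number(top), read_number(bottom)


if __name__ == "__main__":
    # Made-up detections: "98" on the upper row and "72" on the lower row.
    detections = [
        Digit(9, 10.0, 20.0),
        Digit(8, 30.0, 22.0),
        Digit(7, 12.0, 80.0),
        Digit(2, 32.0, 78.0),
    ]
    print(read_display(detections))  # prints (98, 72)
```

The largest-gap split keeps the sketch dependency-free. A real deployment would also need to handle missing or spurious detections and displays with other layouts.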
Related papers
- Improving Interpretability and Robustness for the Detection of AI-Generated Images [6.116075037154215]
We analyze existing state-of-the-art AIGI detection methods based on frozen CLIP embeddings.
We show how to interpret them, shedding light on how images produced by various AI generators differ from real ones.
arXiv Detail & Related papers (2024-06-21T10:33:09Z) - REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates [54.96885726053036]
This paper introduces a novel graph-based residual state update mechanism (REST) for real-time EEG signal analysis.
By leveraging a combination of graph neural networks and recurrent structures, REST efficiently captures both non-Euclidean geometry and temporal dependencies within EEG data.
Our model demonstrates high accuracy in both seizure detection and classification tasks.
arXiv Detail & Related papers (2024-06-03T16:30:19Z) - A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic [0.0]
This work assesses the most fundamental object detection models on the Common Objects in Context (COCO) dataset.
We select a highly efficient model called YOLOv5 to train on the topical and unexplored dataset of human faces with medical masks.
We propose an optimized model based on YOLOv5 using transfer learning for the detection of correctly and incorrectly worn medical masks.
arXiv Detail & Related papers (2024-05-28T17:27:24Z) - VAE-IF: Deep feature extraction with averaging for unsupervised artifact
detection in routine acquired ICU time-series [1.9665926763554147]
We propose an unsupervised approach to detect artifacts in minute-by-minute resolution ICU data without prior labeling or signal-specific knowledge.
Our approach combines a variational autoencoder (VAE) and an isolation forest (iForest) model to learn features and identify anomalies.
We show that our approach achieves comparable sensitivity and generalizes well to an external dataset.
arXiv Detail & Related papers (2023-12-10T18:03:40Z) - Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation
of rPPG [2.82697733014759]
rPPG (remote photoplethysmography) is a technology that measures and analyzes BVP (Blood Volume Pulse) by using the light absorption characteristics of hemoglobin captured through a camera.
This study provides a framework to evaluate various rPPG techniques across a wide range of datasets for fair evaluation and comparison.
arXiv Detail & Related papers (2023-07-24T09:35:47Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E).
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC).
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - A parameter refinement method for Ptychography based on Deep Learning
concepts [55.41644538483948]
Coarse parametrisation in propagation distance, position errors and partial coherence frequently threatens the viability of the experiment.
A modern Deep Learning framework is used to autonomously correct the setup incoherences, thus improving the quality of a ptychography reconstruction.
We tested our system on both synthetic datasets and real data acquired at the TwinMic beamline of the Elettra synchrotron facility.
arXiv Detail & Related papers (2021-05-18T10:15:17Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
Deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z) - BS-Net: learning COVID-19 pneumonia severity on a large Chest X-Ray
dataset [6.5800499500032705]
We design an end-to-end deep learning architecture for predicting, on chest X-ray (CXR) images, a multi-regional score conveying the degree of lung compromise in COVID-19 patients.
We exploit a clinical dataset of almost 5,000 CXR annotated images collected in the same hospital.
Our solution outperforms single human annotators in rating accuracy and consistency.
arXiv Detail & Related papers (2020-06-08T13:55:58Z)