A Deep Learning Approach to Localizing Multi-level Airway Collapse Based on Snoring Sounds
- URL: http://arxiv.org/abs/2408.16030v1
- Date: Wed, 28 Aug 2024 09:30:20 GMT
- Title: A Deep Learning Approach to Localizing Multi-level Airway Collapse Based on Snoring Sounds
- Authors: Ying-Chieh Hsu, Stanley Yung-Chuan Liu, Chao-Jung Huang, Chi-Wei Wu, Ren-Kai Cheng, Jane Yung-Jen Hsu, Shang-Ran Huang, Yuan-Ren Cheng, Fu-Shun Hsu
- Abstract summary: This study investigates the application of machine/deep learning to classify snoring sounds excited at different levels of the upper airway in patients with obstructive sleep apnea (OSA).
The snoring sounds of 39 subjects were analyzed and labeled according to the Velum, Oropharynx, Tongue Base, and Epiglottis (VOTE) classification system.
The ResNet-50, a convolutional neural network (CNN), showed the best overall performance in classifying snoring acoustics.
- Score: 1.165734481380989
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: This study investigates the application of machine/deep learning to classify snoring sounds excited at different levels of the upper airway in patients with obstructive sleep apnea (OSA) using data from drug-induced sleep endoscopy (DISE). The snoring sounds of 39 subjects were analyzed and labeled according to the Velum, Oropharynx, Tongue Base, and Epiglottis (VOTE) classification system. The dataset, comprising 5,173 one-second segments, was used to train and test models, including Support Vector Machine (SVM), Bidirectional Long Short-Term Memory (BiLSTM), and ResNet-50. The ResNet-50, a convolutional neural network (CNN), showed the best overall performance in classifying snoring acoustics, particularly in identifying multi-level obstructions. The study emphasizes the potential of integrating snoring acoustics with deep learning to improve the diagnosis and treatment of OSA. However, challenges such as limited sample size, data imbalance, and differences between pharmacologically induced and natural snoring sounds were noted, suggesting further research to enhance model accuracy and generalizability.
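The abstract mentions two data-handling points that a short sketch can make concrete: cutting recordings into one-second segments and coping with data imbalance. The following is a minimal illustration only, not the authors' pipeline. The sample rate, the synthetic audio, the label array, and both helper names (`split_into_segments`, `inverse_frequency_weights`) are assumptions introduced here, and inverse-frequency class weighting is just one common way to counter imbalance; the abstract does not say which remedy, if any, was used.

```python
import numpy as np


def split_into_segments(signal, sample_rate, segment_seconds=1.0):
    """Split a 1-D signal into non-overlapping fixed-length segments.

    Any trailing samples that do not fill a whole segment are dropped.
    """
    seg_len = int(round(sample_rate * segment_seconds))
    n_segments = len(signal) // seg_len
    return signal[: n_segments * seg_len].reshape(n_segments, seg_len)


def inverse_frequency_weights(labels):
    """Return per-class weights n_samples / (n_classes * class_count).

    Rare classes receive larger weights, frequent classes smaller ones.
    """
    classes, counts = np.unique(labels, return_counts=True)
    weights = len(labels) / (len(classes) * counts)
    return dict(zip(classes.tolist(), weights.tolist()))


# Hypothetical input: 3.5 s of synthetic noise at an assumed 16 kHz rate.
sample_rate = 16_000
audio = np.random.default_rng(0).standard_normal(int(3.5 * sample_rate))
segments = split_into_segments(audio, sample_rate)  # three full 1 s segments

# Hypothetical per-segment labels using the VOTE site initials.
labels = np.array(["V", "V", "V", "O", "T", "E"])
weights = inverse_frequency_weights(labels)
```

Weights of this kind are typically passed to a weighted loss or to a classifier's class-weight option so that errors on under-represented obstruction sites count for more during training.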
Related papers
- MobileNetV2: A lightweight classification model for home-based sleep apnea screening [3.463585190363689]
This study proposes a novel lightweight neural network model leveraging features extracted from electrocardiogram (ECG) and respiratory signals for early OSA screening.
ECG signals are used to generate feature spectrograms to predict sleep stages, while respiratory signals are employed to detect sleep-related breathing abnormalities.
By integrating these predictions, the method calculates the apnea-hypopnea index (AHI) with enhanced accuracy, facilitating precise OSA diagnosis.
arXiv Detail & Related papers (2024-12-28T01:37:25Z)
- Mamba-based Deep Learning Approaches for Sleep Staging on a Wireless Multimodal Wearable System without Electroencephalography [3.7428541180163126]
We investigate Mamba-based deep learning approaches for sleep staging on signals from the ANNE One system.
We trained Mamba-based models with convolutional-recurrent neural network (CRNN) and recurrent neural network (RNN) architectures.
Deep learning models can infer major sleep stages from the ANNE One and can be successfully applied to data from adults attending a tertiary care sleep clinic.
arXiv Detail & Related papers (2024-12-20T14:43:02Z)
- Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks [0.0]
Snoring is an acoustic biomarker commonly observed in individuals with Obstructive Sleep Apnoea Syndrome (OSAS).
We propose a novel method to differentiate monaural snoring from non-snoring sounds by analyzing the harmonic content of the input sound.
arXiv Detail & Related papers (2024-10-31T10:27:48Z)
- Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation [1.4149417323913716]
We study pretrained audio neural networks (CNN6, CNN10 and CNN14) and the Masked Autoencoder (Audio-MAE) for RI detection.
For the regression task of estimating SpO2 levels, the models achieve root mean square error values exceeding the accepted clinical range of 3.5% for finger oximeters.
We transform SpO2-regression into a SpO2-threshold binary classification problem, with a threshold of 92%.
arXiv Detail & Related papers (2024-07-30T17:26:16Z)
- Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases [5.810320353233697]
We introduce Rene, a pioneering large-scale model tailored for respiratory sound recognition.
Our innovative approach applies a pre-trained speech recognition model to process respiratory sounds.
We have developed a real-time respiratory sound discrimination system utilizing the Rene architecture.
arXiv Detail & Related papers (2024-05-13T03:00:28Z)
- Multimodal Sleep Apnea Detection with Missing or Noisy Modalities [1.3518297878940662]
We propose a comprehensive pipeline aiming to compensate for the missing or noisy modalities when performing sleep apnea detection.
Our experiments show that the proposed model outperforms other state-of-the-art approaches in sleep apnea detection.
arXiv Detail & Related papers (2024-02-24T16:29:36Z)
- Detecting Respiratory Pathologies Using Convolutional Neural Networks and Variational Autoencoders for Unbalancing Data [0.3749861135832073]
This dataset is composed of 920 sounds of which 810 are of chronic diseases, 75 of non-chronic diseases and only 35 of healthy individuals.
A Convolutional Neural Network (CNN) was used to classify the respiratory sounds into healthy, chronic, and non-chronic disease.
We achieved results up to 0.993 F-Score in the three-label classification and 0.990 F-Score in the more challenging six-class classification.
arXiv Detail & Related papers (2024-02-03T15:17:32Z)
- A Federated Learning Framework for Stenosis Detection [70.27581181445329]
This study explores the use of Federated Learning (FL) for stenosis detection in coronary angiography (CA) images.
Two heterogeneous datasets from two institutions were considered: dataset 1 includes 1219 images from 200 patients, which we acquired at the Ospedale Riuniti of Ancona (Italy);
dataset 2 includes 7492 sequential images from 90 patients from a previous study available in the literature.
arXiv Detail & Related papers (2023-10-30T11:13:40Z)
- Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks [53.31894108974566]
Spiking-LEAF is a learnable auditory front-end meticulously designed for SNN-based speech processing.
On keyword spotting and speaker identification tasks, the proposed Spiking-LEAF outperforms SOTA spiking auditory front-ends.
arXiv Detail & Related papers (2023-09-18T04:03:05Z)
- Using BOLD-fMRI to Compute the Respiration Volume per Time (RTV) and Respiration Variation (RV) with Convolutional Neural Networks (CNN) in the Human Connectome Development Cohort [55.41644538483948]
This study proposes a one-dimensional CNN model for reconstruction of two respiratory measures, RV and RVT.
Results show that a CNN can capture informative features from resting BOLD signals and reconstruct realistic RV and RVT timeseries.
arXiv Detail & Related papers (2023-07-03T18:06:36Z)
- Attention-based Saliency Maps Improve Interpretability of Pneumothorax Classification [52.77024349608834]
This study investigates the chest radiograph (CXR) classification performance of vision transformers (ViTs) and the interpretability of attention-based saliency.
ViTs were fine-tuned for lung disease classification using four public data sets: CheXpert, Chest X-Ray 14, MIMIC CXR, and VinBigData.
ViTs achieved CXR classification AUCs comparable to those of state-of-the-art CNNs.
arXiv Detail & Related papers (2023-03-03T12:05:41Z)
- The role of noise in denoising models for anomaly detection in medical images [62.0532151156057]
Pathological brain lesions exhibit diverse appearance in brain images.
Unsupervised anomaly detection approaches have been proposed using only normal data for training.
We show that optimization of the spatial resolution and magnitude of the noise improves the performance of different model training regimes.
arXiv Detail & Related papers (2023-01-19T21:39:38Z)
- Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism [65.70943212672023]
A deep convolutional neural network (CNN) with an attention module can detect osteoporosis on panoramic radiographs.
A dataset of 70 panoramic radiographs (PRs) from 70 different subjects aged between 49 and 60 was used.
arXiv Detail & Related papers (2021-10-19T00:03:57Z)
- Pointwise visual field estimation from optical coherence tomography in glaucoma: a structure-function analysis using deep learning [12.70143462176992]
Standard Automated Perimetry (SAP) is the gold standard to monitor visual field (VF) loss in glaucoma management.
We developed and validated a deep learning (DL) regression model that estimates pointwise and overall VF loss from unsegmented optical coherence tomography (OCT) scans.
arXiv Detail & Related papers (2021-06-07T16:58:38Z)
- Automatic Classification of OSA related Snoring Signals from Nocturnal Audio Recordings [0.30586855806896046]
An automatic algorithm is presented to classify the nocturnal audio recording of an obstructive sleep apnoea (OSA) patient as OSA related snore, simple snore and other sounds.
Time and frequency features of the audio signal were extracted to classify the audio signal into OSA related snore, simple snore and other sounds.
arXiv Detail & Related papers (2021-02-25T13:04:30Z)
- Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks [68.8204255655161]
We adapt an ensemble of Convolutional Neural Networks to classify if a speaker is infected with COVID-19 or not.
Ultimately, it achieves an Unweighted Average Recall (UAR) of 74.9%, or an Area Under ROC Curve (AUC) of 80.7% by ensembling neural networks.
arXiv Detail & Related papers (2020-12-29T01:14:17Z)
- Inception-Based Network and Multi-Spectrogram Ensemble Applied For Predicting Respiratory Anomalies and Lung Diseases [16.318395700171624]
This paper presents an inception-based deep neural network for detecting lung diseases using respiratory sound input.
Recordings of respiratory sound collected from patients are transformed into spectrograms where both spectral and temporal information are well presented.
These spectrograms are fed into the proposed network, referred to as back-end classification, for detecting whether patients suffer from lung-relevant diseases.
arXiv Detail & Related papers (2020-12-26T08:25:02Z)
- CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z)
- Capturing scattered discriminative information using a deep architecture in acoustic scene classification [49.86640645460706]
In this study, we investigate various methods to capture discriminative information and simultaneously mitigate the overfitting problem.
We adopt a max feature map method to replace conventional non-linear activations in a deep neural network.
Two data augmentation methods and two deep architecture modules are further explored to reduce overfitting and sustain the system's discriminative power.
arXiv Detail & Related papers (2020-07-09T08:32:06Z)