Related papers: EZhouNet:A framework based on graph neural network and anchor interval for the respiratory sound event detection

EZhouNet:A framework based on graph neural network and anchor interval for the respiratory sound event detection

URL: http://arxiv.org/abs/2509.01153v2
Date: Thu, 04 Sep 2025 01:07:56 GMT
Title: EZhouNet:A framework based on graph neural network and anchor interval for the respiratory sound event detection
Authors: Yun Chu, Qiuhao Wang, Enze Zhou, Qian Liu, Gang Zheng,
Abstract summary: We propose a graph neural network-based framework with anchor intervals, capable of handling variable-length audio.<n>Our method improves both the flexibility and applicability of respiratory sound detection.
Score: 7.29257171556766
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Auscultation is a key method for early diagnosis of respiratory and pulmonary diseases, relying on skilled healthcare professionals. However, the process is often subjective, with variability between experts. As a result, numerous deep learning-based automatic classification methods have emerged, most of which focus on respiratory sound classification. In contrast, research on respiratory sound event detection remains limited. Existing sound event detection methods typically rely on frame-level predictions followed by post-processing to generate event-level outputs, making interval boundaries challenging to learn directly. Furthermore, many approaches can only handle fixed-length audio, limiting their applicability to variable-length respiratory sounds. Additionally, the impact of respiratory sound location information on detection performance has not been extensively explored. To address these issues, we propose a graph neural network-based framework with anchor intervals, capable of handling variable-length audio and providing more precise temporal localization for abnormal respiratory sound events. Our method improves both the flexibility and applicability of respiratory sound detection. Experiments on the SPRSound 2024 and HF Lung V1 datasets demonstrate the effectiveness of the proposed approach, and incorporating respiratory position information enhances the discrimination between abnormal sounds. The reference implementation is available at https://github.com/chumingqian/EzhouNet.

Related papers

Pre-Trained Foundation Model representations to uncover Breathing patterns in Speech [2.935056044470713]
Respiratory rate (RR) is a vital metric that is used to assess the overall health, fitness, and general well-being of an individual. Existing approaches to measure RR are performed using specialized equipment or training. Studies have demonstrated that machine learning algorithms can be used to estimate RR using bio-sensor signals as input.
arXiv Detail & Related papers (2024-07-17T21:57:18Z)
Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases [5.810320353233697]
We introduce Rene, a pioneering large-scale model tailored for respiratory sound recognition. Our innovative approach applies a pre-trained speech recognition model to process respiratory sounds. We have developed a real-time respiratory sound discrimination system utilizing the Rene architecture.
arXiv Detail & Related papers (2024-05-13T03:00:28Z)
Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification [1.690115983364313]
We introduce cross-domain adaptation techniques, which transfer the knowledge from a source domain to a distinct target domain. In particular, by considering different stethoscope types as individual domains, we propose a novel stethoscope-guided supervised contrastive learning approach. The experimental results on the ICBHI dataset demonstrate that the proposed methods are effective in reducing the domain dependency and achieving the ICBHI Score of 61.71%, which is a significant improvement of 2.16% over the baseline.
arXiv Detail & Related papers (2023-12-15T08:34:31Z)
The role of noise in denoising models for anomaly detection in medical images [62.0532151156057]
Pathological brain lesions exhibit diverse appearance in brain images. Unsupervised anomaly detection approaches have been proposed using only normal data for training. We show that optimization of the spatial resolution and magnitude of the noise improves the performance of different model training regimes.
arXiv Detail & Related papers (2023-01-19T21:39:38Z)
Fuzzy Attention Neural Network to Tackle Discontinuity in Airway Segmentation [67.19443246236048]
Airway segmentation is crucial for the examination, diagnosis, and prognosis of lung diseases. Some small-sized airway branches (e.g., bronchus and terminaloles) significantly aggravate the difficulty of automatic segmentation. This paper presents an efficient method for airway segmentation, comprising a novel fuzzy attention neural network and a comprehensive loss function.
arXiv Detail & Related papers (2022-09-05T16:38:13Z)
Preservation of High Frequency Content for Deep Learning-Based Medical Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists. We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z)
A Deep Learning Approach to Predicting Collateral Flow in Stroke Patients Using Radiomic Features from Perfusion Images [58.17507437526425]
Collateral circulation results from specialized anastomotic channels which provide oxygenated blood to regions with compromised blood flow. The actual grading is mostly done through manual inspection of the acquired images. We present a deep learning approach to predicting collateral flow grading in stroke patients based on radiomic features extracted from MR perfusion data.
arXiv Detail & Related papers (2021-10-24T18:58:40Z)
Collaborative Three-Tier Architecture Non-contact Respiratory Rate Monitoring using Target Tracking and False Peaks Eliminating Algorithms [10.232449356645608]
Non-contact respiratory monitoring techniques have poor accuracy because they are sensitive to environmental influences like lighting and motion artifacts. frequent contact between users and the cloud might cause service request delays and potentially the loss of personal data. We proposed a non-contact respiratory rate monitoring system with a cooperative three-layer design to increase the precision of respiratory monitoring and decrease data transmission latency.
arXiv Detail & Related papers (2020-11-17T07:33:00Z)
Respiratory Sound Classification Using Long-Short Term Memory [62.997667081978825]
This paper examines the difficulties that exist when attempting to perform sound classification as it relates to respiratory disease classification. An examination on the use of deep learning and long short-term memory networks is performed in order to identify how such a task can be implemented.
arXiv Detail & Related papers (2020-08-06T23:11:57Z)
Capturing scattered discriminative information using a deep architecture in acoustic scene classification [49.86640645460706]
In this study, we investigate various methods to capture discriminative information and simultaneously mitigate the overfitting problem. We adopt a max feature map method to replace conventional non-linear activations in a deep neural network. Two data augment methods and two deep architecture modules are further explored to reduce overfitting and sustain the system's discriminative power.
arXiv Detail & Related papers (2020-07-09T08:32:06Z)
Deep Learning for Automatic Pneumonia Detection [72.55423549641714]
Pneumonia is the leading cause of death among young children and one of the top mortality causes worldwide. Computer-aided diagnosis systems showed the potential for improving diagnostic accuracy. We develop the computational approach for pneumonia regions detection based on single-shot detectors, squeeze-and-excitation deep convolution neural networks, augmentations and multi-task learning.
arXiv Detail & Related papers (2020-05-28T10:54:34Z)
CNN-MoE based framework for classification of respiratory anomalies and lung disease detection [33.45087488971683]
This paper presents and explores a robust deep learning framework for auscultation analysis. It aims to classify anomalies in respiratory cycles and detect disease, from respiratory sound recordings.
arXiv Detail & Related papers (2020-04-04T21:45:06Z)
Robust Deep Learning Framework For Predicting Respiratory Anomalies and Diseases [26.786743524562322]
This paper presents a robust deep learning framework developed to detect respiratory diseases from recordings of respiratory sounds. A back-end deep learning model classifies the features into classes of respiratory disease or anomaly. Experiments, conducted over the ICBHI benchmark dataset of respiratory sounds, evaluate the ability of the framework to classify sounds.
arXiv Detail & Related papers (2020-01-21T15:26:52Z)

This list is automatically generated from the titles and abstracts of the papers in this site.