An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough
Audio: A Case Study for COVID-19
- URL: http://arxiv.org/abs/2401.02996v1
- Date: Thu, 4 Jan 2024 13:09:45 GMT
- Title: An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough
Audio: A Case Study for COVID-19
- Authors: Tabish Saeed, Aneeqa Ijaz, Ismail Sadiq, Haneya N. Qureshi, Ali
Rizwan, and Ali Imran
- Abstract summary: We propose the Bias Free Network (RBFNet) to mitigate the impact of confounders in the training data distribution.
RBFNet ensures accurate and unbiased RD diagnosis features, emphasizing its relevance by incorporating a COVID-19 dataset.
An additional bias predictor is incorporated in the classification scheme to formulate a conditional Generative Adversarial Network (cGAN).
- Score: 1.1146119513912156
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Cough-based diagnosis for Respiratory Diseases (RDs) using Artificial
Intelligence (AI) has attracted considerable attention, yet many existing
studies overlook confounding variables in their predictive models. These
variables can distort the relationship between cough recordings (input data)
and RD status (output variable), leading to biased associations and unrealistic
model performance. To address this gap, we propose the Bias Free Network
(RBFNet), an end-to-end solution that effectively mitigates the impact of
confounders in the training data distribution. RBFNet ensures accurate and
unbiased RD diagnosis features, emphasizing its relevance by incorporating a
COVID-19 dataset in this study. This approach aims to enhance the reliability of
AI-based RD diagnosis models by navigating the challenges posed by confounding
variables. A hybrid of a Convolutional Neural Network (CNN) and a Long
Short-Term Memory (LSTM) network is proposed for the feature encoder module of
RBFNet. An additional bias predictor is incorporated in the classification
scheme to formulate a conditional Generative Adversarial Network (cGAN), which
helps in decorrelating the impact of confounding variables from RD prediction.
The merit of RBFNet is demonstrated by comparing its classification performance
with that of a state-of-the-art (SoTA) Deep Learning (DL) model (CNN-LSTM) after
training on different unbalanced COVID-19 data sets, created from a large-scale
proprietary cough data set. RBFNet proved robust against extremely
biased training scenarios, achieving test set accuracies of 84.1%, 84.6%, and
80.5% for the confounding variables gender, age, and smoking status,
respectively. RBFNet outperforms the CNN-LSTM model's test set accuracies by
5.5%, 7.7%, and 8.2%, respectively.
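The abstract mentions training on deliberately unbalanced data sets in which a confounder such as gender is tied to the COVID-19 label. The sketch below illustrates one way such a biased split could be built and how the resulting confounder-label association could be measured with the phi coefficient. It is not the authors' procedure: the records are synthetic, and the field names `male` and `covid`, the helper names, and the `bias` parameter are all assumptions made for illustration.

```python
import math
import random


def make_confounded_split(records, confounder, label, bias, rng):
    """Subsample `records` so the confounder becomes predictive of the label.

    A record is "aligned" when its confounder value equals its label.
    Aligned records are kept with probability `bias`, all others with
    probability `1 - bias`. bias=0.5 leaves the association unchanged in
    expectation; bias=1.0 keeps aligned records only.
    """
    kept = []
    for rec in records:
        aligned = rec[confounder] == rec[label]
        keep_prob = bias if aligned else 1.0 - bias
        if rng.random() < keep_prob:
            kept.append(rec)
    return kept


def phi_coefficient(records, a, b):
    """Phi coefficient: Pearson correlation between two binary fields."""
    n11 = sum(1 for r in records if r[a] == 1 and r[b] == 1)
    n10 = sum(1 for r in records if r[a] == 1 and r[b] == 0)
    n01 = sum(1 for r in records if r[a] == 0 and r[b] == 1)
    n00 = sum(1 for r in records if r[a] == 0 and r[b] == 0)
    denom = math.sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    return (n11 * n00 - n10 * n01) / denom if denom else 0.0


# Synthetic population (hypothetical fields): gender and label are independent.
rng = random.Random(0)
population = [
    {"male": rng.randint(0, 1), "covid": rng.randint(0, 1)}
    for _ in range(4000)
]

# Heavily biased training split: most positives are male, most negatives are not.
biased = make_confounded_split(population, "male", "covid", bias=0.95, rng=rng)

phi_population = phi_coefficient(population, "male", "covid")
phi_biased = phi_coefficient(biased, "male", "covid")
print(f"phi in population:   {phi_population:.3f}")
print(f"phi in biased split: {phi_biased:.3f}")
```

A classifier trained on the biased split can score well by detecting the confounder rather than the disease, which is the failure mode a bias predictor is meant to counteract. Evaluating on a split where the association is absent, such as `population` here, would expose the gap.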
Related papers
- Wafer Map Defect Classification Using Autoencoder-Based Data Augmentation and Convolutional Neural Network [4.8748194765816955]
This study proposes a novel method combining an autoencoder-based data augmentation technique with a convolutional neural network (CNN).
The proposed method achieves a classification accuracy of 98.56%, surpassing Random Forest, SVM, and Logistic Regression by 19%, 21%, and 27%, respectively.
arXiv Detail & Related papers (2024-11-17T10:19:54Z)
- Sustaining model performance for covid-19 detection from dynamic audio data: Development and evaluation of a comprehensive drift-adaptive framework [0.5679775668038152]
The COVID-19 pandemic has highlighted the need for robust diagnostic tools capable of detecting the disease from diverse and evolving data sources.
The dynamic nature of real-world data can lead to model drift, where performance degrades over time as the underlying data distribution changes.
This study aims to develop a framework that monitors model drift and employs adaptation mechanisms to mitigate performance fluctuations.
arXiv Detail & Related papers (2024-09-28T10:06:30Z)
- Autoencoder based approach for the mitigation of spurious correlations [2.7624021966289605]
Spurious correlations refer to erroneous associations in data that do not reflect true underlying relationships.
These correlations can lead deep neural networks (DNNs) to learn patterns that are not robust across diverse datasets or real-world scenarios.
We propose an autoencoder-based approach to analyze the nature of spurious correlations that exist in the Global Wheat Head Detection (GWHD) 2021 dataset.
arXiv Detail & Related papers (2024-06-27T05:28:44Z)
- MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z)
- Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks [7.624051346741515]
An ensemble model that combines a coarse-fine convolutional neural network and gated recurrent unit is proposed in this study.
The proposed model achieves a recall, precision, and F-score of 92.54%, 96.13%, and 94.26%, respectively.
arXiv Detail & Related papers (2023-04-13T08:30:46Z)
- Compound Density Networks for Risk Prediction using Electronic Health Records [1.1786249372283562]
We propose an integrated end-to-end approach by utilizing a Compound Density Network (CDNet).
CDNet allows the imputation method and prediction model to be tuned together within a single framework.
We validate CDNet on the mortality prediction task on the MIMIC-III dataset.
arXiv Detail & Related papers (2022-08-02T09:04:20Z)
- Bootstrapping Your Own Positive Sample: Contrastive Learning With Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z)
- UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced Data [81.00385374948125]
We present the UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD).
UNITE achieves up to 0.841 in F1 score for AD detection and up to 0.609 in PR-AUC for NASH detection, outperforming the best of various state-of-the-art baselines by up to 19%.
arXiv Detail & Related papers (2020-10-22T02:28:11Z)
- Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift [100.52588638477862]
We develop an approximate Bayesian inference scheme based on posterior regularisation.
We demonstrate the utility of our method in the context of transferring prognostic models of prostate cancer across globally diverse populations.
arXiv Detail & Related papers (2020-06-26T13:50:19Z)
- SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples.
We propose a modular acceleration system, called SUOD, to speed up large-scale unsupervised heterogeneous outlier detection.
arXiv Detail & Related papers (2020-03-11T00:22:50Z)
- Uncertainty Estimation Using a Single Deep Deterministic Neural Network [66.26231423824089]
We propose a method for training a deterministic deep model that can find and reject out of distribution data points at test time with a single forward pass.
We scale training in these with a novel loss function and centroid updating scheme and match the accuracy of softmax models.
arXiv Detail & Related papers (2020-03-04T12:27:36Z)