Statistical Design and Analysis for Robust Machine Learning: A Case
Study from COVID-19
- URL: http://arxiv.org/abs/2212.08571v1
- Date: Thu, 15 Dec 2022 13:50:13 GMT
- Title: Statistical Design and Analysis for Robust Machine Learning: A Case
Study from COVID-19
- Authors: Davide Pigoli, Kieran Baker, Jobie Budd, Lorraine Butler, Harry
Coppock, Sabrina Egglestone, Steven G. Gilmour, Chris Holmes, David Hurley,
Radka Jersakova, Ivan Kiskin, Vasiliki Koutra, Jonathon Mellor, George
Nicholson, Joe Packham, Selina Patel, Richard Payne, Stephen J. Roberts,
Bj\"orn W. Schuller, Ana Tendero-Ca\~nadas, Tracey Thornley, Alexander
Titcomb
- Abstract summary: This paper rigorously assesses state-of-the-art machine learning techniques used to predict COVID-19 infection status based on vocal audio signals.
We provide guidelines on testing the performance of methods to classify COVID-19 infection status based on acoustic features.
- Score: 45.216628450147034
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has
been interest in using artificial intelligence methods to predict COVID-19
infection status based on vocal audio signals, for example cough recordings.
However, existing studies have limitations in terms of data collection and of
the assessment of the performances of the proposed predictive models. This
paper rigorously assesses state-of-the-art machine learning techniques used to
predict COVID-19 infection status based on vocal audio signals, using a dataset
collected by the UK Health Security Agency. This dataset includes acoustic
recordings and extensive study participant meta-data. We provide guidelines on
testing the performance of methods to classify COVID-19 infection status based
on acoustic features and we discuss how these can be extended more generally to
the development and assessment of predictive methods based on public health
datasets.
Related papers
- Ultrasound-Based AI for COVID-19 Detection: A Comprehensive Review of Public and Private Lung Ultrasound Datasets and Studies [0.8431149869144428]
We focus on AI-driven studies utilizing lung ultrasound (LUS) for COVID-19 detection and analysis.
In total, we reviewed 60 articles, 41 of which utilized public datasets, while the remaining employed private data.
Our findings suggest that ultrasound-based AI studies for COVID-19 detection have great potential for clinical use, especially for children and pregnant women.
arXiv Detail & Related papers (2024-11-06T06:59:41Z) - CoVScreen: Pitfalls and recommendations for screening COVID-19 using Chest X-rays [1.0878040851637998]
The novel coronavirus (COVID-19), a highly infectious respiratory disease caused by the SARS-CoV-2 has emerged as an unprecedented healthcare crisis.
Early screening and diagnosis of symptomatic patients plays crucial role in isolation of patient to help stop community transmission.
Due to its accessibility, availability, lower-cost, ease of sanitisation, and portable setup, chest X-Ray imaging can serve as an effective screening and diagnostic tool.
arXiv Detail & Related papers (2024-05-13T12:03:15Z) - Developing a multi-variate prediction model for the detection of
COVID-19 from Crowd-sourced Respiratory Voice Data [0.0]
The novelty of this work is in the development of a deep learning model for the identification of COVID-19 patients from voice recordings.
We used the Cambridge University dataset consisting of 893 audio samples, crowd-sourced from 4352 participants that used a COVID-19 Sounds app.
Based on the voice data, we developed deep learning classification models to detect positive COVID-19 cases.
arXiv Detail & Related papers (2022-09-08T11:46:37Z) - COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset
featuring the same speakers with and without infection [4.894353840908006]
We introduce the COVYT dataset -- a novel COVID-19 dataset collected from public sources containing more than 8 hours of speech from 65 speakers.
As compared to other existing COVID-19 sound datasets, the unique feature of the COVYT dataset is that it comprises both COVID-19 positive and negative samples from all 65 speakers.
arXiv Detail & Related papers (2022-06-20T16:26:51Z) - The pitfalls of using open data to develop deep learning solutions for
COVID-19 detection in chest X-rays [64.02097860085202]
Deep learning models have been developed to identify COVID-19 from chest X-rays.
Results have been exceptional when training and testing on open-source data.
Data analysis and model evaluations show that the popular open-source dataset COVIDx is not representative of the real clinical problem.
arXiv Detail & Related papers (2021-09-14T10:59:11Z) - Project Achoo: A Practical Model and Application for COVID-19 Detection
from Recordings of Breath, Voice, and Cough [55.45063681652457]
We propose a machine learning method to quickly triage COVID-19 using recordings made on consumer devices.
The approach combines signal processing methods with fine-tuned deep learning networks and provides methods for signal denoising, cough detection and classification.
We have also developed and deployed a mobile application that uses symptoms checker together with voice, breath and cough signals to detect COVID-19 infection.
arXiv Detail & Related papers (2021-07-12T08:07:56Z) - COVIDx-US -- An open-access benchmark dataset of ultrasound imaging data
for AI-driven COVID-19 analytics [116.6248556979572]
COVIDx-US is an open-access benchmark dataset of COVID-19 related ultrasound imaging data.
It consists of 93 lung ultrasound videos and 10,774 processed images of patients infected with SARS-CoV-2 pneumonia, non-SARS-CoV-2 pneumonia, as well as healthy control cases.
arXiv Detail & Related papers (2021-03-18T03:31:33Z) - Virufy: A Multi-Branch Deep Learning Network for Automated Detection of
COVID-19 [1.9899603776429056]
Researchers have successfully presented models for detecting COVID-19 infection status using audio samples recorded in clinical settings.
We propose a multi-branch deep learning network that is trained and tested on crowdsourced data where most of the data has not been manually processed and cleaned.
arXiv Detail & Related papers (2021-03-02T15:31:09Z) - Classification supporting COVID-19 diagnostics based on patient survey
data [82.41449972618423]
logistic regression and XGBoost classifiers, that allow for effective screening of patients for COVID-19 were generated.
The obtained classification models provided the basis for the DECODE service (decode.polsl.pl), which can serve as support in screening patients with COVID-19 disease.
This data set consists of more than 3,000 examples is based on questionnaires collected at a hospital in Poland.
arXiv Detail & Related papers (2020-11-24T17:44:01Z) - Integrative Analysis for COVID-19 Patient Outcome Prediction [53.11258640541513]
We combine radiomics of lung opacities and non-imaging features from demographic data, vital signs, and laboratory findings to predict need for intensive care unit admission.
Our methods may also be applied to other lung diseases including but not limited to community acquired pneumonia.
arXiv Detail & Related papers (2020-07-20T19:08:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.