On the pragmatism of using binary classifiers over data intensive neural
network classifiers for detection of COVID-19 from voice
- URL: http://arxiv.org/abs/2204.04802v1
- Date: Mon, 11 Apr 2022 00:19:14 GMT
- Title: On the pragmatism of using binary classifiers over data intensive neural
network classifiers for detection of COVID-19 from voice
- Authors: Ankit Shah, Hira Dhamyal, Yang Gao, Rita Singh, Bhiksha Raj
- Abstract summary: We show that detecting COVID-19 from voice does not require custom-made non-standard features or complicated neural network classifiers.
We demonstrate this from a human-curated dataset collected and calibrated in clinical settings.
- Score: 34.553128768223615
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Lately, there has been a global effort by multiple research groups to detect
COVID-19 from voice. Different researchers use different kinds of information
from the voice signal to achieve this. Various types of phonated sounds and the
sound of cough and breath have all been used with varying degrees of success in
automated voice-based COVID-19 detection apps. In this paper, we show that
detecting COVID-19 from voice does not require custom-made non-standard
features or complicated neural network classifiers rather it can be
successfully done with just standard features and simple binary classifiers. In
fact, we show that the latter is not only more accurate and interpretable and
also more computationally efficient in that they can be run locally on small
devices. We demonstrate this from a human-curated dataset collected and
calibrated in clinical settings. On this dataset which comprises over 1000
speakers, a simple binary classifier is able to achieve 94% detection accuracy.
Related papers
- Developing a Multi-variate Prediction Model For COVID-19 From
Crowd-sourced Respiratory Voice Data [0.0]
We develop a deep learning model to identify COVID-19 from voice recording data.
We use the Cambridge COVID-19 Sound database which contains 893 speech samples, crowd-sourced from 4352 participants via a COVID-19 Sounds app.
Based on the voice data, we develop deep learning classification models to detect COVID-19 cases.
arXiv Detail & Related papers (2024-02-12T12:52:47Z) - Faked Speech Detection with Zero Prior Knowledge [2.407976495888858]
We introduce a neural network method to develop a classifier that will blindly classify an input audio as real or mimicked.
We propose a deep neural network following a sequential model that comprises three hidden layers, with alternating dense and drop out layers.
We were able to get at least 94% correct classification of the test cases, as against the 85% accuracy in the case of human observers.
arXiv Detail & Related papers (2022-09-26T10:38:39Z) - Developing a multi-variate prediction model for the detection of
COVID-19 from Crowd-sourced Respiratory Voice Data [0.0]
The novelty of this work is in the development of a deep learning model for the identification of COVID-19 patients from voice recordings.
We used the Cambridge University dataset consisting of 893 audio samples, crowd-sourced from 4352 participants that used a COVID-19 Sounds app.
Based on the voice data, we developed deep learning classification models to detect positive COVID-19 cases.
arXiv Detail & Related papers (2022-09-08T11:46:37Z) - Fully Automated End-to-End Fake Audio Detection [57.78459588263812]
This paper proposes a fully automated end-toend fake audio detection method.
We first use wav2vec pre-trained model to obtain a high-level representation of the speech.
For the network structure, we use a modified version of the differentiable architecture search (DARTS) named light-DARTS.
arXiv Detail & Related papers (2022-08-20T06:46:55Z) - Project Achoo: A Practical Model and Application for COVID-19 Detection
from Recordings of Breath, Voice, and Cough [55.45063681652457]
We propose a machine learning method to quickly triage COVID-19 using recordings made on consumer devices.
The approach combines signal processing methods with fine-tuned deep learning networks and provides methods for signal denoising, cough detection and classification.
We have also developed and deployed a mobile application that uses symptoms checker together with voice, breath and cough signals to detect COVID-19 infection.
arXiv Detail & Related papers (2021-07-12T08:07:56Z) - Spotting adversarial samples for speaker verification by neural vocoders [102.1486475058963]
We adopt neural vocoders to spot adversarial samples for automatic speaker verification (ASV)
We find that the difference between the ASV scores for the original and re-synthesize audio is a good indicator for discrimination between genuine and adversarial samples.
Our codes will be made open-source for future works to do comparison.
arXiv Detail & Related papers (2021-07-01T08:58:16Z) - Virufy: A Multi-Branch Deep Learning Network for Automated Detection of
COVID-19 [1.9899603776429056]
Researchers have successfully presented models for detecting COVID-19 infection status using audio samples recorded in clinical settings.
We propose a multi-branch deep learning network that is trained and tested on crowdsourced data where most of the data has not been manually processed and cleaned.
arXiv Detail & Related papers (2021-03-02T15:31:09Z) - Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural
Networks [68.8204255655161]
We adapt an ensemble of Convolutional Neural Networks to classify if a speaker is infected with COVID-19 or not.
Ultimately, it achieves an Unweighted Average Recall (UAR) of 74.9%, or an Area Under ROC Curve (AUC) of 80.7% by ensembling neural networks.
arXiv Detail & Related papers (2020-12-29T01:14:17Z) - COVID-19 Patient Detection from Telephone Quality Speech Data [4.726777092009554]
We try to investigate the presence of cues about the COVID-19 disease in the speech data.
An SVM classifier on this dataset is able to achieve an accuracy of 88.6% and an F1-Score of 92.7%.
Some phone classes, such as nasals, stops, and mid vowels can distinguish the two classes better than the others.
arXiv Detail & Related papers (2020-11-09T10:16:08Z) - Discriminative Nearest Neighbor Few-Shot Intent Detection by
Transferring Natural Language Inference [150.07326223077405]
Few-shot learning is attracting much attention to mitigate data scarcity.
We present a discriminative nearest neighbor classification with deep self-attention.
We propose to boost the discriminative ability by transferring a natural language inference (NLI) model.
arXiv Detail & Related papers (2020-10-25T00:39:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.