Classifying Autism from Crowdsourced Semi-Structured Speech Recordings:
A Machine Learning Approach
- URL: http://arxiv.org/abs/2201.00927v1
- Date: Tue, 4 Jan 2022 01:31:02 GMT
- Title: Classifying Autism from Crowdsourced Semi-Structured Speech Recordings:
A Machine Learning Approach
- Authors: Nathan A. Chi, Peter Washington, Aaron Kline, Arman Husic, Cathy Hou,
Chloe He, Kaitlyn Dunlap, and Dennis Wall
- Abstract summary: We present a suite of machine learning approaches to detect autism in self-recorded speech audio captured from autistic and neurotypical (NT) children in home environments.
We consider three methods to detect autism in child speech: first, Random Forests trained on extracted audio features; second, convolutional neural networks (CNNs) trained on spectrograms; and third, fine-tuned wav2vec 2.0--a state-of-the-art Transformer-based ASR model.
- Score: 0.9945783208680666
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Autism spectrum disorder (ASD) is a neurodevelopmental disorder which results
in altered behavior, social development, and communication patterns. In past
years, autism prevalence has tripled, with 1 in 54 children now affected. Given
that traditional diagnosis is a lengthy, labor-intensive process, significant
attention has been given to developing systems that automatically screen for
autism. Prosody abnormalities are among the clearest signs of autism, with
affected children displaying speech idiosyncrasies including echolalia,
monotonous intonation, atypical pitch, and irregular linguistic stress
patterns. In this work, we present a suite of machine learning approaches to
detect autism in self-recorded speech audio captured from autistic and
neurotypical (NT) children in home environments. We consider three methods to
detect autism in child speech: first, Random Forests trained on extracted audio
features (including Mel-frequency cepstral coefficients); second, convolutional
neural networks (CNNs) trained on spectrograms; and third, fine-tuned wav2vec
2.0--a state-of-the-art Transformer-based ASR model. We train our classifiers
on our novel dataset of cellphone-recorded child speech audio curated from
Stanford's Guess What? mobile game, an app designed to crowdsource videos of
autistic and neurotypical children in a natural home environment. The Random
Forest classifier achieves 70% accuracy, the fine-tuned wav2vec 2.0 model
achieves 77% accuracy, and the CNN achieves 79% accuracy when classifying
children's audio as either ASD or NT. Our models were able to predict autism
status when training on a varied selection of home audio clips with
inconsistent recording quality, which may be more generalizable to real world
conditions. These results demonstrate that machine learning methods offer
promise in detecting autism automatically from speech without specialized
equipment.
Related papers
- Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition [47.550391816383794]
We introduce a novel problem of audio-visual autism behavior recognition.
Social behavior recognition is an essential aspect previously omitted in AI-assisted autism screening research.
We will release our dataset, code, and pre-trained models.
arXiv Detail & Related papers (2024-03-22T22:52:35Z) - Autism Detection in Speech - A Survey [0.9028773906859542]
We look at linguistic, prosodic and acoustic cues that could indicate autism.
We especially look at observations such as verbal and semantic fluency, prosodic features, but also disfluencies and speaking rate.
We conclude, while there already is a lot of research, female patients seem to be severely under-researched.
arXiv Detail & Related papers (2024-02-20T10:18:18Z) - Toward a realistic model of speech processing in the brain with
self-supervised learning [67.7130239674153]
Self-supervised algorithms trained on the raw waveform constitute a promising candidate.
We show that Wav2Vec 2.0 learns brain-like representations with as little as 600 hours of unlabelled speech.
arXiv Detail & Related papers (2022-06-03T17:01:46Z) - Recent Progress in the CUHK Dysarthric Speech Recognition System [66.69024814159447]
Disordered speech presents a wide spectrum of challenges to current data intensive deep neural networks (DNNs) based automatic speech recognition technologies.
This paper presents recent research efforts at the Chinese University of Hong Kong to improve the performance of disordered speech recognition systems.
arXiv Detail & Related papers (2022-01-15T13:02:40Z) - Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
This paper investigates a set of data augmentation techniques for disordered speech recognition.
Both normal and disordered speech were exploited in the augmentation process.
The final speaker adapted system constructed using the UASpeech corpus and the best augmentation approach based on speed perturbation produced up to 2.92% absolute word error rate (WER)
arXiv Detail & Related papers (2022-01-14T17:09:22Z) - Predi\c{c}\~ao da Idade Cerebral a partir de Imagens de Resson\^ancia
Magn\'etica utilizando Redes Neurais Convolucionais [57.52103125083341]
Deep learning techniques for brain age prediction from magnetic resonance images are investigated.
The identification of biomarkers is useful for detecting an early-stage neurodegenerative process, as well as for predicting age-related or non-age-related cognitive decline.
The best result was obtained by the 2D model, which achieved a mean absolute error of 3.83 years.
arXiv Detail & Related papers (2021-12-23T14:51:45Z) - Detecting Autism Spectrum Disorders with Machine Learning Models Using
Speech Transcripts [0.0]
Autism spectrum disorder (ASD) can be defined as a neurodevelopmental disorder that affects how children interact, communicate and socialize with others.
Current methods to accurately diagnose ASD are invasive, time-consuming, and tedious.
New technologies are rapidly emerging that include machine learning models using speech, computer vision from facial, retinal, and brain MRI images of patients to accurately and timely detect this disorder.
arXiv Detail & Related papers (2021-10-07T09:10:15Z) - Development of an autism screening classification model for toddlers [0.0]
Autism spectrum disorder ASD is a neurodevelopmental disorder associated with challenges in communication, social interaction, and repetitive behaviors.
This work contributes to the early screening of toddlers by helping identify those who have ASD traits and should pursue formal clinical diagnosis.
arXiv Detail & Related papers (2021-09-29T09:07:39Z) - Diagnosis of Autism in Children using Facial Analysis and Deep Learning [0.0]
We introduce a deep learning model to classify children as either healthy or potentially autistic with 94.6% accuracy using Deep Learning.
Autistic patients struggle with social skills, repetitive behaviors, and communication, both verbal and nonverbal.
Based on our accuracy, we propose that the diagnosis of autism can be done effectively using only a picture.
arXiv Detail & Related papers (2020-08-06T22:15:20Z) - 4D Spatio-Temporal Deep Learning with 4D fMRI Data for Autism Spectrum
Disorder Classification [69.62333053044712]
We propose a 4D convolutional deep learning approach for ASD classification where we jointly learn from spatial and temporal data.
We employ 4D neural networks and convolutional-recurrent models which outperform a previous approach with an F1-score of 0.71 compared to an F1-score of 0.65.
arXiv Detail & Related papers (2020-04-21T17:19:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.