Psychophysiological Arousal in Young Children Who Stutter: An
Interpretable AI Approach
- URL: http://arxiv.org/abs/2208.08859v1
- Date: Wed, 3 Aug 2022 13:28:15 GMT
- Title: Psychophysiological Arousal in Young Children Who Stutter: An
Interpretable AI Approach
- Authors: Harshit Sharma, Yi Xiao, Victoria Tumanova, Asif Salekin
- Abstract summary: The presented study effectively identifies and visualizes the second-by-second pattern differences in the physiological arousal of preschool-age children who do stutter (CWS) and who do not stutter (CWNS)
The first condition may affect children's speech due to high arousal; the latter introduces linguistic, cognitive, and communicative demands on speakers.
- Score: 6.507353572917133
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The presented first-of-its-kind study effectively identifies and visualizes
the second-by-second pattern differences in the physiological arousal of
preschool-age children who do stutter (CWS) and who do not stutter (CWNS) while
speaking perceptually fluently in two challenging conditions i.e speaking in
stressful situations and narration. The first condition may affect children's
speech due to high arousal; the latter introduces linguistic, cognitive, and
communicative demands on speakers. We collected physiological parameters data
from 70 children in the two target conditions. First, we adopt a novel
modality-wise multiple-instance-learning (MI-MIL) approach to classify CWS vs.
CWNS in different conditions effectively. The evaluation of this classifier
addresses four critical research questions that align with state-of-the-art
speech science studies' interests. Later, we leverage SHAP classifier
interpretations to visualize the salient, fine-grain, and temporal
physiological parameters unique to CWS at the population/group-level and
personalized-level. While group-level identification of distinct patterns would
enhance our understanding of stuttering etiology and development, the
personalized-level identification would enable remote, continuous, and
real-time assessment of stuttering children's physiological arousal, which may
lead to personalized, just-in-time interventions, resulting in an improvement
in speech fluency. The presented MI-MIL approach is novel, generalizable to
different domains, and real-time executable. Finally, comprehensive evaluations
are done on multiple datasets, presented framework, and several baselines that
identified notable insights on CWSs' physiological arousal during speech
production.
Related papers
- Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling [30.099739460287566]
Autism spectrum disorder (ASD) is a neurodevelopmental condition characterized by challenges in social communication, repetitive behavior, and sensory processing.
One important research area in ASD is evaluating children's behavioral changes over time during treatment.
A fundamental aspect of understanding children's behavior in these interactions is automatic speech understanding.
arXiv Detail & Related papers (2024-09-14T07:03:08Z) - Selfsupervised learning for pathological speech detection [0.0]
Speech production is susceptible to influence and disruption by various neurodegenerative pathological speech disorders.
These disorders lead to pathological speech characterized by abnormal speech patterns and imprecise articulation.
Unlike neurotypical speakers, patients with speech pathologies or impairments are unable to access various virtual assistants such as Alexa, Siri, etc.
arXiv Detail & Related papers (2024-05-16T07:12:47Z) - Exploring Speech Pattern Disorders in Autism using Machine Learning [12.469348589699766]
This study presents a comprehensive approach to identify distinctive speech patterns through the analysis of examiner-patient dialogues.
We extracted 40 speech-related features, categorized into frequency, zero-crossing rate, energy, spectral characteristics, Mel Frequency Cepstral Coefficients (MFCCs) and balance.
The classification model aimed to differentiate between ASD and non-ASD cases, achieving an accuracy of 87.75%.
arXiv Detail & Related papers (2024-05-03T02:59:15Z) - Identification of Cognitive Decline from Spoken Language through Feature
Selection and the Bag of Acoustic Words Model [0.0]
The early identification of symptoms of memory disorders plays a significant role in ensuring the well-being of populations.
The lack of standardized speech tests in clinical settings has led to a growing emphasis on developing automatic machine learning techniques for analyzing naturally spoken language.
The work presents an approach related to feature selection, allowing for the automatic selection of the essential features required for diagnosis from the Geneva minimalistic acoustic parameter set and relative speech pauses.
arXiv Detail & Related papers (2024-02-02T17:06:03Z) - CIAO! A Contrastive Adaptation Mechanism for Non-Universal Facial
Expression Recognition [80.07590100872548]
We propose Contrastive Inhibitory Adaptati On (CIAO), a mechanism that adapts the last layer of facial encoders to depict specific affective characteristics on different datasets.
CIAO presents an improvement in facial expression recognition performance over six different datasets with very unique affective representations.
arXiv Detail & Related papers (2022-08-10T15:46:05Z) - Self-supervised models of audio effectively explain human cortical
responses to speech [71.57870452667369]
We capitalize on the progress of self-supervised speech representation learning to create new state-of-the-art models of the human auditory system.
We show that these results show that self-supervised models effectively capture the hierarchy of information relevant to different stages of speech processing in human cortex.
arXiv Detail & Related papers (2022-05-27T22:04:02Z) - Perception Point: Identifying Critical Learning Periods in Speech for
Bilingual Networks [58.24134321728942]
We compare and identify cognitive aspects on deep neural-based visual lip-reading models.
We observe a strong correlation between these theories in cognitive psychology and our unique modeling.
arXiv Detail & Related papers (2021-10-13T05:30:50Z) - Automatic Analysis of the Emotional Content of Speech in Daylong
Child-Centered Recordings from a Neonatal Intensive Care Unit [3.7373314439051106]
Hundreds of hours of daylong recordings from preterm infants' audio environments were collected from two hospitals in Finland and Estonia.
We introduce this initially unannotated large-scale real-world audio dataset and describe the development of a functional SER system for the Finnish subset of the data.
We show that the best-performing models are able to achieve a classification performance of 73.4% unweighted average recall.
arXiv Detail & Related papers (2021-06-14T11:17:52Z) - A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
Identity in Dysarthric Voice Conversion [50.040466658605524]
We propose a new paradigm for maintaining speaker identity in dysarthric voice conversion (DVC)
The poor quality of dysarthric speech can be greatly improved by statistical VC.
But as the normal speech utterances of a dysarthria patient are nearly impossible to collect, previous work failed to recover the individuality of the patient.
arXiv Detail & Related papers (2021-06-02T18:41:03Z) - Pose-based Body Language Recognition for Emotion and Psychiatric Symptom
Interpretation [75.3147962600095]
We propose an automated framework for body language based emotion recognition starting from regular RGB videos.
In collaboration with psychologists, we extend the framework for psychiatric symptom prediction.
Because a specific application domain of the proposed framework may only supply a limited amount of data, the framework is designed to work on a small training set.
arXiv Detail & Related papers (2020-10-30T18:45:16Z) - 1-D Convlutional Neural Networks for the Analysis of Pupil Size
Variations in Scotopic Conditions [79.71065005161566]
1-D convolutional neural network models are trained for classification of short-range sequences.
Model provides prediction with high average accuracy on a hold out test set.
arXiv Detail & Related papers (2020-02-06T17:25:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.