Machine Learning for Stuttering Identification: Review, Challenges &
Future Directions
- URL: http://arxiv.org/abs/2107.04057v2
- Date: Mon, 12 Jul 2021 15:32:44 GMT
- Title: Machine Learning for Stuttering Identification: Review, Challenges &
Future Directions
- Authors: Shakeel Ahmad Sheikh and Md Sahidullah and Fabrice Hirsch and Slim
Ouni
- Abstract summary: Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary pauses and repetition of sounds.
Recent developments in machine and deep learning have dramatically revolutionized speech domain.
This work fills the gap by trying to bring researchers together from interdisciplinary fields.
- Score: 9.726119468893721
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Stuttering is a speech disorder during which the flow of speech is
interrupted by involuntary pauses and repetition of sounds. Stuttering
identification is an interesting interdisciplinary domain research problem
which involves pathology, psychology, acoustics, and signal processing that
makes it hard and complicated to detect. Recent developments in machine and
deep learning have dramatically revolutionized speech domain, however minimal
attention has been given to stuttering identification. This work fills the gap
by trying to bring researchers together from interdisciplinary fields. In this
paper, we review comprehensively acoustic features, statistical and deep
learning based stuttering/disfluency classification methods. We also present
several challenges and possible future directions.
Related papers
- Self-supervised Speech Models for Word-Level Stuttered Speech Detection [66.46810024006712]
We introduce a word-level stuttering speech detection model leveraging self-supervised speech models.
Our evaluation demonstrates that our model surpasses previous approaches in word-level stuttering speech detection.
arXiv Detail & Related papers (2024-09-16T20:18:20Z) - Impact of Speech Mode in Automatic Pathological Speech Detection [14.011517808456892]
This paper analyzes the influence of speech mode on pathological speech detection approaches.
It examines two categories of approaches, i.e., classical machine learning and deep learning.
Results indicate that classical approaches may struggle to capture pathology-discriminant cues in spontaneous speech.
In contrast, deep learning approaches demonstrate superior performance, managing to extract additional cues that were previously inaccessible in non-spontaneous speech.
arXiv Detail & Related papers (2024-06-14T12:19:18Z) - Selfsupervised learning for pathological speech detection [0.0]
Speech production is susceptible to influence and disruption by various neurodegenerative pathological speech disorders.
These disorders lead to pathological speech characterized by abnormal speech patterns and imprecise articulation.
Unlike neurotypical speakers, patients with speech pathologies or impairments are unable to access various virtual assistants such as Alexa, Siri, etc.
arXiv Detail & Related papers (2024-05-16T07:12:47Z) - Deep Learning for Visual Speech Analysis: A Survey [54.53032361204449]
This paper presents a review of recent progress in deep learning methods on visual speech analysis.
We cover different aspects of visual speech, including fundamental problems, challenges, benchmark datasets, a taxonomy of existing methods, and state-of-the-art performance.
arXiv Detail & Related papers (2022-05-22T14:44:53Z) - KSoF: The Kassel State of Fluency Dataset -- A Therapy Centered Dataset
of Stuttering [58.91587609873915]
This work introduces the Kassel State of Fluency (KSoF), a therapy-based dataset containing over 5500 clips of stuttering PWSs.
The audio was recorded during therapy sessions at the Institut der Kasseler Stottertherapie.
arXiv Detail & Related papers (2022-03-10T14:17:07Z) - Deep Learning for Hate Speech Detection: A Comparative Study [54.42226495344908]
We present here a large-scale empirical comparison of deep and shallow hate-speech detection methods.
Our goal is to illuminate progress in the area, and identify strengths and weaknesses in the current state-of-the-art.
In doing so we aim to provide guidance as to the use of hate-speech detection in practice, quantify the state-of-the-art, and identify future research directions.
arXiv Detail & Related papers (2022-02-19T03:48:20Z) - STAN: A stuttering therapy analysis helper [59.37911277681339]
Stuttering is a complex speech disorder identified by repeti-tions, prolongations of sounds, syllables or words and blockswhile speaking.
We introduceSTAN, a system to aid speech therapists in stuttering therapysessions.
arXiv Detail & Related papers (2021-06-15T13:48:12Z) - Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic
Speech Synthesis [59.623780036359655]
Articulatory-to-acoustic (A2A) synthesis refers to the generation of audible speech from captured movement of the speech articulators.
This technique has numerous applications, such as restoring oral communication to people who cannot longer speak due to illness or injury.
We propose a solution to this problem based on the theory of multi-view learning.
arXiv Detail & Related papers (2020-12-30T15:09:02Z) - Stutter Diagnosis and Therapy System Based on Deep Learning [2.3581263491506097]
Stuttering, also called stammering, is a communication disorder that breaks the continuity of the speech.
This paper focuses on the implementation of a stutter diagnosis agent using Gated Recurrent CNN on MFCC audio features and therapy recommendation agent using SVM.
arXiv Detail & Related papers (2020-07-13T10:24:02Z) - Towards Automated Assessment of Stuttering and Stuttering Therapy [0.22940141855172028]
Stuttering is a complex speech disorder that can be identified by repetitions, prolongations of sounds, syllables or words, and blocks while speaking.
Common methods for the assessment of stuttering severity include percent stuttered syllables (% SS), the average of the three longest stuttering symptoms during a speech task, or the recently introduced Speech Efficiency Score (SES)
This paper introduces the Speech Control Index (SCI), a new method to evaluate the severity of stuttering.
arXiv Detail & Related papers (2020-06-16T14:50:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.