KSoF: The Kassel State of Fluency Dataset -- A Therapy Centered Dataset
of Stuttering
- URL: http://arxiv.org/abs/2203.05383v1
- Date: Thu, 10 Mar 2022 14:17:07 GMT
- Title: KSoF: The Kassel State of Fluency Dataset -- A Therapy Centered Dataset
of Stuttering
- Authors: Sebastian P. Bayerl, Alexander Wolff von Gudenberg, Florian H\"onig,
Elmar N\"oth and Korbinian Riedhammer
- Abstract summary: This work introduces the Kassel State of Fluency (KSoF), a therapy-based dataset containing over 5500 clips of stuttering PWSs.
The audio was recorded during therapy sessions at the Institut der Kasseler Stottertherapie.
- Score: 58.91587609873915
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Stuttering is a complex speech disorder that negatively affects an
individual's ability to communicate effectively. Persons who stutter (PWS)
often suffer considerably under the condition and seek help through therapy.
Fluency shaping is a therapy approach where PWSs learn to modify their speech
to help them to overcome their stutter. Mastering such speech techniques takes
time and practice, even after therapy. Shortly after therapy, success is
evaluated highly, but relapse rates are high. To be able to monitor speech
behavior over a long time, the ability to detect stuttering events and
modifications in speech could help PWSs and speech pathologists to track the
level of fluency. Monitoring could create the ability to intervene early by
detecting lapses in fluency. To the best of our knowledge, no public dataset is
available that contains speech from people who underwent stuttering therapy
that changed the style of speaking. This work introduces the Kassel State of
Fluency (KSoF), a therapy-based dataset containing over 5500 clips of PWSs. The
clips were labeled with six stuttering-related event types: blocks,
prolongations, sound repetitions, word repetitions, interjections, and -
specific to therapy - speech modifications. The audio was recorded during
therapy sessions at the Institut der Kasseler Stottertherapie. The data will be
made available for research purposes upon request.
Related papers
- Self-supervised Speech Models for Word-Level Stuttered Speech Detection [66.46810024006712]
We introduce a word-level stuttering speech detection model leveraging self-supervised speech models.
Our evaluation demonstrates that our model surpasses previous approaches in word-level stuttering speech detection.
arXiv Detail & Related papers (2024-09-16T20:18:20Z) - DisfluencyFixer: A tool to enhance Language Learning through Speech To
Speech Disfluency Correction [50.51901599433536]
DisfluencyFixer is a tool that performs speech-to-speech disfluency correction in English and Hindi.
Our proposed system removes disfluencies from input speech and returns fluent speech as output.
arXiv Detail & Related papers (2023-05-26T14:13:38Z) - Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0 [0.22940141855172028]
Fine-tuning wav2vec 2.0 for the classification of stuttering on a sizeable English corpus boosts the effectiveness of the general-purpose features.
We evaluate our method on Fluencybank and the German therapy-centric Kassel State of Fluency dataset.
arXiv Detail & Related papers (2022-04-07T13:02:12Z) - Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
This paper investigates a set of data augmentation techniques for disordered speech recognition.
Both normal and disordered speech were exploited in the augmentation process.
The final speaker adapted system constructed using the UASpeech corpus and the best augmentation approach based on speed perturbation produced up to 2.92% absolute word error rate (WER)
arXiv Detail & Related papers (2022-01-14T17:09:22Z) - STAN: A stuttering therapy analysis helper [59.37911277681339]
Stuttering is a complex speech disorder identified by repeti-tions, prolongations of sounds, syllables or words and blockswhile speaking.
We introduceSTAN, a system to aid speech therapists in stuttering therapysessions.
arXiv Detail & Related papers (2021-06-15T13:48:12Z) - Stuttering Speech Disfluency Prediction using Explainable Attribution
Vectors of Facial Muscle Movements [2.6540572249827514]
Speech disorders such as stuttering disrupt the normal fluency of speech by involuntary repetitions, prolongations and blocking of sounds and syllables.
Recent studies have explored automatic detection of stuttering using Artificial Intelligence (AI) based algorithm from respiratory rate, audio, etc. during speech utterance.
We hypothesize that pre-speech facial activity in AWS, which can be captured non-invasively, contains enough information to accurately classify the upcoming utterance as either fluent or stuttered.
arXiv Detail & Related papers (2020-10-02T23:45:41Z) - Silent Speech Interfaces for Speech Restoration: A Review [59.68902463890532]
Silent speech interface (SSI) research aims to provide alternative and augmentative communication methods for persons with severe speech disorders.
SSIs rely on non-acoustic biosignals generated by the human body during speech production to enable communication.
Most present-day SSIs have only been validated in laboratory settings for healthy users.
arXiv Detail & Related papers (2020-09-04T11:05:50Z) - Stutter Diagnosis and Therapy System Based on Deep Learning [2.3581263491506097]
Stuttering, also called stammering, is a communication disorder that breaks the continuity of the speech.
This paper focuses on the implementation of a stutter diagnosis agent using Gated Recurrent CNN on MFCC audio features and therapy recommendation agent using SVM.
arXiv Detail & Related papers (2020-07-13T10:24:02Z) - Towards Automated Assessment of Stuttering and Stuttering Therapy [0.22940141855172028]
Stuttering is a complex speech disorder that can be identified by repetitions, prolongations of sounds, syllables or words, and blocks while speaking.
Common methods for the assessment of stuttering severity include percent stuttered syllables (% SS), the average of the three longest stuttering symptoms during a speech task, or the recently introduced Speech Efficiency Score (SES)
This paper introduces the Speech Control Index (SCI), a new method to evaluate the severity of stuttering.
arXiv Detail & Related papers (2020-06-16T14:50:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.