Automatic Speaker Independent Dysarthric Speech Intelligibility
Assessment System
- URL: http://arxiv.org/abs/2103.06157v1
- Date: Wed, 10 Mar 2021 16:15:32 GMT
- Title: Automatic Speaker Independent Dysarthric Speech Intelligibility
Assessment System
- Authors: Ayush Tripathi and Swapnil Bhosale and Sunil Kumar Kopparapu
- Abstract summary: Dysarthria is a condition which hampers the ability of an individual to control the muscles that play a major role in speech delivery.
The loss of fine control over muscles that assist the movement of lips, vocal chords, tongue and diaphragm results in abnormal speech delivery.
One can assess the level of dysarthria by analyzing the intelligibility of speech spoken by an individual.
- Score: 28.01689694536572
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Dysarthria is a condition which hampers the ability of an individual to
control the muscles that play a major role in speech delivery. The loss of fine
control over muscles that assist the movement of lips, vocal chords, tongue and
diaphragm results in abnormal speech delivery. One can assess the severity
level of dysarthria by analyzing the intelligibility of speech spoken by an
individual. Continuous intelligibility assessment helps speech language
pathologists not only study the impact of medication but also allows them to
plan personalized therapy. It helps the clinicians immensely if the
intelligibility assessment system is reliable, automatic, simple for (a)
patients to undergo and (b) clinicians to interpret. Lack of availability of
dysarthric data has resulted in development of speaker dependent automatic
intelligibility assessment systems which requires patients to speak a large
number of utterances. In this paper, we propose (a) a cost minimization
procedure to select an optimal (small) number of utterances that need to be
spoken by the dysarthric patient, (b) four different speaker independent
intelligibility assessment systems which require the patient to speak a small
number of words, and (c) the assessment score is close to the perceptual score
that the Speech Language Pathologist (SLP) can relate to. The need for small
number of utterances to be spoken by the patient and the score being relatable
to the SLP benefits both the dysarthric patient and the clinician from
usability perspective.
Related papers
- Self-supervised Speech Models for Word-Level Stuttered Speech Detection [66.46810024006712]
We introduce a word-level stuttering speech detection model leveraging self-supervised speech models.
Our evaluation demonstrates that our model surpasses previous approaches in word-level stuttering speech detection.
arXiv Detail & Related papers (2024-09-16T20:18:20Z) - A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions [0.48124799513933847]
In this study, we analyse the importance of linguistic characteristics in accurately classifying patient voices.
We fine-tuned a pre-trained Language Model on the combined datasets with similar linguistic patterns, resulting in a highly accurate automatic patient voice classification.
Being the pioneering study on the topic, our focus on extracting authentic patient experiences from social media stands as a crucial step towards advancing healthcare standards.
arXiv Detail & Related papers (2024-07-23T15:51:46Z) - Speech-based Clinical Depression Screening: An Empirical Study [32.84863235794086]
This study investigates the utility of speech signals for AI-based depression screening across varied interaction scenarios.
participants include depressed patients recruited from the outpatient clinics of Peking University Sixth Hospital.
We extracted acoustic and deep speech features from each participant's segmented recordings.
arXiv Detail & Related papers (2024-06-05T09:43:54Z) - Identification of Cognitive Decline from Spoken Language through Feature
Selection and the Bag of Acoustic Words Model [0.0]
The early identification of symptoms of memory disorders plays a significant role in ensuring the well-being of populations.
The lack of standardized speech tests in clinical settings has led to a growing emphasis on developing automatic machine learning techniques for analyzing naturally spoken language.
The work presents an approach related to feature selection, allowing for the automatic selection of the essential features required for diagnosis from the Geneva minimalistic acoustic parameter set and relative speech pauses.
arXiv Detail & Related papers (2024-02-02T17:06:03Z) - Automatically measuring speech fluency in people with aphasia: first
achievements using read-speech data [55.84746218227712]
This study aims at assessing the relevance of a signalprocessingalgorithm, initially developed in the field of language acquisition, for the automatic measurement of speech fluency.
arXiv Detail & Related papers (2023-08-09T07:51:40Z) - Disentangled Latent Speech Representation for Automatic Pathological
Intelligibility Assessment [10.93598143328628]
We show that disentangled speech representations can be used for automatic pathological speech intelligibility assessment.
Our results are among the first to show that disentangled speech representations can be used for automatic pathological speech intelligibility assessment.
arXiv Detail & Related papers (2022-04-08T12:02:14Z) - Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
This paper investigates a set of data augmentation techniques for disordered speech recognition.
Both normal and disordered speech were exploited in the augmentation process.
The final speaker adapted system constructed using the UASpeech corpus and the best augmentation approach based on speed perturbation produced up to 2.92% absolute word error rate (WER)
arXiv Detail & Related papers (2022-01-14T17:09:22Z) - STAN: A stuttering therapy analysis helper [59.37911277681339]
Stuttering is a complex speech disorder identified by repeti-tions, prolongations of sounds, syllables or words and blockswhile speaking.
We introduceSTAN, a system to aid speech therapists in stuttering therapysessions.
arXiv Detail & Related papers (2021-06-15T13:48:12Z) - A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
Identity in Dysarthric Voice Conversion [50.040466658605524]
We propose a new paradigm for maintaining speaker identity in dysarthric voice conversion (DVC)
The poor quality of dysarthric speech can be greatly improved by statistical VC.
But as the normal speech utterances of a dysarthria patient are nearly impossible to collect, previous work failed to recover the individuality of the patient.
arXiv Detail & Related papers (2021-06-02T18:41:03Z) - NUVA: A Naming Utterance Verifier for Aphasia Treatment [49.114436579008476]
Assessment of speech performance using picture naming tasks is a key method for both diagnosis and monitoring of responses to treatment interventions by people with aphasia (PWA)
Here we present NUVA, an utterance verification system incorporating a deep learning element that classifies 'correct' versus'incorrect' naming attempts from aphasic stroke patients.
When tested on eight native British-English speaking PWA the system's performance accuracy ranged between 83.6% to 93.6%, with a 10-fold cross-validation mean of 89.5%.
arXiv Detail & Related papers (2021-02-10T13:00:29Z) - Comparison of Speaker Role Recognition and Speaker Enrollment Protocol
for conversational Clinical Interviews [9.728371067160941]
We train end-to-end neural network architectures to adapt to each task and evaluate each approach under the same metric.
Results do not depend on the demographics of the Interviewee, highlighting the clinical relevance of our methods.
arXiv Detail & Related papers (2020-10-30T09:07:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.