Paper overview: Conformer Based Elderly Speech Recognition System for Alzheimer's
Disease Detection
- arxiv url: http://arxiv.org/abs/2206.13232v1
- Date: Thu, 23 Jun 2022 12:50:55 GMT
- Status: Processing complete
- Last updated in system: 2022-06-28 17:18:11.433363
- Title: Conformer Based Elderly Speech Recognition System for Alzheimer's
Disease Detection
- Title (reference translation): Conformer-based elderly speech recognition system for Alzheimer's disease detection
- Authors: Tianzi Wang, Jiajun Deng, Mengzhe Geng, Zi Ye, Shoukang Hu, Yi Wang,
Mingyu Cui, Zengrui Jin, Xunying Liu, Helen Meng
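- Abstract summary: Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression.
This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus.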
- Reference score (independently computed attention measure): 62.23830810096617
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating
preventive care to delay further progression. This paper presents the
development of a state-of-the-art Conformer based speech recognition system
built on the DementiaBank Pitt corpus for automatic AD detection. The baseline
Conformer system trained with speed perturbation and SpecAugment based data
augmentation is significantly improved by incorporating a set of purposefully
designed modeling features, including neural architecture search based
auto-configuration of domain-specific Conformer hyper-parameters in addition to
parameter fine-tuning; fine-grained elderly speaker adaptation using learning
hidden unit contributions (LHUC); and two-pass cross-system rescoring based
combination with hybrid TDNN systems. An overall word error rate (WER)
reduction of 13.6% absolute (34.8% relative) was obtained on the evaluation
data of 48 elderly speakers. Using the final systems' recognition outputs to
extract textual features, the best-published speech recognition based AD
detection accuracy of 91.7% was obtained.
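- Abstract (reference translation): Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression.
This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus.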
The baseline Conformer system trained with speed perturbation and SpecAugment based data augmentation is significantly improved by incorporating a set of purposefully designed modeling features, including neural architecture search based auto-configuration of domain-specific Conformer hyper-parameters in addition to parameter fine-tuning; fine-grained elderly speaker adaptation using learning hidden unit contributions (LHUC); and two-pass cross-system rescoring based combination with hybrid TDNN systems.
An overall word error rate (WER) reduction of 13.6% absolute (34.8% relative) was obtained on the evaluation data of 48 elderly speakers.
- Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and
Dysarthric Speech Recognition [64.9816313630768]
This paper explores hyper-parameter adaptation for Conformer ASR systems pre-trained on the Librispeech corpus.
Paper, reference translation (metadata) (2023-06-27T07:49:35Z)
- Multilingual Alzheimer's Dementia Recognition through Spontaneous
Speech: a Signal Processing Grand Challenge [18.684024762601215]
Paper, reference translation (metadata) (2023-01-13T14:09:13Z)
- Exploiting prompt learning with pre-trained language models for
Alzheimer's Disease detection [70.86672569101536]
Paper, reference translation (metadata) (2022-10-29T09:18:41Z)
- Exploring linguistic feature and model combination for speech
recognition based automatic AD detection [61.91708957996086]
This paper investigates feature and model combination approaches for improving the robustness of domain fine-tuning of the BERT and RoBERTa pre-trained text encoders.
Paper, reference translation (metadata) (2022-06-28T05:09:01Z)
- On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and
Elderly Speech Recognition [53.17176024917725]
Paper, reference translation (metadata) (2022-03-28T09:12:24Z)
- Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
Paper, reference translation (metadata) (2022-01-14T17:09:22Z)
- To BERT or Not To BERT: Comparing Speech and Language-based Approaches
for Alzheimer's Disease Detection [17.99855227184379]
Paper, reference translation (metadata) (2020-07-26T04:50:47Z)
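
The abstract reports a WER reduction of 13.6% absolute (34.8% relative). The following is a minimal Python sketch of how WER is commonly computed (word-level edit distance divided by the number of reference words) and how absolute and relative reductions relate. The function names and the example sentences are hypothetical and not taken from the paper. The abstract does not state the baseline or final WER, so the baseline of roughly 39.1% is only what the two rounded figures imply, not a reported result.

```python
from typing import Tuple


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def wer_reduction(baseline_wer: float, improved_wer: float) -> Tuple[float, float]:
    """Return (absolute reduction, relative reduction as a fraction)."""
    absolute = baseline_wer - improved_wer
    relative = absolute / baseline_wer
    return absolute, relative


def implied_baseline(absolute: float, relative: float) -> float:
    """Baseline WER implied by an absolute and a relative reduction."""
    return absolute / relative


if __name__ == "__main__":
    # Hypothetical transcript pair: one deletion and one substitution.
    print(word_error_rate("the boy is on the stool", "the boy on a stool"))
    # Assumption: derived from the rounded figures in the abstract.
    baseline = implied_baseline(13.6, 0.348)
    print(round(baseline, 2), round(baseline - 13.6, 2))
```

Because both figures in the abstract are rounded, the implied baseline and final WER are approximate. The paper itself is the authority for the exact values.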