Fugu-MT 論文翻訳(概要): Phonological Subspace Collapse Is Aetiology-Specific and Cross-Lingually Stable: Evidence from 3,374 Speakers

論文の概要: Phonological Subspace Collapse Is Aetiology-Specific and Cross-Lingually Stable: Evidence from 3,374 Speakers

arxiv url: http://arxiv.org/abs/2604.21706v1
Date: Thu, 23 Apr 2026 14:12:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.586643
Title: Phonological Subspace Collapse Is Aetiology-Specific and Cross-Lingually Stable: Evidence from 3,374 Speakers
Title（参考訳）: 音韻的部分空間の崩壊は、エトロジーに特有で言語横断的に安定している:3,374人の話者による証拠
Authors: Bernard Muller, Antonio Armando Ortiz Barrañón, LaVonne Roberts,
Abstract要約: HuBERTをベースとした5言語890話者を対象にした音韻的特徴部分空間に基づく難聴度評価のためのトレーニングフリーフレームワーク。 12言語および5言語にまたがる25言語話者の分析(パーキンソン病、脳性麻痺、ALSダウン症候群、脳卒中) 代表標本における言語間プロファイル形状と安定性のクロスバックボーン
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We previously introduced a training-free method for dysarthria severity assessment based on d-prime separability of phonological feature subspaces in frozen self-supervised speech representations, validated on 890 speakers across 5 languages with HuBERT-base. Here, we scale the analysis to 3,374 speakers from 25 datasets spanning 12 languages and 5 aetiologies (Parkinson's disease, cerebral palsy, ALS, Down syndrome, and stroke), plus healthy controls, using 6 SSL backbones. We report three findings. First, aetiology-specific degradation profiles are distinguishable at the group level: 10 of 13 features yield large effect sizes (epsilon-squared > 0.14, Holm-corrected p < 0.001), with Parkinson's disease separable from the articulatory execution group at Cohen's d = 0.83; individual-level classification remains limited (22.6% macro F1). Second, profiles show cross-lingual profile-shape stability: cosine similarity of 5-dimensional consonant d-prime profiles exceeds 0.95 across the languages available for each aetiology. Absolute d-prime magnitudes are not cross-lingually calibrated, so the method supports language-independent phenotyping of degradation patterns but requires within-corpus calibration for absolute severity interpretation. Third, the method is architecture-independent: all 6 backbones produce monotonic severity gradients with inter-model agreement exceeding rho = 0.77. Fixed-token d-prime estimation preserves the severity correlation (rho = -0.733 at 200 tokens per class), confirming that the signal is not a token-count artefact. These results support phonological subspace analysis as a robust, training-free framework for aetiology-aware dysarthria characterisation, with evidence of cross-lingual profile-shape stability and cross-backbone robustness in the represented sample.
Abstract（参考訳）: 凍結自己教師型音声表現における音韻特徴部分空間のd-prime分離性に基づく難聴度評価のトレーニングを,HuBERTベース5言語890話者を対象に実施した。そこで本研究では,12言語と5つのエチオロジー(パーキンソン病,脳性麻痺,ALS,ダウンシンドローム,脳卒中)にまたがる25のデータセットから3,374人の話者に,SSLバックボーンを6つ使用して解析を行った。我々は3つの発見を報告した。第一に、エトロジー特異的な劣化プロファイルはグループレベルで識別可能である:13の特徴のうち10は大きな効果サイズ(エプシロン二乗法>0.14、ホルム補正法 p < 0.001)を生じるが、パーキンソン病はコーエンのd = 0.83の調音実行群から分離可能であり、個々のレベルの分類は限定的である(マクロF1の22.6%)。第二に、プロファイルは言語間プロファイル形状の安定性を示す: 5次元子音d-プライムプロファイルのコサイン類似性は、各エチオロジーで利用可能な言語で0.95以上である。絶対的なd-プライム等級は言語横断的に校正されないため、劣化パターンの言語非依存表現型化をサポートするが、絶対重大度解釈には体内校正が必要である。第三に、この手法はアーキテクチャ非依存であり、すべての6つのバックボーンは、rho = 0.77を超えるモデル間合意を持つ単調な重度勾配を生成する。固定トークンd-prime推定は、深刻度相関(rho = -0.733 at 200 tokens per class)を保ち、信号がトークン数アーチファクトではないことを確認する。これらの結果は, 音韻的部分空間解析を, 耳鼻咽喉頭機能評価のための頑健で無訓練の枠組みとして支持するものである。

論文の概要: Phonological Subspace Collapse Is Aetiology-Specific and Cross-Lingually Stable: Evidence from 3,374 Speakers

関連論文リスト