Fugu-MT 論文翻訳(概要): LLMCARE: early detection of cognitive impairment via transformer models enhanced by LLM-generated synthetic data

論文の概要: LLMCARE: early detection of cognitive impairment via transformer models enhanced by LLM-generated synthetic data

arxiv url: http://arxiv.org/abs/2508.10027v3
Date: Mon, 10 Nov 2025 09:23:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-11 14:56:00.064516
Title: LLMCARE: early detection of cognitive impairment via transformer models enhanced by LLM-generated synthetic data
Title（参考訳）: LLMCARE:LLM合成データにより増強されたトランスフォーマーモデルによる認知障害の早期検出
Authors: Ali Zolnour, Hossein Azadmaleki, Yasaman Haghbin, Fatemeh Taherinezhad, Mohamad Javad Momeni Nezhad, Sina Rashidi, Masoud Khani, AmirSajjad Taleban, Samin Mahdizadeh Sani, Maryam Dadkhah, James M. Noble, Suzanne Bakken, Yadollah Yaghoobzadeh, Abdol-Hossein Vahabie, Masoud Rouhizadeh, Maryam Zolnoori,
Abstract要約: アルツハイマー病と関連する認知症は、米国で500万人近い高齢者に影響を及ぼす。本研究は,トランスフォーマー埋め込みと手作り言語的特徴を融合した音声ベースのスクリーニングパイプラインを開発し,評価する。
参考スコア（独自算出の注目度）: 32.69241041313969
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Alzheimer's disease and related dementias(ADRD) affect nearly five million older adults in the United States, yet more than half remain undiagnosed. Speech-based natural language processing(NLP) offers a scalable approach for detecting early cognitive decline through subtle linguistic markers that may precede clinical diagnosis. This study develops and evaluates a speech-based screening pipeline integrating transformer embeddings with handcrafted linguistic features, synthetic augmentation using large language models(LLMs), and benchmarking of unimodal and multimodal classifiers. External validation assessed generalizability to a MCI-only cohort. Transcripts were drawn from the ADReSSo 2021 benchmark dataset(n=237, Pitt Corpus) and the DementiaBank Delaware corpus(n=205, MCI vs. controls). Ten transformer models were tested under three fine-tuning strategies. A late-fusion model combined embeddings from the top transformer with 110 linguistic features. Five LLMs(LLaMA8B/70B, MedAlpaca7B, Ministral8B,GPT-4o) generated label-conditioned synthetic speech for augmentation, and three multimodal LLMs(GPT-4o,Qwen-Omni,Phi-4) were evaluated in zero-shot and fine-tuned modes. On ADReSSo, the fusion model achieved F1=83.3(AUC=89.5), outperforming transformer-only and linguistic baselines. MedAlpaca7B augmentation(2x) improved F1=85.7, though larger scales reduced gains. Fine-tuning boosted unimodal LLMs(MedAlpaca7B F1=47.7=>78.7), while multimodal models performed lower (Phi-4=71.6;GPT-4o=67.6). On Delaware, the fusion plus 1x MedAlpaca7B model achieved F1=72.8(AUC=69.6). Integrating transformer and linguistic features enhances ADRD detection. LLM-based augmentation improves data efficiency but yields diminishing returns, while current multimodal models remain limited. Validation on an independent MCI cohort supports the pipeline's potential for scalable, clinically relevant early screening.
Abstract（参考訳）: アルツハイマー病と関連する認知症(ADRD)は、米国で500万人近い高齢者に影響を及ぼすが、半数以上が未診断のままである。音声に基づく自然言語処理(NLP)は、臨床診断に先行する微妙な言語マーカーを通して早期の認知低下を検出するスケーラブルなアプローチを提供する。本研究は,手作り言語特徴と変換器埋め込みを統合した音声ベースのスクリーニングパイプラインの開発と評価,大規模言語モデル(LLM)を用いた合成拡張,非モーダル・マルチモーダル分類器のベンチマークを行う。外部検証は、MCIのみのコホートに対する一般化性を評価した。 ADReSSo 2021ベンチマークデータセット(n=237, Pitt Corpus)とDementiaBank Delaware corpus(n=205, MCI vs. コントロール)から転写された。 10個のトランスモデルが3つの微調整戦略の下で試験された。後期融合モデルでは、トップトランスからの埋め込みと110の言語的特徴が組み合わされた。 5つのLLM(LLaMA8B/70B,MedAlpaca7B,Ministral8B,GPT-4o)がラベル条件付き合成音声を生成し,GPT-4o,Qwen-Omni,Phi-4)をゼロショットモードおよび微調整モードで評価した。 ADReSSoでは、融合モデルはF1=83.3(AUC=89.5)を達成し、トランスフォーマーのみおよび言語ベースラインを上回った。 MedAlpaca7B augmentation(2x)によりF1=85.7が向上したが、より大きなスケールでは利得が低下した。 MedAlpaca7B F1=47.7=>78.7)、Phi-4=71.6;GPT-4o=67.6)。デラウェア州では1倍のMedAlpaca7BがF1=72.8(AUC=69.6)に達した。変換器と言語機能を統合することでADRD検出が強化される。 LLMベースの拡張はデータの効率を向上するが、現在のマルチモーダルモデルには制限があるが、リターンは減少する。独立したMCIコホートでの検証は、スケーラブルで臨床的に関係のある早期スクリーニングに対するパイプラインの可能性を支持する。

論文の概要: LLMCARE: early detection of cognitive impairment via transformer models enhanced by LLM-generated synthetic data

関連論文リスト