Fugu-MT 論文翻訳(概要): MDD-Thinker: Towards Large Reasoning Models for Major Depressive Disorder Diagnosis

論文の概要: MDD-Thinker: Towards Large Reasoning Models for Major Depressive Disorder Diagnosis

arxiv url: http://arxiv.org/abs/2509.24217v1
Date: Mon, 29 Sep 2025 02:56:38 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.707634
Title: MDD-Thinker: Towards Large Reasoning Models for Major Depressive Disorder Diagnosis
Title（参考訳）: MDD-Thinker:大うつ病診断のための大規模推論モデルに向けて
Authors: Yuyang Sha, Hongxin Pan, Gang Luo, Caijuan Shi, Jing Wang, Kefeng Li,
Abstract要約: 大うつ病 (Major depressive disorder, MDD) は、世界的障害の主要な原因である。現在の診断アプローチは、しばしば主観的評価に依存し、臨床情報を統合する能力が欠如している。我々はMDD-Thinkerを開発した。MDD-Thinkerは、教師付き微調整(SFT)と強化学習(RL)を統合し、推論能力と解釈可能性を高める。
参考スコア（独自算出の注目度）: 3.2986206562794234
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Background Major depressive disorder (MDD) is a leading cause of global disability, yet current diagnostic approaches often rely on subjective assessments and lack the ability to integrate multimodal clinical information. Large language models (LLMs) hold promise for enhancing diagnostic accuracy through advanced reasoning but face challenges in interpretability, hallucination, and reliance on synthetic data. Methods We developed MDD-Thinker, an LLM-based diagnostic framework that integrates supervised fine-tuning (SFT) with reinforcement learning (RL) to strengthen reasoning ability and interpretability. Using the UK Biobank dataset, we generated 40,000 reasoning samples, supplemented with 10,000 samples from publicly available mental health datasets. The model was fine-tuned on these reasoning corpora, and its diagnostic and reasoning performance was evaluated against machine learning, deep learning, and state-of-the-art LLM baselines. Findings MDD-Thinker achieved an accuracy of 0.8268 and F1-score of 0.8081, significantly outperforming traditional baselines such as SVM and MLP, as well as general-purpose LLMs. Incorporating both SFT and RL yielded the greatest improvements, with relative gains of 29.0% in accuracy, 38.1% in F1-score, and 34.8% in AUC. Moreover, the model demonstrated comparable reasoning performance compared to much larger LLMs, while maintaining computational efficiency. Interpretation This study presents the first reasoning-enhanced LLM framework for MDD diagnosis trained on large-scale real-world clinical data. By integrating SFT and RL, MDD-Thinker balances accuracy, interpretability, and efficiency, offering a scalable approach for intelligent psychiatric diagnostics. These findings suggest that reasoning-oriented LLMs can provide clinically reliable support for MDD detection and may inform broader applications in mental health care.
Abstract（参考訳）: 背景主要なうつ病 (MDD) は世界的な障害の主要な原因であるが、現在の診断手法は主観的評価に頼り、マルチモーダルな臨床情報を統合する能力に欠けることが多い。大規模言語モデル(LLM)は、高度な推論による診断精度の向上を約束するが、解釈可能性、幻覚、合成データへの依存といった課題に直面している。 MDD-Thinkerは、教師付き微調整(SFT)と強化学習(RL)を統合し、推論能力と解釈可能性を強化する。 UK Biobankのデータセットを使用して、公開されているメンタルヘルスデータセットから1万のサンプルを補足した4万の推論サンプルを生成しました。モデルはこれらの推論コーパスに基づいて微調整され、その診断と推論性能は機械学習、ディープラーニング、最先端のLCMベースラインに対して評価された。 MDD-Thinker の精度は 0.8268 と F1 スコア 0.8081 であり、SVM や MLP などの従来のベースラインや汎用 LLM よりも大幅に優れていた。 SFTとRLの両方を組み込むことで、精度は29.0%、F1スコアは38.1%、AUCは34.8%向上した。さらに、このモデルは計算効率を保ちながら、より大きなLCMと比較した推論性能を示した。本研究は,大規模臨床データを用いたMDD診断のためのLCMフレームワークを初めて提示する。 SFTとRLを統合することで、MDD-Thinkerは精度、解釈可能性、効率のバランスをとり、インテリジェントな精神医学診断のためのスケーラブルなアプローチを提供する。これらの結果から,理学療法指向のLSMはMDD検出に対する臨床的に信頼性の高いサポートを提供し,メンタルヘルスの幅広い応用を通知できる可能性が示唆された。

論文の概要: MDD-Thinker: Towards Large Reasoning Models for Major Depressive Disorder Diagnosis

関連論文リスト