Fugu-MT paper translation (abstract): MedThink: Enhancing Diagnostic Accuracy in Small Models via Teacher-Guided Reasoning Correction

Paper overview: MedThink: Enhancing Diagnostic Accuracy in Small Models via Teacher-Guided Reasoning Correction

arxiv url: http://arxiv.org/abs/2605.08094v1
Date: Thu, 09 Apr 2026 18:00:47 GMT
Status: translation complete
Last updated in system: 2026-05-25 12:34:33.689321
Title: MedThink: Enhancing Diagnostic Accuracy in Small Models via Teacher-Guided Reasoning Correction
Title (reference translation): MedThink: Improving the Diagnostic Accuracy of Small Models via Teacher-Guided Reasoning Correction
Authors: Xinchun Su, Chunxu Luo, Lipeng Ma, Yixuan Li, Weidong Yang
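Abstract summary: We propose MedThink, a two-stage distillation framework for cultivating robust clinical reasoning in small language models. In the first stage, a teacher LLM screens the data and injects domain-knowledge explanations, and the student model is fine-tuned on the result. In the second stage, the teacher evaluates the student's errors, generates reasoning chains that link knowledge to the correct answers, and refines the student's diagnostic reasoning.
Reference score (independently computed attention score): 22.35140929464229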
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Accurate clinical diagnosis requires extensive domain knowledge and complex clinical reasoning capabilities. Although large language models (LLMs) hold great potential for clinical reasoning, their high computational and memory requirements limit their deployment in resource-constrained environments. Knowledge distillation (KD) can compress LLM capabilities into smaller models, but traditional KD merely transfers superficial answer patterns and fails to preserve the structured reasoning required for reliable diagnosis. To address this, we propose a two-stage distillation framework, MedThink, designed to cultivate robust clinical reasoning in small language models (SLMs). In the first stage, a teacher LLM screens data and injects domain-knowledge explanations to fine-tune a student model, establishing a knowledge foundation. In the second stage, the teacher evaluates the student's errors, generates reasoning chains linking knowledge to correct answers, and refines the student's diagnostic reasoning through a second round of fine-tuning. We evaluate MedThink on general medical benchmarks and a gastroenterology dataset comprising 955 question-answer pairs. Experiments demonstrate that MedThink outperforms six distillation strategies in all benchmarks: achieving an improvement of up to 12.7% over the student baseline in general tasks, and reaching a total top accuracy of 56.4% in gastroenterology evaluation. This indicates that iterative distillation centered on reasoning can significantly enhance the diagnostic accuracy and generalization capabilities of SLMs whilst maintaining computational efficiency. Our code and data are publicly available at https://github.com/destinybird/PrecisionBoost.
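Abstract (reference translation): Accurate clinical diagnosis requires extensive domain knowledge and complex clinical reasoning ability. Large language models (LLMs) hold great promise for clinical reasoning, but their high compute and memory demands limit deployment in resource-constrained environments. Knowledge distillation (KD) can compress LLM capabilities into smaller models, but conventional KD only transfers superficial answer patterns and cannot preserve the structured reasoning needed for reliable diagnosis. We therefore propose MedThink, a two-stage distillation framework aimed at cultivating robust clinical reasoning in small language models (SLMs). In the first stage, a teacher LLM screens the data, injects domain-knowledge explanations, and fine-tunes the student model to establish a knowledge foundation. In the second stage, the teacher evaluates the student's errors, generates reasoning chains that link knowledge to the correct answers, and refines the student's diagnostic reasoning through a second round of fine-tuning. We evaluate MedThink on general medical benchmarks and on a gastroenterology dataset of 955 question-answer pairs. In the experiments, MedThink outperforms six distillation strategies on all benchmarks, improving on the student baseline by up to 12.7% on general tasks and reaching 56.4% accuracy in the gastroenterology evaluation. This suggests that iterative, reasoning-centered distillation can substantially improve the diagnostic accuracy and generalization of SLMs while preserving computational efficiency. Our code and data are available at https://github.com/destinybird/PrecisionBoost.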
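
The two-stage procedure described in the abstract can be sketched as plain data flow. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `Example`, `stage_one`, `stage_two`, and the callables `keep`, `explain`, `student_answer`, and `reason` are hypothetical stand-ins for the teacher LLM and the student model, and both fine-tuning steps are omitted.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Example:
    question: str
    answer: str
    explanation: Optional[str] = None  # stage 1: teacher-written domain knowledge
    reasoning: Optional[str] = None    # stage 2: teacher-written reasoning chain


def stage_one(
    data: List[Example],
    keep: Callable[[Example], bool],    # hypothetical teacher screening rule
    explain: Callable[[Example], str],  # hypothetical teacher explanation call
) -> List[Example]:
    """Keep the items the teacher accepts and attach an explanation to each."""
    return [
        Example(ex.question, ex.answer, explanation=explain(ex))
        for ex in data
        if keep(ex)
    ]


def stage_two(
    data: List[Example],
    student_answer: Callable[[str], str],    # hypothetical student inference call
    reason: Callable[[Example, str], str],   # hypothetical teacher reasoning call
) -> List[Example]:
    """Collect items the student answers wrongly and attach a reasoning chain."""
    corrections = []
    for ex in data:
        predicted = student_answer(ex.question)
        if predicted != ex.answer:
            corrections.append(
                Example(ex.question, ex.answer, ex.explanation, reason(ex, predicted))
            )
    return corrections


# Toy run with stub callables in place of real models.
data = [Example("Q1", "A"), Example("Q2", "B"), Example("", "C")]
stage1 = stage_one(
    data,
    keep=lambda ex: bool(ex.question),  # drop items with an empty question
    explain=lambda ex: f"background for {ex.question}",
)
# A first fine-tuning pass on `stage1` would happen here (omitted).
stage2 = stage_two(
    stage1,
    student_answer=lambda q: "A",  # stub student that always answers "A"
    reason=lambda ex, pred: f"{pred} is wrong because ...; hence {ex.answer}",
)
# A second fine-tuning pass on `stage2` would happen here (omitted).
```

In this toy run the stub student gets only `Q2` wrong, so `stage2` holds a single correction example. How the teacher actually screens data or scores errors is not specified in the abstract; the lambdas above are placeholders only.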

Related papers