Fugu-MT 論文翻訳(概要): Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning

論文の概要: Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning

arxiv url: http://arxiv.org/abs/2603.13988v1
Date: Sat, 14 Mar 2026 15:30:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.527473
Title: Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning
Title（参考訳）: ファイントフルかただプラウシブルか : 医療推論におけるクローズドソースLCMの忠実度の評価
Authors: Halimat Afolabi, Zainab Afolabi, Elizabeth Friel, Jude Roberts, Antonio Ji-Xu, Lloyd Chen, Egheosa Ogbomo, Emiliomo Imevbore, Phil Eneje, Wissal El Ouahidi, Aaron Sohal, Alisa Kennan, Shreya Srivastava, Anirudh Vairavan, Laura Napitu, Katie McClure,
Abstract要約: 我々は,広く使用されている3つの大規模言語モデルの中で,医学的推論における忠実度を体系的に評価する。 CoT推論のステップは、しばしば予測を因果的に駆動せず、モデルは、認識せずに容易に外部ヒントを組み込む。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Closed-source large language models (LLMs), such as ChatGPT and Gemini, are increasingly consulted for medical advice, yet their explanations may appear plausible while failing to reflect the model's underlying reasoning process. This gap poses serious risks as patients and clinicians may trust coherent but misleading explanations. We conduct a systematic black-box evaluation of faithfulness in medical reasoning among three widely used closed-source LLMs. Our study consists of three perturbation-based probes: (1) causal ablation, testing whether stated chain-of-thought (CoT) reasoning causally influences predictions; (2) positional bias, examining whether models create post-hoc justifications for answers driven by input positioning; and (3) hint injection, testing susceptibility to external suggestions. We complement these quantitative probes with a small-scale human evaluation of model responses to patient-style medical queries to examine concordance between physician assessments of explanation faithfulness and layperson perceptions of trustworthiness. We find that CoT reasoning steps often do not causally drive predictions, and models readily incorporate external hints without acknowledgment. In contrast, positional biases showed minimal impact in this setting. These results underscore that faithfulness, not just accuracy, must be central in evaluating LLMs for medicine, to ensure both public protection and safe clinical deployment.
Abstract（参考訳）: ChatGPT(英語版)やGemini(英語版)などのオープンソース・大型言語モデル(LLM)は、医学的アドバイスを求める声が高まっているが、それらの説明はモデルの基本的推論過程を反映していないように思える。このギャップは、患者や臨床医が一貫性はあるが誤解を招く説明を信用する可能性があるため、深刻なリスクを引き起こす。医学的推論における信頼度をブラックボックスで体系的に評価する。本研究は3つの摂動型プローブから成っている。(1)因果的アブレーション,(2)因果的思考の連鎖(CoT)推論が予測に因果的影響を及ぼすかどうか,(2)モデルが入力位置によって引き起こされる回答に対するポストホック正当性を生成するかどうか,(3)ヒント注入,外部提案に対する感受性をテストする。我々は,これらの定量的調査を,患者スタイルの医療クエリに対するモデル応答の小規模人為的評価と補完し,医師による説明忠実度の評価と信頼感の素人認識の一致を検証した。 CoT推論のステップは、しばしば予測を因果的に駆動せず、モデルは、認識せずに容易に外部ヒントを組み込む。対照的に、位置バイアスは、この設定に最小限の影響を示した。これらの結果は, 医療用LSMの評価において, 信頼感が重要であり, 公衆の保護と安全な臨床展開の両面において重要であることを裏付けるものである。

論文の概要: Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning

関連論文リスト