Fugu-MT 論文翻訳(概要): Do Large Language Models Mentalize When They Teach?

論文の概要: Do Large Language Models Mentalize When They Teach?

arxiv url: http://arxiv.org/abs/2604.01594v1
Date: Thu, 02 Apr 2026 04:04:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-03 14:21:10.258804
Title: Do Large Language Models Mentalize When They Teach?
Title（参考訳）: 大規模言語モデルは教えるときにメンタル化するか?
Authors: Sevan K. Harootonian, Mark K. Ho, Thomas L. Griffiths, Yael Niv, Ilia Sucholutsky,
Abstract要約: 我々は、シミュレーション教師として様々なLSMを実行し、試行錯誤の選択を人間と同じ認知モデルに適合させる。ほとんどのLSMは良好に動作し、試行よりも戦略がほとんど変化せず、グラフ・バイ・グラフの性能は人間のものと類似している。
参考スコア（独自算出の注目度）: 15.991054959432269
License: http://creativecommons.org/licenses/by/4.0/
Abstract: How do LLMs decide what to teach next: by reasoning about a learner's knowledge, or by using simpler rules of thumb? We test this in a controlled task previously used to study human teaching strategies. On each trial, a teacher LLM sees a hypothetical learner's trajectory through a reward-annotated directed graph and must reveal a single edge so the learner would choose a better path if they replanned. We run a range of LLMs as simulated teachers and fit their trial-by-trial choices with the same cognitive models used for humans: a Bayes-Optimal teacher that infers which transitions the learner is missing (inverse planning), weaker Bayesian variants, heuristic baselines (e.g., reward based), and non-mentalizing utility models. In a baseline experiment matched to the stimuli presented to human subjects, most LLMs perform well, show little change in strategy over trials, and their graph-by-graph performance is similar to that of humans. Model comparison (BIC) shows that Bayes-Optimal teaching best explains most models' choices. When given a scaffolding intervention, models follow auxiliary inference- or reward-focused prompts, but these scaffolds do not reliably improve later teaching on heuristic-incongruent test graphs and can sometimes reduce performance. Overall, cognitive model fits provide insight into LLM tutoring policies and show that prompt compliance does not guarantee better teaching decisions.
Abstract（参考訳）: LLMは、学習者の知識を推論したり、より単純な親指の規則を使って、次に何を教えるかをどう決めるか? 我々は、これまで人間の教え方略の研究に用いられてきた制御タスクでこれをテストした。各試行において、LLM教師は、報酬を付与した有向グラフを通して仮説学習者の軌跡を視認し、修正された場合、学習者がより良い経路を選択するように、単一のエッジを明らかにする必要がある。我々は、LLMをシミュレーション教師として実行し、そのトライアル・バイ・トライアル選択を、学習者が欠落している(逆計画)ベイズ・最適教師、弱いベイズ変種、ヒューリスティック・ベースライン(例えば報酬ベース)、非メンタライズユーティリティモデルといった、人間が使用する認知モデルに適合させる。被験者に提示された刺激に合わせたベースライン実験では、ほとんどのLCMは良好に動作し、試行よりも戦略がほとんど変化せず、グラフ・バイ・グラフのパフォーマンスは人間のものと類似している。モデル比較(BIC)は、ベイズ最適指導がほとんどのモデルの選択を最もよく説明していることを示している。足場の介入が与えられると、モデルは補助的な推論や報酬に焦点を当てたプロンプトに従うが、これらの足場はヒューリスティックな矛盾したテストグラフに関する後の教育を確実に改善せず、時には性能を低下させる。全体として、認知モデルはLLMの教育方針に関する洞察を与え、コンプライアンスの迅速化がより良い教育決定を保証していないことを示す。

論文の概要: Do Large Language Models Mentalize When They Teach?

関連論文リスト