Fugu-MT 論文翻訳(概要): Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild

論文の概要: Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild

arxiv url: http://arxiv.org/abs/2606.11744v1
Date: Wed, 10 Jun 2026 07:20:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 16:42:38.340007
Title: Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild
Title（参考訳）: ヘイ・チャット, 教えられるか? 野生の人間の学習のためのソクラテス的対話を構造化する
Authors: Sidney Tio, Arunesh Sinha, Pradeep Varakantham,
Abstract要約: 大規模な言語モデルは今では日常的な学習に広く使われている。しかし、基礎となる対話は通常、カリキュラムに従うのではなく、非構造化のチャットである。このギャップは、スケーリングモデルだけでは閉じられません。
参考スコア（独自算出の注目度）: 36.633166596198635
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models are now widely used for everyday learning, but the underlying interactions are typically unstructured chats rather than following a curriculum. Unlike formal online learning systems, these interactions carry no prior record of the student, so any estimate of what the student already knows must be inferred from the dialogue itself. We show that this gap is not closed by scaling models alone. Frontier and education-tuned LLMs perform poorly when asked to tutor a student over an extended session, because doing so requires three things at once. The tutor must sequence a curriculum, conduct Socratic dialogue, and infer the student's knowledge state from that dialogue. We propose separating these responsibilities. Given a student query, our system constructs a prerequisite knowledge graph in which subtopics are nodes and dependencies are edges, and frames tutoring as deciding which node to teach next and how many dialogue turns to spend on it before moving on. A lightweight PPO policy handles this sequencing decision, while an LLM conducts the Socratic exchange at the chosen node and returns a signal of student progress. Across held-out STEM and non-STEM topics, our PPO-paired tutor outperforms heuristic baselines, frontier general-purpose models, and a model specialised for Socratic dialogue: on both the rate at which students reach full curriculum mastery and the number of turns required. Explicit curriculum structure delivers gains that scaling the underlying model does not.
Abstract（参考訳）: 大規模な言語モデルは今では日常的な学習に広く使われているが、基礎となる相互作用は通常、カリキュラムに従うのではなく、構造化されていないチャットである。公式のオンライン学習システムとは異なり、これらの相互作用は生徒の事前の記録を持たないので、生徒が既に知っていることの見積もりは、対話そのものから推測されなければならない。このギャップは、スケーリングモデルだけでは閉じられません。フロンティアと教育用に調整されたLLMは、拡張セッションで生徒に指導を依頼すると、パフォーマンスが悪くなります。教師はカリキュラムを解析し、ソクラテス的な対話を行い、その対話から生徒の知識状態を推測しなければならない。これらの責任を分離することを提案する。学生問合せを前提として,サブトピックがノードであり,依存関係がエッジである前提知識グラフを構築し,次にどのノードを教えるか,どの対話に何回費やすかを決定するフレームを学習する。軽量PPOポリシーはこのシーケンシング決定を処理し、LCMは選択ノードでソクラテス交換を行い、生徒の進歩のシグナルを返す。 STEMと非STEMのトピック全体で、私たちのPPO対応チューターは、ヒューリスティックベースライン、フロンティア汎用モデル、ソクラティック対話に特化したモデル、すなわち、学生が完全なカリキュラムの熟達と必要なターン数に到達する速度において、優れたパフォーマンスを発揮します。明示的なカリキュラム構造は、基礎となるモデルをスケールしない利益をもたらす。

論文の概要: Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild

関連論文リスト