Fugu-MT 論文翻訳(概要): Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation

論文の概要: Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation

arxiv url: http://arxiv.org/abs/2603.15547v1
Date: Mon, 16 Mar 2026 17:09:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 18:28:58.678933
Title: Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation
Title（参考訳）: LLMは不正確な学生共振をモデル化できるか? : ディトラクタ生成を事例として
Authors: Yanick Zengaffinen, Andreas Opedal, Donya Rooein, Kv Aditya Srivatsa, Shashank Sonkar, Mrinmaya Sachan,
Abstract要約: 本研究は,大規模言語モデル (LLM) が複数選択の散逸を発生させる際の誤解に対する理由について検討する。私たちの構造化分析は、彼らのプロセスとベストプラクティスの驚くほどの一致を明らかにします。
参考スコア（独自算出の注目度）: 46.210941814500906
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Modeling plausible student misconceptions is critical for AI in education. In this work, we examine how large language models (LLMs) reason about misconceptions when generating multiple-choice distractors, a task that requires modeling incorrect yet plausible answers by coordinating solution knowledge, simulating student misconceptions, and evaluating plausibility. We introduce a taxonomy for analyzing the strategies used by state-of-the-art LLMs, examining their reasoning procedures and comparing them to established best practices in the learning sciences. Our structured analysis reveals a surprising alignment between their processes and best practices: the models typically solve the problem correctly first, then articulate and simulate multiple potential misconceptions, and finally select a set of distractors. An analysis of failure modes reveals that errors arise primarily from failures in recovering the correct solution and selecting among response candidates, rather than simulating errors or structuring the process. Consistent with these results, we find that providing the correct solution in the prompt improves alignment with human-authored distractors by 8%, highlighting the critical role of anchoring to the correct solution when generating plausible incorrect student reasoning. Overall, our analysis offers a structured and interpretable lens into LLMs' ability to model incorrect student reasoning and produce high-quality distractors.
Abstract（参考訳）: もっともらしい学生の誤解をモデル化することは、教育におけるAIにとって重要である。本研究は,大規模言語モデル (LLM) が,複数選択の散逸を発生させる際の誤解に対する理由について検討し,解法知識の調整,学生の誤解のシミュレーション,妥当性の評価などによって,不正確かつ妥当な回答をモデル化する必要がある課題について考察する。本稿では,最先端のLLMが使用する戦略を分析し,それらの推論手順を調べ,学習科学の確立したベストプラクティスと比較するための分類法を提案する。モデルは通常、まず最初に問題を正しく解き、次に複数の潜在的な誤解を明瞭化し、シミュレートし、最後に注意をそらすものを選びます。障害モードの分析によると、エラーは、エラーをシミュレートしたり、プロセスを構成するのではなく、正しいソリューションを回復し、応答候補の中から選択する際の失敗から発生する。これらの結果と一致して, 正しい解法を提供することで, 正しい解を導き出すことで, 正しい解に固執する重要な役割を浮き彫りにする。我々の分析は、構造化され解釈可能なレンズをLLMに提供し、不正確な学生推論をモデル化し、高品質なイントラクタを生成する。

論文の概要: Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation

関連論文リスト