Fugu-MT 論文翻訳(概要): Constructing a Question-Answering Simulator through the Distillation of LLMs

論文の概要: Constructing a Question-Answering Simulator through the Distillation of LLMs

arxiv url: http://arxiv.org/abs/2509.09226v1
Date: Thu, 11 Sep 2025 07:59:30 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-12 16:52:24.281088
Title: Constructing a Question-Answering Simulator through the Distillation of LLMs
Title（参考訳）: LLMの蒸留による質問応答シミュレータの構築
Authors: Haipeng Liu, Ting Long, Jing Fu,
Abstract要約: 質問応答シミュレータ (QA) は、学生の実際の学習行動を模倣し、質問に対する回答の正しさを予測するモデルである。 QAシミュレータは、実際の学生と対話することなく、教育推薦システム(ERS)が大量のトレーニングデータを収集することを可能にする。そこで本研究では, LLMからドメイン知識と推論能力を蒸留し, 予測支援を行うLDSim (LLM Distillation Based Simulator) を提案する。
参考スコア（独自算出の注目度）: 4.573445061106203
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The question-answering (QA) simulator is a model that mimics real student learning behaviors and predicts their correctness of their responses to questions. QA simulators enable educational recommender systems (ERS) to collect large amounts of training data without interacting with real students, thereby preventing harmful recommendations made by an undertrained ERS from undermining actual student learning. Given the QA history, there are two categories of solutions to predict the correctness, conducting the simulation: (1) LLM-free methods, which apply a traditional sequential model to transfer the QA history into a vector representation first, and make predictions based on the representation; (2) LLM-based methods, which leverage the domain knowledge and reasoning capability of LLM to enhence the prediction. LLM-free methods offer fast inference but generally yield suboptimal performance. In contrast, most LLM-based methods achieve better results, but at the cost of slower inference speed and higher GPU memory consumption. In this paper, we propose a method named LLM Distillation based Simulator (LDSim), which distills domain knowledge and reasoning capability from an LLM to better assist prediction, thereby improving simulation performance. Extensive experiments demonstrate that our LDSim achieves strong results on both the simulation task and the knowledge tracing (KT) task. Our code is publicly available at https://anonymous.4open.science/r/LDSim-05A9.
Abstract（参考訳）: 質問応答シミュレータ (QA) は、学生の実際の学習行動を模倣し、質問に対する回答の正しさを予測するモデルである。 QAシミュレータは、実際の学生と対話することなく、教育推薦システム(ERS)が大量のトレーニングデータを収集することを可能にし、未学習のERSによる有害なレコメンデーションが実際の生徒の学習を損なうのを防ぐ。 1)QA履歴をベクトル表現に転送する従来の逐次モデルを適用し,その表現に基づいて予測を行う LLM-free法,(2)LLMのドメイン知識と推論能力を活用して予測を行う LLM-based method である。 LLMフリーメソッドは高速な推論を提供するが、一般に準最適性能をもたらす。対照的に、ほとんどのLCMベースの手法はより良い結果を得るが、推論速度を遅くし、GPUメモリ消費を高くするコストがかかる。本稿では, LLM からドメイン知識と推論能力を蒸留し, 予測支援を向上し, シミュレーション性能を向上させる LLM 蒸留ベースシミュレータ (LDSim) を提案する。シミュレーションタスクと知識追跡(KT)タスクの両方において,LDSimが強い結果をもたらすことを示す。私たちのコードはhttps://anonymous.4open.science/r/LDSim-05A9.comで公開されています。

論文の概要: Constructing a Question-Answering Simulator through the Distillation of LLMs

関連論文リスト