Fugu-MT 論文翻訳(概要): Structure-Aware Modeling of Multiple-Choice Questions Improves Automatic Difficulty Estimation

論文の概要: Structure-Aware Modeling of Multiple-Choice Questions Improves Automatic Difficulty Estimation

arxiv url: http://arxiv.org/abs/2606.08988v1
Date: Mon, 08 Jun 2026 03:35:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:06.683151
Title: Structure-Aware Modeling of Multiple-Choice Questions Improves Automatic Difficulty Estimation
Title（参考訳）: 複数項目質問の構造認識モデルによる自動難易度推定の改善
Authors: Gabriel Ortega, Abelino Jiménez, Séverin Lions, Pablo Dartnell,
Abstract要約: AQDEは、専門家の判断と競合する難易度推定を得られる可能性があるため、教育評価への期待が高まっている。 AQDE以前の研究では、質問幹と正しいキーにイントラクタを追加することで、難易度予測が一貫して改善されるという証拠が混在していたことが報告されている。本研究では,その構造的表現に係わる障害情報の有効性を仮定し,分離成分としての障害情報を明示的にモデル化することで,この情報を省略する基準線による難易度推定を改善することを仮定する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Automatic Question Difficulty Estimation (AQDE) holds growing promise for educational assessment because it has the potential to yield difficulty estimates that are competitive with expert judgment, while helping reduce the time and financial burden associated with pilot administrations and scaling to digital testing contexts. Prior AQDE studies report mixed evidence on whether adding distractors as additional text to the question stem and the correct key consistently improves difficulty prediction. We hypothesize that the effectiveness of distractor information depends on its structural representation, and that explicitly modeling distractors as separate components improves difficulty estimation over baselines that omit this information. To address this, we designed controlled architectures that model MCQ components as distinct inputs to isolate the contribution of distractor content and order. Specifically, we represented distractors by encoding each distractor as its own text input and aggregating their representations either with order-aware concatenation (with positional tags) or with an order-invariant summation. We evaluated these architectures using two Chilean datasets (Natural and Social Sciences, 2016-2020; 4,114 multiple-choice questions). Compared to a simpler model that only used the question stem and the key, our best distractor-aware architecture achieved higher predictive performance, reaching R^2 = 0.83 for Natural Sciences and R^2 = 0.71 for Social Sciences items. An order-invariant variant achieved nearly the same accuracy with approximately half as many parameters, offering a favorable accuracy-efficiency trade-off. These results show that structural information (especially distractor content) drives gains in predictive accuracy, supporting the development of efficient, structure-aware models that are computationally viable for large-scale educational applications.
Abstract（参考訳）: AQDE(Automatic Question Difficulty Estimation)は、専門家の判断と競合する難易度推定値を得る可能性があり、パイロット管理やデジタルテストのコンテキストへのスケーリングに伴う時間と財政的負担を軽減できるため、教育評価への期待が高まっている。 AQDE以前の研究では、質問幹と正しいキーにイントラクタを追加することで、難易度予測が一貫して改善されるという証拠が混在していたことが報告されている。本研究では,その構造的表現に係わる障害情報の有効性を仮定し,分離成分としての障害情報を明示的にモデル化することで,この情報を省略する基準線による難易度推定を改善することを仮定する。これを解決するために,MCQコンポーネントを個別の入力としてモデル化する制御アーキテクチャを設計し,イントラクタの内容と順序の寄与を分離した。具体的には、各イントラクタを独自のテキスト入力として符号化し、その表現を(位置タグ付き)順序認識結合または順序不変和で集約することで、トラクタを表現した。チリの2つのデータセットを用いてこれらのアーキテクチャを評価した(Natural and Social Sciences, 2016-2020; 4,114 質問)。質問ステムとキーのみを用いた単純なモデルと比較して,我々の最も気を散らすアーキテクチャは,自然科学のR^2 = 0.83,社会科学のR^2 = 0.71に到達し,高い予測性能を達成した。順序不変な変種は、ほぼ同じ精度で、パラメータの約半分を達成し、良好な精度と効率のトレードオフを提供した。これらの結果は、構造情報(特に注意散らし内容)が予測精度を向上し、大規模教育アプリケーションで計算可能な効率的な構造認識モデルの開発を支援することを示している。

論文の概要: Structure-Aware Modeling of Multiple-Choice Questions Improves Automatic Difficulty Estimation

関連論文リスト