Fugu-MT 論文翻訳(概要): Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models

論文の概要: Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models

arxiv url: http://arxiv.org/abs/2508.16638v1
Date: Sun, 17 Aug 2025 17:17:34 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-26 18:43:45.07165
Title: Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models
Title（参考訳）: 変圧器モデルを用いた自動評価作業における文脈の影響に関する実証分析
Authors: Abhirup Chakravarty,
Abstract要約: 本研究では,コンテクスト要因がトランスフォーマーモデルの性能に及ぼす影響について検討する。我々の最も効果的なモデルでは、エッセイデータセット全体で平均四重み付きカッパスコアが0.823であり、個々のエッセイセットでトレーニングすると0.8697となる。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Automated Essay Scoring (AES) has emerged to prominence in response to the growing demand for educational automation. Providing an objective and cost-effective solution, AES standardises the assessment of extended responses. Although substantial research has been conducted in this domain, recent investigations reveal that alternative deep-learning architectures outperform transformer-based models. Despite the successful dominance in the performance of the transformer architectures across various other tasks, this discrepancy has prompted a need to enrich transformer-based AES models through contextual enrichment. This study delves into diverse contextual factors using the ASAP-AES dataset, analysing their impact on transformer-based model performance. Our most effective model, augmented with multiple contextual dimensions, achieves a mean Quadratic Weighted Kappa score of 0.823 across the entire essay dataset and 0.8697 when trained on individual essay sets. Evidently surpassing prior transformer-based models, this augmented approach only underperforms relative to the state-of-the-art deep learning model trained essay-set-wise by an average of 3.83\% while exhibiting superior performance in three of the eight sets. Importantly, this enhancement is orthogonal to architecture-based advancements and seamlessly adaptable to any AES model. Consequently, this contextual augmentation methodology presents a versatile technique for refining AES capabilities, contributing to automated grading and evaluation evolution in educational settings.
Abstract（参考訳）: AES(Automated Essay Scoring)は、教育自動化の需要の高まりに対応して注目されている。客観的で費用対効果の高いソリューションを提供することで、AESは拡張された応答の評価を標準化する。この領域でかなりの研究が行われてきたが、最近の研究では、代替のディープラーニングアーキテクチャがトランスフォーマーベースのモデルより優れていることが示されている。他の様々なタスクにおけるトランスフォーマーアーキテクチャの性能の優位性にもかかわらず、この相違により、コンテクストエンリッチメントを通じてトランスフォーマーベースのAESモデルを強化する必要性が高まっている。本研究は,ASAP-AESデータセットを用いて,トランスフォーマーに基づくモデル性能への影響を分析する。我々の最も効果的なモデルは、複数の文脈次元で拡張され、エッセイデータセット全体で平均の四重み付きカッパスコアが0.823、個々のエッセイセットでトレーニングされたときに0.8697となる。従来のトランスフォーマーベースモデルに比して、この拡張アプローチは、最先端のディープラーニングモデルと比較して、エッセイセットを平均3.83\%の精度でトレーニングし、8セットのうち3セットで優れたパフォーマンスを示しながら、パフォーマンスを低下させるだけである。重要なのは、この拡張はアーキテクチャベースの進歩と直交しており、任意のAESモデルにシームレスに適応可能であることだ。その結果、この文脈拡張手法は、AES能力を改良するための多用途技術を示し、教育環境における自動階調と評価の進化に寄与する。

論文の概要: Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models

関連論文リスト