Fugu-MT 論文翻訳(概要): Probing for Reading Times

論文の概要: Probing for Reading Times

arxiv url: http://arxiv.org/abs/2604.18712v1
Date: Mon, 20 Apr 2026 18:12:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.405911
Title: Probing for Reading Times
Title（参考訳）: 読書時間に関する調査
Authors: Eleftheria Tsipidi, Samuel Kiegeland, Francesco Ignazio Re, Tianyang Xu, Mario Giulianelli, Karolina Stanczak, Ryan Cotterell,
Abstract要約: 我々は、英語、ギリシャ語、ヘブライ語、ロシア語、トルコ語における人間の読書時間を表す言語モデル表現を探索する。その結果,初期層からの表現は,初回固定や視線持続時間などの早期通過の予測において極めて優れていた。
参考スコア（独自算出の注目度）: 60.30314214800421
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Probing has shown that language model representations encode rich linguistic information, but it remains unclear whether they also capture cognitive signals about human processing. In this work, we probe language model representations for human reading times. Using regularized linear regression on two eye-tracking corpora spanning five languages (English, Greek, Hebrew, Russian, and Turkish), we compare the representations from every model layer against scalar predictors -- surprisal, information value, and logit-lens surprisal. We find that the representations from early layers outperform surprisal in predicting early-pass measures such as first fixation and gaze duration. The concentration of predictive power in the early layers suggests that human-like processing signatures are captured by low-level structural or lexical representations, pointing to a functional alignment between model depth and the temporal stages of human reading. In contrast, for late-pass measures such as total reading time, scalar surprisal remains superior, despite its being a much more compressed representation. We also observe performance gains when using both surprisal and early-layer representations. Overall, we find that the best-performing predictor varies strongly depending on the language and eye-tracking measure.
Abstract（参考訳）: 言語モデル表現は豊かな言語情報をエンコードすることを示したが、それらが人間の処理に関する認知信号も捉えているかどうかは不明だ。本研究では,人間の読解時間に対する言語モデル表現を探索する。 5つの言語(英語、ギリシャ語、ヘブライ語、ロシア語、トルコ語)にまたがる2つの視線追跡コーパスの正規化線形回帰を用いて、各モデル層からの表現を、予備、情報値、およびロジットレンズの予想値であるスカラー予測子と比較する。その結果,初期層からの表現は,初回固定や視線持続時間などの早期通過の予測において極めて優れていた。初期の層における予測力の集中は、人間の様の処理シグネチャが低レベルの構造的あるいは語彙的表現によって捉えられ、モデル深度と人間の読取の時間的段階の間の機能的な整合性を示すことを示唆している。対照的に、総読解時間などの遅滞度測定では、より圧縮された表現であるにもかかわらず、スカラー・サプライサルの方が優れている。また,予備層と初期層の両方の表現を用いた場合のパフォーマンス向上も観察する。全体として、最高の性能の予測器は言語や視線追跡の指標によって大きく異なることが判明した。

論文の概要: Probing for Reading Times

関連論文リスト