Fugu-MT 論文翻訳(概要): To model human linguistic prediction, make LLMs less superhuman

論文の概要: To model human linguistic prediction, make LLMs less superhuman

arxiv url: http://arxiv.org/abs/2510.05141v1
Date: Wed, 01 Oct 2025 21:53:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-08 17:57:07.848979
Title: To model human linguistic prediction, make LLMs less superhuman
Title（参考訳）: 人間の言語予測をモデル化するために、LLMを超人的にする
Authors: Byung-Doh Oh, Tal Linzen,
Abstract要約: 近年,言語モデルによる次の単語の予測が向上するにつれて,人間の読解行動を予測する能力は低下している。なぜなら、LLMは人よりずっと優れた単語を予測でき、読み出しの難易度を予測できるからです。我々は、人間のような長期記憶と短期記憶を持つモデルの作成を提唱し、この目標を達成するためのいくつかの可能な方向性を概説する。
参考スコア（独自算出の注目度）: 22.141352033537554
License: http://creativecommons.org/licenses/by/4.0/
Abstract: When people listen to or read a sentence, they actively make predictions about upcoming words: words that are less predictable are generally read more slowly than predictable ones. The success of large language models (LLMs), which, like humans, make predictions about upcoming words, has motivated exploring the use of these models as cognitive models of human linguistic prediction. Surprisingly, in the last few years, as language models have become better at predicting the next word, their ability to predict human reading behavior has declined. This is because LLMs are able to predict upcoming words much better than people can, leading them to predict lower processing difficulty in reading than observed in human experiments; in other words, mainstream LLMs are 'superhuman' as models of language comprehension. In this position paper, we argue that LLMs' superhumanness is primarily driven by two factors: compared to humans, LLMs have much stronger long-term memory for facts and training examples, and they have much better short-term memory for previous words in the text. We advocate for creating models that have human-like long-term and short-term memory, and outline some possible directions for achieving this goal. Finally, we argue that currently available human data is insufficient to measure progress towards this goal, and outline human experiments that can address this gap.
Abstract（参考訳）: 人々が文章を聴いたり読んだりすると、次の単語について積極的に予測する:予測しにくい単語は一般的に予測できる単語よりもゆっくり読む。大型言語モデル(LLMs)の成功は、人間と同様に、今後の単語について予測するものであり、これらのモデルが人間の言語予測の認知モデルとして使われることを探求する動機となっている。驚くべきことに、ここ数年で言語モデルが次の単語を予測できるようになり、人間の読書行動を予測する能力は低下している。言い換えると、LLMは言語理解のモデルとして「超人」である。本稿では, LLMの超人性は, 人間に比べて, 事実や訓練例に対する長期記憶がより強く, テキスト中の単語に対する短期記憶がより優れている,という2つの要因によって引き起こされていることを論じる。我々は、人間のような長期記憶と短期記憶を持つモデルの作成を提唱し、この目標を達成するためのいくつかの可能な方向性を概説する。最後に、現在利用可能な人間のデータは、この目標に向けての進捗を測るには不十分であり、このギャップに対処できる人間の実験の概要を述べる。

論文の概要: To model human linguistic prediction, make LLMs less superhuman

関連論文リスト