Fugu-MT 論文翻訳(概要): Positional Biases Shift as Inputs Approach Context Window Limits

論文の概要: Positional Biases Shift as Inputs Approach Context Window Limits

arxiv url: http://arxiv.org/abs/2508.07479v1
Date: Sun, 10 Aug 2025 20:40:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-12 21:23:28.872482
Title: Positional Biases Shift as Inputs Approach Context Window Limits
Title（参考訳）: 入力がコンテキストウィンドウ限界に近づくと位置バイアスがシフトする
Authors: Blerta Veseli, Julian Chibane, Mariya Toneva, Alexander Koller,
Abstract要約: 入力がモデルのコンテキストウィンドウの最大50%を占める場合、LiM効果は最強となる。関係情報が入力の終端に近づくと,モデルの性能が向上する,距離に基づくバイアスが観測される。
参考スコア（独自算出の注目度）: 57.00239097102958
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) often struggle to use information across long inputs effectively. Prior work has identified positional biases, such as the Lost in the Middle (LiM) effect, where models perform better when information appears at the beginning (primacy bias) or end (recency bias) of the input, rather than in the middle. However, long-context studies have not consistently replicated these effects, raising questions about their intensity and the conditions under which they manifest. To address this, we conducted a comprehensive analysis using relative rather than absolute input lengths, defined with respect to each model's context window. Our findings reveal that the LiM effect is strongest when inputs occupy up to 50% of a model's context window. Beyond that, the primacy bias weakens, while recency bias remains relatively stable. This effectively eliminates the LiM effect; instead, we observe a distance-based bias, where model performance is better when relevant information is closer to the end of the input. Furthermore, our results suggest that successful retrieval is a prerequisite for reasoning in LLMs, and that the observed positional biases in reasoning are largely inherited from retrieval. These insights have implications for long-context tasks, the design of future LLM benchmarks, and evaluation methodologies for LLMs handling extended inputs.
Abstract（参考訳）: 大規模言語モデル(LLM)は、長い入力の情報を効果的に利用するのに苦労することが多い。従来の作業では、中間の損失(LiM)効果のような位置バイアスを特定しており、中央ではなく、入力の開始(プライマリーバイアス)や終了(遅延バイアス)に情報が現れると、モデルはより良く機能する。しかし、長文の研究はこれらの効果を一貫して再現しておらず、その強さとそれらが表す条件について疑問を呈している。これを解決するために、各モデルのコンテキストウィンドウに対して定義された絶対的な入力長ではなく、相対的な入力長を用いて包括的解析を行った。その結果、入力がモデルのコンテキストウィンドウの最大50%を占める場合、LiM効果が最強であることが判明した。さらに、プライマリーバイアスは弱まり、リレーレンシーバイアスは比較的安定している。これにより、LiM効果を効果的に排除し、入力の終端に近い場合にモデル性能がよい距離ベースのバイアスを観測する。さらに,本研究の結果から,LLMの推論には検索が不可欠であることが示唆され,解析における位置バイアスが検索から大きく受け継がれていることが示唆された。これらの知見は、長いコンテキストタスク、将来のLLMベンチマークの設計、拡張入力を扱うLLMの評価手法に影響を及ぼす。

論文の概要: Positional Biases Shift as Inputs Approach Context Window Limits

関連論文リスト