Fugu-MT 論文翻訳(概要): Context Is What You Need: The Maximum Effective Context Window for Real World Limits of LLMs

論文の概要: Context Is What You Need: The Maximum Effective Context Window for Real World Limits of LLMs

arxiv url: http://arxiv.org/abs/2509.21361v1
Date: Sun, 21 Sep 2025 14:38:17 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-29 20:57:53.869694
Title: Context Is What You Need: The Maximum Effective Context Window for Real World Limits of LLMs
Title（参考訳）: コンテキストは必要なもの: LLMの実世界限界に最適なコンテキストウィンドウ
Authors: Norman Paulsen,
Abstract要約: 大規模言語モデルにおけるコンテキストウィンドウの現実的利用をテストする。報告した最大コンテキストウィンドウ (MCW) サイズと最大有効コンテキストウィンドウ (MECW) サイズを比較した。我々のデータは、提供された問題の種類に基づいて、最大有効コンテキストウィンドウシフトを明らかにします。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Large language model (LLM) providers boast big numbers for maximum context window sizes. To test the real world use of context windows, we 1) define a concept of maximum effective context window, 2) formulate a testing method of a context window's effectiveness over various sizes and problem types, and 3) create a standardized way to compare model efficacy for increasingly larger context window sizes to find the point of failure. We collected hundreds of thousands of data points across several models and found significant differences between reported Maximum Context Window (MCW) size and Maximum Effective Context Window (MECW) size. Our findings show that the MECW is, not only, drastically different from the MCW but also shifts based on the problem type. A few top of the line models in our test group failed with as little as 100 tokens in context; most had severe degradation in accuracy by 1000 tokens in context. All models fell far short of their Maximum Context Window by as much as 99 percent. Our data reveals the Maximum Effective Context Window shifts based on the type of problem provided, offering clear and actionable insights into how to improve model accuracy and decrease model hallucination rates.
Abstract（参考訳）: 大規模言語モデル(LLM)プロバイダは、最大コンテキストウィンドウサイズに対して大きな数値を誇っている。コンテキストウィンドウの現実的利用をテストするには 1)最大有効コンテキストウィンドウという概念を定義する。 2 各種サイズ及び問題種別における文脈窓の有効性の検査方法を定式化し、 3) 失敗点を見つけるために、より大きなコンテキストウィンドウサイズに対するモデルの有効性を比較するための標準化された方法を作成する。複数のモデルにまたがって数十万のデータポイントを収集し、報告された最大コンテキストウィンドウ(MCW)サイズと最大有効コンテキストウィンドウ(MECW)サイズの間に大きな違いがあることを発見した。以上の結果から,MECWはMCWと大きく異なるだけでなく,問題の種類によっても変化していることがわかった。テストグループのラインモデルのいくつかのトップは、100トークンのコンテキストで失敗しました。すべてのモデルは、最大コンテキストウィンドウの99%をはるかに下回った。我々のデータでは、与えられた問題の種類に基づいて、最大有効コンテキストウィンドウシフトを明らかにし、モデル精度の向上とモデル幻覚率の低下に関する明確で実用的な洞察を提供する。

論文の概要: Context Is What You Need: The Maximum Effective Context Window for Real World Limits of LLMs

関連論文リスト