Fugu-MT paper translation (abstract): The Architecture of Errors: From Universal Impossibility to Patch-Local LLM Reliability

Paper overview: The Architecture of Errors: From Universal Impossibility to Patch-Local LLM Reliability

arxiv url: http://arxiv.org/abs/2605.30628v1
Date: Thu, 28 May 2026 22:27:08 GMT
Status: translation complete
Last updated in system: 2026-06-01 20:56:50.270341
Title: The Architecture of Errors: From Universal Impossibility to Patch-Local LLM Reliability
Title (reference translation): The Architecture of Errors: From Universal Impossibility to Patch-Local LLM Reliability
Authors: Mikhail L. Arbuzov, Lee Mosbacker, Sisong Bei, Ziwei Dong, Dmitri Kalaev, Alexey Shvets
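Abstract summary: Deployed systems do not operate over the whole universe of tasks but inside operationally bounded patches. Within such patches, empirical evidence suggests failures are sparse, repetitive, and concentrated in a small recurring catalogue. The paper formalizes this transition with two propositions and one corollary.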
Reference score (independently computed attention score): 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Universal LLM reliability is not a finite-library problem: across all possible tasks, tools, schemas, knowledge sources, and evaluator expectations, new intervention-distinguishable failure modes can appear without bound, so no finite intervention dictionary can guarantee bounded residual error for every such mode. But deployed systems do not operate over the whole universe. They operate inside operationally bounded patches (legal review, medical RAG, code repair, customer-support agents, contract extraction) with recurring tasks, schemas, tools, and evaluator expectations. Within such patches, empirical evidence suggests failures are sparse, repetitive, and concentrated in a small recurring catalogue, so reliability becomes a local catalogue-discovery and intervention-coverage problem rather than an exponential token-length problem. We formalize this transition with two propositions and one corollary. Proposition 1 is the worst-case-mode-wise negative result: no finite intervention dictionary covers every distinguishable failure mode of an unbounded domain. Corollary 1 is the inverse-discovery implication: the logarithmic upper bound on mode discovery cannot accommodate linearly more distinct tail modes without exponentially more observed hard-failure events. Proposition 2 is the positive patch-local result: under log active-mode exposure and head-heavy coverage, a sufficient per-hard-decision intervention budget grows polylogarithmically in sequence length and becomes domain-constant once the patch catalogue saturates. The framework relocates rather than dissolves long-context difficulty: where the number of hard decisions itself grows with task length, reliability remains hard; the contribution is to identify the on-axis intervention rather than to make those regimes easy.
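Abstract (reference translation): Across all possible tasks, tools, schemas, knowledge sources, and evaluator expectations, new intervention-distinguishable failure modes can appear without bound, so no finite intervention dictionary can guarantee bounded residual error for every such mode. But deployed systems do not operate over the whole universe. They operate inside operationally bounded patches (legal review, medical RAG, code repair, customer-support agents, contract extraction) with recurring tasks, schemas, tools, and evaluator expectations. Within such patches, empirical evidence suggests failures are sparse, repetitive, and concentrated in a small recurring catalogue, so reliability becomes a local catalogue-discovery and intervention-coverage problem rather than an exponential token-length problem. We formalize this transition with two propositions and one corollary. Proposition 1 is the worst-case-mode-wise negative result: no finite intervention dictionary covers every distinguishable failure mode of an unbounded domain. The logarithmic upper bound on mode discovery cannot accommodate linearly more distinct tail modes without exponentially more observed hard-failure events. Under log active-mode exposure and head-heavy coverage, a sufficient per-hard-decision intervention budget grows polylogarithmically in sequence length and becomes domain-constant once the patch catalogue saturates. Where the number of hard decisions grows with task length, reliability remains hard.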
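
The inverse-discovery claim in the abstract (linearly more tail modes require exponentially more observed hard-failure events) can be illustrated with a small sketch. The abstract does not state the bound's exact form, so everything here is an assumption made for illustration: the bound is taken to be M(n) <= c * ln(n), where n is the number of observed hard-failure events and c is a hypothetical constant, and the function names are invented for this example rather than taken from the paper.

```python
import math


def discoverable_modes(num_events: float, c: float = 1.0) -> float:
    """Upper bound on distinct modes discovered after num_events events.

    Assumes the illustrative form M(n) <= c * ln(n); the constant c and
    the natural-log form are assumptions, not values from the paper.
    """
    if num_events < 1 or c <= 0:
        raise ValueError("num_events must be >= 1 and c must be > 0")
    return c * math.log(num_events)


def min_events_for_modes(num_modes: float, c: float = 1.0) -> float:
    """Fewest events n for which the assumed bound allows num_modes modes.

    Inverts c * ln(n) >= num_modes, giving n >= exp(num_modes / c).
    """
    if num_modes < 0 or c <= 0:
        raise ValueError("num_modes must be >= 0 and c must be > 0")
    return math.exp(num_modes / c)


if __name__ == "__main__":
    # Linear growth in the target mode count, exponential growth in events.
    for modes in (5, 10, 20):
        print(modes, min_events_for_modes(modes))
```

Under this assumed bound, doubling the number of modes to be accommodated squares the required number of observed events, which is the exponential cost that the corollary describes.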
