Fugu-MT 論文翻訳(概要): An Information Theoretic Perspective on Agentic System Design

論文の概要: An Information Theoretic Perspective on Agentic System Design

arxiv url: http://arxiv.org/abs/2512.21720v1
Date: Thu, 25 Dec 2025 15:45:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-29 20:48:41.955227
Title: An Information Theoretic Perspective on Agentic System Design
Title（参考訳）: エージェントシステム設計における情報理論の視点
Authors: Shizhe He, Avanika Narayan, Ishan S. Khare, Scott W. Linderman, Christopher Ré, Dan Biderman,
Abstract要約: 小さな「圧縮器」LMは生のコンテキストをコンパクトテキストに蒸留し、より大きな「予測器」LMによって消費される。その人気にもかかわらず、圧縮機・予測装置の設計はいまだ大半がアドホックである。我々は、特定のタスクによらず、相互情報が下流のパフォーマンスを強く予測することを示す。
参考スコア（独自算出の注目度）: 40.402567029445045
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Agentic language model (LM) systems power modern applications like "Deep Research" and "Claude Code," and leverage multi-LM architectures to overcome context limitations. Beneath their apparent diversity lies a recurring pattern: smaller "compressor" LMs (that can even run locally) distill raw context into compact text that is then consumed by larger "predictor" LMs. Despite their popularity, the design of compressor-predictor systems remains largely ad hoc, with little guidance on how compressor and predictor choices shape downstream performance. In practice, attributing gains to compression versus prediction requires costly, task-specific pairwise sweeps. We argue that these agentic system design questions are, at root, information-theoretic. Viewing the compressor LM as a noisy channel, we introduce a simple estimator of mutual information between the context and its compression to quantify compression quality in a task-independent way. We show that mutual information strongly predicts downstream performance, independent of any specific task. Through an information-theoretic framework, we perform a comprehensive empirical analysis across five datasets and three model families. Results reveal that larger compressors not only are more accurate, but also more token-efficient, conveying more bits of information per token. A 7B Qwen-2.5 compressor, for instance, is $1.6\times$ more accurate, $4.6\times$ more concise, and conveys $5.5\times$ more bits of mutual information per token than its 1.5B sibling. Across datasets, scaling compressors is substantially more effective than scaling predictors, enabling larger on-device compressors to pair with smaller cloud predictors. Applied to a Deep Research system, these principles enable local compressors as small as 3B parameters to recover $99\%$ of frontier-LM accuracy at $26\%$ of API costs.
Abstract（参考訳）: エージェント言語モデル(LM)システムは、"Deep Research"や"Claude Code"のようなモダンなアプリケーションに電力を供給し、コンテキスト制限を克服するためにマルチLMアーキテクチャを活用する。より小さな「圧縮器」LM(ローカルでも実行できる)は生のコンテキストをコンパクトなテキストに蒸留し、より大きな「予測器」LMが消費する。その人気にもかかわらず、圧縮機と予測器の選択が下流の性能をどのように形成するかについてのガイダンスは乏しいが、圧縮機と予測器の設計はいまだに大半が苦手である。実際には、圧縮と予測に利益をもたらすには、コストがかかるタスク固有のペアワイズが必要です。エージェントシステム設計の疑問は、根本から情報理論である、と我々は主張する。圧縮機LMをノイズのあるチャネルと見なして、コンテキストと圧縮の間の相互情報の簡易な推定器を導入し、タスクに依存しない方法で圧縮品質を定量化する。我々は、特定のタスクによらず、相互情報が下流のパフォーマンスを強く予測することを示す。情報理論の枠組みにより、5つのデータセットと3つのモデルファミリーにわたる包括的経験分析を行う。その結果、より大きな圧縮機はより正確であるだけでなく、トークン効率も向上し、トークンごとにより多くの情報を運ぶことがわかった。例えば、7B Qwen-2.5圧縮機は1.6\times$より正確で4.6\times$より簡潔で、5.5\times$トークン当たりの相互情報を1.5B兄弟より多く伝達する。データセット全体にわたって、圧縮機のスケーリングは予測器のスケーリングよりも実質的に効果的であり、より大きなデバイス上の圧縮機がより小さなクラウド予測器と組み合わせることができる。 Deep Researchシステムに適用されたこれらの原則により、ローカル圧縮機を3Bパラメータまで小さくして、フロンティア・LMの精度を26セントのAPIコストで回収することができる。

論文の概要: An Information Theoretic Perspective on Agentic System Design

関連論文リスト