Fugu-MT 論文翻訳(概要): Scaling Small Agents Through Strategy Auctions

論文の概要: Scaling Small Agents Through Strategy Auctions

arxiv url: http://arxiv.org/abs/2602.02751v1
Date: Mon, 02 Feb 2026 20:05:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-04 18:37:15.058506
Title: Scaling Small Agents Through Strategy Auctions
Title（参考訳）: 戦略オークションによる小型エージェントのスケーリング
Authors: Lisa Alazraki, William F. Shen, Yoram Bachrach, Akhil Mathur,
Abstract要約: 我々は,小エージェントの性能が深層検索やコーディングタスクにおいて,タスクの複雑さを伴ってスケールしないことを示す。フリーランサーマーケットプレースにインスパイアされたエージェントフレームワークである、ワークロード効率のためのStrategy Auctionsを紹介する。 SALEは最大のエージェントへの依存を53%減らし、全体のコストを35%減らし、最大のエージェントのpass@1を継続的に改善することを示した。
参考スコア（独自算出の注目度）: 6.863341991827782
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Small language models are increasingly viewed as a promising, cost-effective approach to agentic AI, with proponents claiming they are sufficiently capable for agentic workflows. However, while smaller agents can closely match larger ones on simple tasks, it remains unclear how their performance scales with task complexity, when large models become necessary, and how to better leverage small agents for long-horizon workloads. In this work, we empirically show that small agents' performance fails to scale with task complexity on deep search and coding tasks, and we introduce Strategy Auctions for Workload Efficiency (SALE), an agent framework inspired by freelancer marketplaces. In SALE, agents bid with short strategic plans, which are scored by a systematic cost-value mechanism and refined via a shared auction memory, enabling per-task routing and continual self-improvement without training a separate router or running all models to completion. Across deep search and coding tasks of varying complexity, SALE reduces reliance on the largest agent by 53%, lowers overall cost by 35%, and consistently improves upon the largest agent's pass@1 with only a negligible overhead beyond executing the final trace. In contrast, established routers that rely on task descriptions either underperform the largest agent or fail to reduce cost -- often both -- underscoring their poor fit for agentic workflows. These results suggest that while small agents may be insufficient for complex workloads, they can be effectively "scaled up" through coordinated task allocation and test-time self-improvement. More broadly, they motivate a systems-level view of agentic AI in which performance gains come less from ever-larger individual models and more from market-inspired coordination mechanisms that organize heterogeneous agents into efficient, adaptive ecosystems.
Abstract（参考訳）: 小型言語モデルはエージェントAIに対する有望で費用対効果の高いアプローチとして、エージェントワークフローに十分な能力があると主張する支持者が増えている。しかし、小さなエージェントは単純なタスクで大きなエージェントと密にマッチするが、そのパフォーマンスがタスクの複雑さとどのようにスケールするか、大規模なモデルが必要になったとき、そして長期のワークロードにどのように小さなエージェントをうまく活用するかは、まだ不明である。本研究では,小エージェントのパフォーマンスが深層検索やコーディングタスクにおけるタスクの複雑さによってスケールできないことを実証的に示し,フリーランサーのマーケットプレースにインスパイアされたエージェントフレームワークであるStrategy Auctions for Workload efficiency (SALE)を紹介した。 SALEでは、エージェントは短い戦略的な計画で入札を行い、それは体系的なコスト値のメカニズムで決められ、共有オークションメモリを介して洗練され、個別のルータを訓練したり、全てのモデルを完了させることなく、タスクごとのルーティングと継続的な自己改善を可能にする。さまざまな複雑さの深い検索とコーディングのタスクの中で、SALEは最大のエージェントへの依存を53%減らし、全体的なコストを35%減らし、最大のエージェントのpass@1を継続的に改善する。対照的に、タスク記述に依存している確立されたルータは、最大のエージェントを過小評価するか、コスト削減に失敗する。これらの結果は、小さなエージェントは複雑なワークロードでは不十分であるが、コーディネートされたタスクアロケーションとテスト時の自己改善によって、効果的に"スケールアップ"することができることを示唆している。より広い意味では、彼らはエージェントAIのシステムレベルの視点を動機付けており、パフォーマンスの向上はより大きく個々のモデルから、さらに、異種エージェントを効率よく適応的なエコシステムに組織する市場インスパイアされた調整メカニズムから生まれる。

論文の概要: Scaling Small Agents Through Strategy Auctions

関連論文リスト