Fugu-MT 論文翻訳(概要): Speculative Actions: A Lossless Framework for Faster Agentic Systems

論文の概要: Speculative Actions: A Lossless Framework for Faster Agentic Systems

arxiv url: http://arxiv.org/abs/2510.04371v1
Date: Sun, 05 Oct 2025 21:28:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 16:52:59.610265
Title: Speculative Actions: A Lossless Framework for Faster Agentic Systems
Title（参考訳）: Speculative Actions: より高速なエージェントシステムのためのロスレスフレームワーク
Authors: Naimeng Ye, Arnav Ahuja, Georgios Liargkovas, Yunan Lu, Kostis Kaffes, Tianyi Peng,
Abstract要約: AIエージェントの実行は遅く、トレーニングや評価、デプロイメントを妨げていることが多い。マイクロプロセッサにおける投機的実行に着想を得て,より高速なモデルを用いて潜在的行動を予測するフレームワークを提案する。我々は,このフレームワークを3つのエージェント環境 – ゲーム,eコマース,Web検索,オペレーティングシステム環境のための"ロッキー"拡張 – で評価する。
参考スコア（独自算出の注目度）: 6.708126506152481
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Despite growing interest in AI agents across industry and academia, their execution in an environment is often slow, hampering training, evaluation, and deployment. For example, a game of chess between two state-of-the-art agents may take hours. A critical bottleneck is that agent behavior unfolds sequentially: each action requires an API call, and these calls can be time-consuming. Inspired by speculative execution in microprocessors and speculative decoding in LLM inference, we propose speculative actions, a lossless framework for general agentic systems that predicts likely actions using faster models, enabling multiple steps to be executed in parallel. We evaluate this framework across three agentic environments: gaming, e-commerce, web search, and a "lossy" extension for an operating systems environment. In all cases, speculative actions achieve substantial accuracy in next-action prediction (up to 55%), translating into significant reductions in end-to-end latency. Moreover, performance can be further improved through stronger guessing models, top-K action prediction, multi-step speculation, and uncertainty-aware optimization, opening a promising path toward deploying low-latency agentic systems in the real world.
Abstract（参考訳）: 業界や学界にまたがるAIエージェントへの関心が高まっているにも関わらず、環境におけるAIエージェントの実行は遅く、トレーニングや評価、デプロイメントを妨げていることが多い。例えば、2つの最先端エージェント間のチェスの試合には数時間を要することがある。重要なボトルネックは、エージェントの振る舞いが順次展開することです。それぞれのアクションはAPI呼び出しを必要としており、これらの呼び出しは時間がかかります。マイクロプロセッサにおける投機的実行とLLM推論における投機的復号化を契機として,より高速なモデルを用いた潜在的動作の予測を行う汎用エージェントシステムのための投機的動作を提案する。我々は,このフレームワークを3つのエージェント環境 – ゲーム,eコマース,Web検索,オペレーティングシステム環境のための"ロッキー"拡張 – で評価する。いずれの場合も、投機的行動は次のアクション予測(最大55%)でかなりの精度を達成し、エンドツーエンドのレイテンシを大幅に削減する。さらに、より強力な推測モデル、トップKアクション予測、マルチステップの推測、不確実性を考慮した最適化により、パフォーマンスをさらに向上させ、現実世界に低遅延エージェントシステムを展開するための有望な道を開くことができる。

論文の概要: Speculative Actions: A Lossless Framework for Faster Agentic Systems

関連論文リスト