Fugu-MT 論文翻訳(概要): IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents

論文の概要: IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents

arxiv url: http://arxiv.org/abs/2605.22154v1
Date: Thu, 21 May 2026 08:25:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.16195
Title: IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents
Title（参考訳）: IdleSpec: LLMエージェントの投機計画によるアイドルタイムの爆発
Authors: Daewon Choi, Kyunghyun Park, Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi, Jinwoo Shin, Aram Galstyan,
Abstract要約: 大規模言語モデル(LLM)ベースのエージェントは、反復的なツールコールと環境相互作用で多段階推論を活用することで複雑なタスクを解決する。ほとんどのエージェントシナリオではアイドル時間が流行しているが、既存の作業では避けられないオーバーヘッドとして扱っている。 IdleSpecは、アイドル時間計算を利用してエージェントのパフォーマンスを向上させる、スケーラブルで汎用的な推論手法である。
参考スコア（独自算出の注目度）: 56.77101184011525
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language model (LLM)-based agents solve complex tasks by leveraging multi-step reasoning with iterative tool calls and environment interactions, which incur idle time while waiting for observations. Despite the prevalence of idle time in most agentic scenarios, existing works treat it as an unavoidable overhead or propose restricted solutions that overlook varying computational budgets across different tool calls and future observation uncertainty, thereby leading to suboptimal utilization of idle time. In this paper, we introduce IdleSpec, a scalable and generic inference approach that leverages idle-time computation to improve agent performance while minimizing latency overhead. Specifically, IdleSpec iteratively generates plan candidates during idle periods and, once observations become available, aggregates them to guide the next reasoning step. For effective plan generation under observation uncertainty, IdleSpec samples between complementary drafting strategies (i.e., progressive and recovery) from a learned distribution that is updated via posterior feedback. Our experiments demonstrate that IdleSpec significantly improves agent performance in various agentic scenarios by effectively utilizing idle time. In particular, on the GAIA and FRAMES, IdleSpec achieves 55.6% average accuracy with Gemini-2.5-Flash, surpassing the vanilla baseline without idle-time usage by 5.1%. Furthermore, for MLE-Bench, which involves substantial delay from code executions, IdleSpec achieves performance gains of up to 9.1% on the Any Medal rate, highlighting its generalizability to long-horizon tasks.
Abstract（参考訳）: 大規模言語モデル(LLM)ベースのエージェントは、反復的ツールコールと環境相互作用による多段階推論を利用して複雑なタスクを解く。多くのエージェントシナリオにおいてアイドル時間が流行しているにもかかわらず、既存の研究は避けられないオーバーヘッドとして扱うか、異なるツールコールの様々な計算予算と将来の観察の不確実性を見極める制限された解決策を提案し、アイドル時間の最適利用につながる。本稿では、アイドル時間計算を利用して遅延オーバーヘッドを最小限に抑えながらエージェント性能を向上させる、スケーラブルで汎用的な推論手法であるIdleSpecを紹介する。具体的には、IdleSpecはアイドル期間中に計画候補を反復的に生成します。観測不確実性下での効果的な計画生成のために、IdleSpecは、後続フィードバックによって更新された学習分布からの相補的起草戦略(プログレッシブおよびリカバリ)のサンプルをサンプリングする。実験により,IdleSpecはアイドルタイムを有効活用することにより,エージェントシナリオにおけるエージェント性能を著しく向上することが示された。特にGAIAとFRAMESでは、IdleSpecはGemini-2.5-Flashで平均55.6%の精度を達成し、アイドル時間を使用しないバニラベースラインを5.1%上回っている。さらに、コード実行からかなり遅れるMLE-Benchでは、IdleSpecはAny Medalレートで最大9.1%のパフォーマンス向上を実現し、長距離タスクへの一般化性を強調している。

論文の概要: IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents

関連論文リスト