Fugu-MT 論文翻訳(概要): Latent Action Reparameterization for Efficient Agent Inference

論文の概要: Latent Action Reparameterization for Efficient Agent Inference

arxiv url: http://arxiv.org/abs/2605.18597v2
Date: Tue, 19 May 2026 03:38:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-20 15:03:08.646627
Title: Latent Action Reparameterization for Efficient Agent Inference
Title（参考訳）: 効率的なエージェント推論のための潜時行動パラメータ化
Authors: Wenhao Huang, Qingwen Zeng, Qiyue Chen, Zijie Guo, Yu Sun, Cheng Yang, Siru Ouyang, Jiri Gesi, Fang Wu, Jiayi Zhang, Huaming Chen, Bang Liu, Xiangru Tang, Chenglin Wu,
Abstract要約: 本稿では,複数のステップのセマンティックな振る舞いに対応する,コンパクトな潜在行動空間を学習するフレームワークを提案する。手作りのマクロや階層型コントローラとは異なり、潜在動作はエージェントの軌跡から学習され、モデルに直接統合される。
参考スコア（独自算出の注目度）: 56.42014061367112
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language model (LLM) agents often rely on long sequences of low-level textual actions, resulting in large effective decision horizons and high inference cost. While prior work has focused on improving inference efficiency through system-level optimizations or prompt engineering, we argue that a key bottleneck lies in the representation of the action space itself. We propose Latent Action Reparameterization (LAR), a framework that learns a compact latent action space in which each latent action corresponds to a multi-step semantic behavior. By reparameterizing agent actions into latent units, LAR enables decision making over a shorter effective horizon while preserving the expressiveness of the original action space. Unlike hand-crafted macros or hierarchical controllers, latent actions are learned from agent trajectories and integrated directly into the model, allowing both planning and execution to operate over abstract action representations. Across a range of LLM-based agent benchmarks, LAR significantly reduces the effective action horizon and improves inference efficiency under fixed compute budgets. As a consequence, our approach achieves substantial reductions in action tokens and corresponding wall-clock inference time, while maintaining or improving task success rates. These results suggest that action representation learning is a critical and underexplored factor in scaling efficient LLM agent inference, complementary to advances in model architecture and hardware.
Abstract（参考訳）: 大規模言語モデル(LLM)エージェントは、しばしば低レベルのテキストアクションの長いシーケンスに依存し、大きな効果的な決定の地平線と高い推論コストをもたらす。これまでの作業では、システムレベルの最適化やプロンプトエンジニアリングによる推論効率の改善に重点を置いてきたが、重要なボトルネックはアクション空間自体の表現にある、と我々は論じている。本稿では,複数のステップのセマンティックな振る舞いに対応する,コンパクトな潜時行動空間を学習するフレームワークであるLatent Action Reparameterization(LAR)を提案する。エージェントアクションを潜時単位に再パラメータ化することにより、LARは元のアクション空間の表現性を保ちながら、より短い有効地平線上の決定を可能にする。手作りのマクロや階層型コントローラとは異なり、潜在アクションはエージェントの軌跡から学習され、モデルに直接統合され、計画と実行の両方が抽象的なアクション表現を介して操作できる。 LLMベースのエージェントベンチマークの範囲で、LARは有効なアクション水平線を著しく削減し、固定された計算予算下での推論効率を向上させる。その結果,タスク成功率の維持や改善を図りながら,アクショントークンとそれに対応するウォールタイム推定時間の大幅な削減を実現している。これらの結果から, 行動表現学習は, モデルアーキテクチャやハードウェアの進歩を補完する, 効率的なLLMエージェント推論のスケーリングにおいて, 重要かつ過小評価された要因であることが示唆された。

論文の概要: Latent Action Reparameterization for Efficient Agent Inference

関連論文リスト