Fugu-MT Paper Translation (Summary): AgentSpec: Understanding Embodied Agent Scaffolds Through Controlled Composition

Paper overview: AgentSpec: Understanding Embodied Agent Scaffolds Through Controlled Composition

arxiv url: http://arxiv.org/abs/2606.14674v1
Date: Fri, 12 Jun 2026 17:39:49 GMT
Status: Translation complete
Last updated in system: 2026-06-15 16:00:43.018073
Title: AgentSpec: Understanding Embodied Agent Scaffolds Through Controlled Composition
Authors: Jixuan Chen, Jianzhi Shen, Haoqiang Kang, Zhi Hong, Qingyi Jiang, Soham Bose, Yiming Zhang, Leon Leng, Amit Vyas, Lingjun Mao, Siru Ouyang, Kun Zhou, Lianhui Qin
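Abstract summary: We introduce AgentSpec, a modular specification framework that represents embodied agents as typed compositions of reusable policy components. We instantiate the framework across DeliveryBench, ALFRED, MiniGrid, and RoboTHOR, and analyze reasoning, memory, reflection, and reinforcement-learning modules. The results show that agent performance is governed by scaffold compatibility and interaction effects rather than isolated module strength.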
Reference score (independently computed attention score): 23.160500379113703
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLM agents are increasingly built not as single model calls, but as scaffolded systems that combine reasoning, memory, reflection, action execution, and learning. While such scaffolds often improve performance, they are often embedded in tightly coupled pipelines, making it difficult to isolate component contributions, compare alternative designs, or understand how module interactions shape agent behavior. We introduce AgentSpec, a modular specification framework that represents embodied agents as typed compositions of reusable policy components with standardized interfaces. AgentSpec standardizes the interfaces among perception, memory, reasoning, reflection, action, and optional learning, enabling components to be swapped and recombined under controlled conditions. We instantiate this framework across DeliveryBench, ALFRED, MiniGrid, and RoboTHOR, and analyze reasoning, memory, reflection, and reinforcement-learning modules across model backbones. Our results show that agent performance is governed by scaffold compatibility and interaction effects rather than isolated module strength. In particular, structured multi-granularity memory improves long-horizon state tracking, reasoning and memory interact non-uniformly across environments, reflection trades off correction and cost, and RL-trained policies compose best when optimized with deployment-time scaffold structure. AgentSpec provides a controlled foundation for studying, comparing, and designing composable LLM agents. Our code, baselines and interactive playground are publicly available at https://agentspec-embodied.github.io.
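The idea of "typed compositions of reusable policy components with standardized interfaces" can be illustrated with a minimal Python sketch. This is not the AgentSpec API: every name below (`Memory`, `Reasoner`, `Reflector`, `Agent`, `WindowMemory`, `EchoReasoner`, `NoRepeatReflector`) is hypothetical, and the reasoner is a stand-in for an LLM call. It only shows how fixed interfaces let one module be swapped while the rest of the scaffold stays unchanged.

```python
from dataclasses import dataclass
from typing import List, Optional, Protocol


# Hypothetical interfaces; the real AgentSpec interfaces may differ.
class Memory(Protocol):
    def write(self, observation: str) -> None: ...
    def read(self) -> List[str]: ...


class Reasoner(Protocol):
    def decide(self, observation: str, context: List[str]) -> str: ...


class Reflector(Protocol):
    def revise(self, action: str, observation: str, context: List[str]) -> str: ...


class WindowMemory:
    """Keeps only the most recent `size` observations."""

    def __init__(self, size: int) -> None:
        self.size = size
        self.items: List[str] = []

    def write(self, observation: str) -> None:
        self.items = (self.items + [observation])[-self.size:]

    def read(self) -> List[str]:
        return list(self.items)


class EchoReasoner:
    """Stand-in for an LLM call: always proposes moving to the observed location."""

    def decide(self, observation: str, context: List[str]) -> str:
        return f"goto:{observation}"


class NoRepeatReflector:
    """Overrides the action with 'explore' if the observation is already in memory."""

    def revise(self, action: str, observation: str, context: List[str]) -> str:
        return "explore" if observation in context else action


@dataclass
class Agent:
    """Composes modules through the interfaces above; reflection is optional."""

    memory: Memory
    reasoner: Reasoner
    reflector: Optional[Reflector] = None

    def step(self, observation: str) -> str:
        context = self.memory.read()  # memory contents before this observation
        action = self.reasoner.decide(observation, context)
        if self.reflector is not None:
            action = self.reflector.revise(action, observation, context)
        self.memory.write(observation)
        return action


def run(agent: Agent, observations: List[str]) -> List[str]:
    """Feeds observations to the agent one at a time and collects its actions."""
    return [agent.step(o) for o in observations]
```

In this toy setup, the same reflector changes behavior only when the memory window is long enough to still hold the earlier observation. That is a small-scale analogue of the interaction effects the abstract describes, not a reproduction of any result in the paper.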


Related papers