Fugu-MT Paper Translation (Abstract): RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks

Paper overview: RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks

arxiv url: http://arxiv.org/abs/2603.11558v1
Date: Thu, 12 Mar 2026 05:22:59 GMT
Status: Translation complete
Last updated in system: 2026-03-13 14:46:25.909141
Title: RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks
Authors: Ruiying Li, Yunlang Zhou, YuYao Zhu, Kylin Chen, Jingyuan Wang, Sukai Wang, Kongtao Hu, Minhui Yu, Bowen Jiang, Zhan Su, Jiayao Ma, Xin He, Yongjian Shen, Yangyang, Guanghui Ren, Maoqing Yao, Wenhao Wang, Yao Mu
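Abstract summary: We present RoboClaw, an agentic robotics framework that unifies data collection, policy learning, and task execution under a single VLM-driven controller. At the policy level, RoboClaw introduces Entangled Action Pairs (EAP). During deployment, the same agent performs high-level reasoning and dynamically orchestrates learned policy primitives to accomplish long-horizon tasks.
Reference score (attention score computed independently by this site): 28.827331437876452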
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vision-Language-Action (VLA) systems have shown strong potential for language-driven robotic manipulation. However, scaling them to long-horizon tasks remains challenging. Existing pipelines typically separate data collection, policy learning, and deployment, resulting in heavy reliance on manual environment resets and brittle multi-policy execution. We present RoboClaw, an agentic robotics framework that unifies data collection, policy learning, and task execution under a single VLM-driven controller. At the policy level, RoboClaw introduces Entangled Action Pairs (EAP), which couple forward manipulation behaviors with inverse recovery actions to form self-resetting loops for autonomous data collection. This mechanism enables continuous on-policy data acquisition and iterative policy refinement with minimal human intervention. During deployment, the same agent performs high-level reasoning and dynamically orchestrates learned policy primitives to accomplish long-horizon tasks. By maintaining consistent contextual semantics across collection and execution, RoboClaw reduces mismatch between the two phases and improves multi-policy robustness. Experiments in real-world manipulation tasks demonstrate improved stability and scalability compared to conventional open-loop pipelines, while significantly reducing human effort throughout the robot lifecycle, achieving a 25% improvement in success rate over baseline methods on long-horizon tasks and reducing human time investment by 53.7%.
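The self-resetting loop described in the abstract can be illustrated with a minimal toy sketch. It is not the authors' implementation: the names `ToyEnv`, `EntangledActionPair`, `make_skill`, and `collect`, the two-state world, the retry limit, and the success probabilities are all hypothetical, chosen only to show how pairing a forward behavior with an inverse recovery action lets data collection continue without a manual reset after each attempt.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class ToyEnv:
    """Hypothetical one-object world: the object is either 'home' or 'placed'."""
    state: str = "home"


# A skill takes the environment and a random source, and reports success.
Skill = Callable[[ToyEnv, random.Random], bool]


@dataclass
class EntangledActionPair:
    """Assumed structure: a forward behavior coupled with its inverse recovery."""
    name: str
    forward: Skill
    inverse: Skill


def make_skill(src: str, dst: str, success_prob: float) -> Skill:
    """Build a toy skill that moves the object from src to dst with some probability."""
    def skill(env: ToyEnv, rng: random.Random) -> bool:
        if env.state != src:
            return False  # precondition not met
        if rng.random() < success_prob:
            env.state = dst
            return True
        return False
    return skill


def collect(env: ToyEnv, pair: EntangledActionPair, cycles: int,
            rng: random.Random, max_retries: int = 3
            ) -> Tuple[List[Tuple[str, bool]], int]:
    """Run forward/inverse cycles, logging every attempt.

    A human reset is counted only when a skill fails max_retries times in a row.
    """
    log: List[Tuple[str, bool]] = []
    human_resets = 0
    for _ in range(cycles):
        for label, skill in (("forward", pair.forward), ("inverse", pair.inverse)):
            ok = False
            for _ in range(max_retries):
                ok = skill(env, rng)
                log.append((label, ok))
                if ok:
                    break
            if not ok:
                env.state = "home"  # fallback: manual reset by a person
                human_resets += 1
                break
    return log, human_resets


if __name__ == "__main__":
    pair = EntangledActionPair(
        name="place/retrieve",
        forward=make_skill("home", "placed", 0.8),
        inverse=make_skill("placed", "home", 0.8),
    )
    attempts, resets = collect(ToyEnv(), pair, cycles=20, rng=random.Random(0))
    print(f"{len(attempts)} logged attempts, {resets} human resets")
```

In this sketch every cycle ends with the object back at `home`, either through the inverse skill or through the counted fallback, so the next forward attempt always starts from a valid state. Failed attempts are logged as well, which loosely mirrors the idea of feeding on-policy data back into policy refinement.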


Related papers