Fugu-MT 論文翻訳(概要): Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop

論文の概要: Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop

arxiv url: http://arxiv.org/abs/2506.10968v1
Date: Thu, 12 Jun 2025 17:59:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-13 15:37:22.900379
Title: Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop
Title（参考訳）: 目とロボット:BC-RLの知覚-行動ループで行動を学ぶ
Authors: Justin Kerr, Kush Hari, Ethan Weber, Chung Min Kim, Brent Yi, Tyler Bonnen, Ken Goldberg, Angjoo Kanazawa,
Abstract要約: EyeRobotは、現実世界のタスクを完了する必要から生じる視線行動を備えたロボットシステムである。我々は、周囲を自由に回転させて観察し、強化学習を用いて視線ポリシーを訓練できるメカニカルアイボールを開発した。我々は,ロボットアームを囲む弧の操作を必要とする5つのパノラマワークスペース操作タスクに対して,EyeRobotを評価した。
参考スコア（独自算出の注目度）: 37.5231371254634
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Humans do not passively observe the visual world -- we actively look in order to act. Motivated by this principle, we introduce EyeRobot, a robotic system with gaze behavior that emerges from the need to complete real-world tasks. We develop a mechanical eyeball that can freely rotate to observe its surroundings and train a gaze policy to control it using reinforcement learning. We accomplish this by first collecting teleoperated demonstrations paired with a 360 camera. This data is imported into a simulation environment that supports rendering arbitrary eyeball viewpoints, allowing episode rollouts of eye gaze on top of robot demonstrations. We then introduce a BC-RL loop to train the hand and eye jointly: the hand (BC) agent is trained from rendered eye observations, and the eye (RL) agent is rewarded when the hand produces correct action predictions. In this way, hand-eye coordination emerges as the eye looks towards regions which allow the hand to complete the task. EyeRobot implements a foveal-inspired policy architecture allowing high resolution with a small compute budget, which we find also leads to the emergence of more stable fixation as well as improved ability to track objects and ignore distractors. We evaluate EyeRobot on five panoramic workspace manipulation tasks requiring manipulation in an arc surrounding the robot arm. Our experiments suggest EyeRobot exhibits hand-eye coordination behaviors which effectively facilitate manipulation over large workspaces with a single camera. See project site for videos: https://www.eyerobot.net/
Abstract（参考訳）: 人間は視覚の世界を受動的に観察しない。この原理に動機づけられたEyeRobotは、現実世界のタスクを完了する必要から生じる視線行動を持つロボットシステムである。我々は、周囲を自由に回転させて観察し、強化学習を用いて視線ポリシーを訓練できるメカニカルアイボールを開発した。まず、360度カメラと組み合わせた遠隔操作デモを収集する。このデータは、任意の眼球視点のレンダリングをサポートするシミュレーション環境にインポートされ、ロボットのデモンストレーションの上に視線のエピソードロールアウトを可能にする。次に、手と眼を共同で訓練するためのBC-RLループを導入し、手(BC)エージェントをレンダリングされた眼の観察から訓練し、手(RL)エージェントが正しい行動予測を行うと、眼(RL)エージェントを報奨する。このように、目がタスクを完了させる領域に目を向けるにつれて、手目調整が出現する。 EyeRobotは、小さな計算予算で高解像度を実現するために、フォビアインスパイアされたポリシーアーキテクチャを実装しています。我々は,ロボットアームを囲む弧の操作を必要とする5つのパノラマワークスペース操作タスクに対して,EyeRobotを評価した。実験の結果,EyeRobotは眼球の協調動作を示し,単一のカメラで作業空間を効果的に操作できることが示唆された。ビデオのプロジェクトサイト(https://www.eyerobot.net/)を参照。

論文の概要: Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop

関連論文リスト