Fugu-MT 論文翻訳(概要): PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning

論文の概要: PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning

arxiv url: http://arxiv.org/abs/2606.21139v1
Date: Fri, 19 Jun 2026 06:24:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-26 08:01:57.279009
Title: PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning
Title（参考訳）: PoLAR:ロボット政策学習における潜在行動の要因とモード
Authors: Youngjoon Jeong, Jihwan Yu, Minsoo Jo, Junha Chun, Taesup Kim,
Abstract要約: 潜在的行動事前訓練は、一対の観察から視覚変化の表現を学ぶ。放射状構造を持つ極性潜伏作用 (PoLAR) を導入し, 潜伏作用に放射状指向構造を課す。 PoLARはシミュレーションおよび実世界のロボット実験における下流ポリシー性能を改善する。
参考スコア（独自算出の注目度）: 7.207198416919797
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Latent action pretraining learns representations of visual change from pairs of observations, but existing methods typically encode each transition as a single unstructured representation that entangles transition extent and transition mode. We introduce Polar Latent Actions with Radial structure (PoLAR), which imposes a radial-direction structure on latent actions, encouraging radius to encode transition extent and direction to retain transition mode. PoLAR uses temporal offset between two observations as a weak proxy for transition extent, encouraging latent action from observation pairs separated by larger temporal gaps to occupy larger radii. We instantiate this structure in hyperbolic space, whose expanding volume with radius offers a natural fit for more diverse transition modes at larger extents. Across in-task and large-scale pretraining settings, PoLAR improves downstream policy performance in simulation and real-world robot experiments, outperforming latent action baselines and strong pretrained VLAs. These results suggest that the geometry of the latent action space is an important design choice for transferring visual pretraining to downstream robot policy learning.
Abstract（参考訳）: 潜在行動事前訓練は、一対の観測から視覚変化の表現を学ぶが、既存の手法では、遷移範囲と遷移モードを絡める単一の非構造化表現として、各遷移を符号化する。我々は、ラジアル構造を持つ極遅延作用(PoLAR)を導入し、ラジアル方向構造を潜在作用に課し、遷移範囲と方向をエンコードして遷移モードを維持するように半径を奨励する。 PoLARは、2つの観測間の時間的オフセットを遷移範囲の弱いプロキシとして使用し、より大きな時間的ギャップによって分離された観測対からの潜伏作用を促進し、より大きな半径を占有する。我々はこの構造を双曲空間でインスタンス化し、半径を持つ膨張体積はより多様な遷移モードに対してより広い範囲で自然に適合する。タスク内および大規模事前トレーニング設定全体において、PoLARはシミュレーションおよび実世界のロボット実験における下流ポリシー性能を改善し、潜伏したアクションベースラインと強い事前訓練されたVLAを上回っている。これらの結果から,潜伏行動空間の幾何学は,下流のロボットポリシー学習に視覚的事前学習を移すための重要な設計選択であることが示唆された。

論文の概要: PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning

関連論文リスト