Fugu-MT 論文翻訳(概要): EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models

論文の概要: EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models

arxiv url: http://arxiv.org/abs/2603.20236v1
Date: Mon, 09 Mar 2026 04:50:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 02:36:12.928618
Title: EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models
Title（参考訳）: Energy Action:Energy-based Modelを用いた一対二構成
Authors: Mingchen Song, Xiang Deng, Jie Wei, Dongmei Jiang, Liqiang Nie, Weili Guan,
Abstract要約: EnergyActionは、一元的操作ポリシーを二元的タスクに合成する新しいフレームワークである。まず,個々の一元的政策をEMMとしてモデル化し,その構成特性を利用して左腕と右腕の動作を構成する。第2に,エネルギー制約を通した時空間協調機構を導入する。第3に,行動品質評価に基づいて動的に認知段階を適応させる2つの異なるエネルギー認知型認知戦略を提案する。
参考スコア（独自算出の注目度）: 73.21721376238233
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Recent advances in unimanual manipulation policies have achieved remarkable success across diverse robotic tasks through abundant training data and well-established model architectures. However, extending these capabilities to bimanual manipulation remains challenging due to the lack of bimanual demonstration data and the complexity of coordinating dual-arm actions. Existing approaches either rely on extensive bimanual datasets or fail to effectively leverage pre-trained unimanual policies. To address this limitation, we propose \textbf{EnergyAction}, a novel framework that compositionally transfers unimanual manipulation policies to bimanual tasks through the Energy-Based Models (EBMs). Specifically, our method incorporates three key innovations. First, we model individual unimanual policies as EBMs and leverage their compositional properties to compose left and right arm actions, enabling the fusion of unimanual policies into a bimanual policy. Second, we introduce an energy-based temporal-spatial coordination mechanism through energy constraints, ensuring the generated bimanual actions are both temporal coherence and spatial feasibility. Third, we propose two different energy-aware denoising strategies that dynamically adapt denoising steps based on action quality assessment. These strategies ensure the generation of high-quality actions while maintaining superior computational efficiency compared to fixed-step denoising approaches. Experimental results demonstrate that EnergyAction effectively transfers unimanual knowledge to bimanual tasks, achieving superior performance on both simulated and real-world tasks with minimal bimanual data.
Abstract（参考訳）: 一元的な操作ポリシーの最近の進歩は、豊富なトレーニングデータと確立されたモデルアーキテクチャを通して、多様なロボットタスクで顕著な成功を収めている。しかし,両腕動作のコーディネートが困難であることや,両腕動作のコーディネートが困難であることから,これらの能力を両腕操作に拡張することは依然として困難である。既存のアプローチは、広範な双方向データセットに依存するか、事前訓練された単一ポリシーを効果的に活用できないかのいずれかである。この制限に対処するために、エネルギーベースモデル(EBM)を通して一元的操作ポリシーを二元的タスクに合成する新しいフレームワークである「textbf{EnergyAction}」を提案する。特に,本手法には3つの重要な革新が組み込まれている。まず、個々の一元的政策をEMMとしてモデル化し、その構成特性を活用して、左右の腕の動きを構成することにより、一元的政策を二元的政策に融合させる。第2に,エネルギー制約を通した時間空間協調機構を導入し,生成したバイマンダル動作が時間的コヒーレンスと空間的実現可能性の両方を確実にする。第3に,行動品質評価に基づいて動的に認知段階を適応させる2つの異なるエネルギー認知型認知戦略を提案する。これらの戦略は、固定ステップのデノイングアプローチよりも優れた計算効率を維持しながら、高品質なアクションの生成を保証する。実験結果から,EnergyActionは一元的知識を二元的タスクに効果的に伝達し,二元的データを最小限に抑えたシミュレーションと実世界のタスクにおいて優れた性能を発揮することが示された。

論文の概要: EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models

関連論文リスト