Fugu-MT 論文翻訳(概要): HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies

論文の概要: HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies

arxiv url: http://arxiv.org/abs/2603.12243v2
Date: Sun, 15 Mar 2026 17:16:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-21 18:33:56.730511
Title: HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies
Title（参考訳）: HandelBot:Dexterous Robot Policiesの高速適応による現実世界のピアノ演奏
Authors: Amber Xie, Haozhi Qi, Dorsa Sadigh,
Abstract要約: HandelBotは、シミュレーションポリシと、2段階のパイプラインによる迅速な適応を組み合わせたフレームワークである。我々はHandelBotが正確にバイマニュアルピアノを演奏できることを実証した。本システムでは, 直接シミュレーションの展開を1.8倍に向上させ, 30分間の物理的相互作用データを必要とする。
参考スコア（独自算出の注目度）: 32.80504141722999
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Mastering dexterous manipulation with multi-fingered hands has been a grand challenge in robotics for decades. Despite its potential, the difficulty of collecting high-quality data remains a primary bottleneck for high-precision tasks. While reinforcement learning and simulation-to-real-world transfer offer a promising alternative, the transferred policies often fail for tasks demanding millimeter-scale precision, such as bimanual piano playing. In this work, we introduce HandelBot, a framework that combines a simulation policy and rapid adaptation through a two-stage pipeline. Starting from a simulation-trained policy, we first apply a structured refinement stage to correct spatial alignments by adjusting lateral finger joints based on physical rollouts. Next, we use residual reinforcement learning to autonomously learn fine-grained corrective actions. Through extensive hardware experiments across five recognized songs, we demonstrate that HandelBot can successfully perform precise bimanual piano playing. Our system outperforms direct simulation deployment by a factor of 1.8x and requires only 30 minutes of physical interaction data.
Abstract（参考訳）: マルチフィンガーハンドで巧妙な操作をマスターすることは、ロボット工学において何十年にもわたって大きな課題だった。その可能性にもかかわらず、高品質なデータを集めることの難しさは、高精度なタスクの主要なボトルネックである。強化学習や実世界へのシミュレーション・トゥ・リアル・トランスファーは有望な代替手段となるが、バイマニュアルピアノ演奏のようなミリスケールの精度を必要とするタスクでは、転送ポリシーは失敗することが多い。本研究では,シミュレーションポリシと2段階パイプラインによる迅速な適応を組み合わせたフレームワークであるHandelBotを紹介する。シミュレーション学習から始め, 物理的ロールアウトに基づいて横指関節を調整することにより, 空間アライメントの補正に構造化された改良段階を適用した。次に、残留強化学習を用いて、きめ細かい補正行動を自律的に学習する。 5曲にわたるハードウェア実験を通じて、HandelBotが正確にバイマニュアルピアノを演奏できることを実証した。本システムでは, 直接シミュレーションの展開を1.8倍に向上させ, 30分間の物理インタラクションデータしか必要としない。

論文の概要: HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies

関連論文リスト