Fugu-MT 論文翻訳(概要): DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming

論文の概要: DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming

arxiv url: http://arxiv.org/abs/2603.22263v1
Date: Mon, 23 Mar 2026 17:49:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.827318
Title: DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming
Title（参考訳）: DexDrummer:In-Hand, Contact-Rich, Long-Horizon Dexterous Robot Drumming
Authors: Hung-Chieh Fang, Amber Xie, Jennifer Grannen, Kenneth Llontop, Dorsa Sadigh,
Abstract要約: ドラムは自然に手動制御、コンタクトリッチな相互作用、長い水平調整を統合している。本稿では,DexDrummerについて述べる。DexDrummerは,シミュレート・トゥ・リアル・トランスファーによるシミュレーションで訓練された階層的オブジェクト中心のバイマンダル・ドラム・ポリシーである。巧妙な操作ポリシーは、指-スティックとスティック-ドラムの相互作用の両方を明示的にモデル化する報酬によって導かれる、コンタクトリッチなダイナミクスを扱う。
参考スコア（独自算出の注目度）: 28.284862746585535
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Performing in-hand, contact-rich, and long-horizon dexterous manipulation remains an unsolved challenge in robotics. Prior hand dexterity works have considered each of these three challenges in isolation, yet do not combine these skills into a single, complex task. To further test the capabilities of dexterity, we propose drumming as a testbed for dexterous manipulation. Drumming naturally integrates all three challenges: it involves in-hand control for stabilizing and adjusting the drumstick with the fingers, contact-rich interaction through repeated striking of the drum surface, and long-horizon coordination when switching between drums and sustaining rhythmic play. We present DexDrummer, a hierarchical object-centric bimanual drumming policy trained in simulation with sim-to-real transfer. The framework reduces the exploration difficulty of pure reinforcement learning by combining trajectory planning with residual RL corrections for fast transitions between drums. A dexterous manipulation policy handles contact-rich dynamics, guided by rewards that explicitly model both finger-stick and stick-drum interactions. In simulation, we show our policy can play two styles of music: multi-drum, bimanual songs and challenging, technical exercises that require increased dexterity. Across simulated bimanual tasks, our dexterous, reactive policy outperforms a fixed grasp policy by 1.87x across easy songs and 1.22x across hard songs F1 scores. In real-world tasks, we show song performance across a multi-drum setup. DexDrummer is able to play our training song and its extended version with an F1 score of 1.0.
Abstract（参考訳）: ロボット工学における未解決の課題は、手動で、接触に富んだ、そして長い水平な操作である。以前は、これらの3つの課題のそれぞれを分離して検討していたが、これらのスキルを1つの複雑なタスクに組み合わせることはできなかった。ディクスタリティの能力を更に検証するため,ディクスタラス操作のためのテストベッドとしてドラム加工を提案する。ドラムは、ドラムスティックを指で安定・調整するための手動制御、ドラム面の繰り返し打撃によるコンタクトリッチな相互作用、ドラムを切り替えたりリズミカルな演奏を継続する際の長い水平調整、という3つの課題を自然に統合している。本稿では,DexDrummerについて述べる。DexDrummerは,Sim-to-realトランスファーを用いたシミュレーションで訓練された階層的オブジェクト中心のバイマンダルドラム政策である。この枠組みは、ドラム間の高速遷移のための軌道計画と残差RL補正を組み合わせることにより、純粋な強化学習の探索困難を低減させる。巧妙な操作ポリシーは、指-スティックとスティック-ドラムの相互作用の両方を明示的にモデル化する報酬によって導かれる、コンタクトリッチなダイナミクスを扱う。シミュレーションでは,マルチドラム,バイマニュアル楽曲,ディクスタリティの増大を必要とする技術的課題の2つのスタイルを演奏できることを示す。シミュレーションされたバイマニュアルタスク全体にわたって、当社の厳密でリアクティブなポリシーは、簡単な曲で1.87倍、ハードソングで1.22倍という、固定的な把握ポリシーよりも優れています。実世界のタスクでは,マルチドラム構成の楽曲のパフォーマンスを示す。 DexDrummerはF1スコア1.0のトレーニング曲と拡張バージョンを再生できます。

論文の概要: DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming

関連論文リスト