Fugu-MT 論文翻訳(概要): PGDG: Physically Grounded Data Generation for Robust Bimanual Policy Learning from a Single Demonstration

論文の概要: PGDG: Physically Grounded Data Generation for Robust Bimanual Policy Learning from a Single Demonstration

arxiv url: http://arxiv.org/abs/2605.21710v1
Date: Wed, 20 May 2026 20:14:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:41.98435
Title: PGDG: Physically Grounded Data Generation for Robust Bimanual Policy Learning from a Single Demonstration
Title（参考訳）: PGDG:ロバストなバイマラルポリシー学習のための物理接地データ生成
Authors: Cunxi Dai, Haoran Chang, Aditya Nisal, Rahul Kumar, Guofei Chen, Tao Chen, Yuzhe Qin, Guanya Shi,
Abstract要約: ゼロショットキュレーションを備えたデータ生成フレームワークPGDGを提案する。 PGDGは物理地上のサンプルとデータセットキュレーターを反復する。シミュレーションと実世界転送の両方において、空間のみの増大を一貫して上回っている。
参考スコア（独自算出の注目度）: 13.432047023375608
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Behavior cloning for contact-rich bimanual manipulation remains challenging because diverse demonstrations are expensive to collect, and even small disturbances can push the system into off-manifold states where no recovery supervision is available. We propose PGDG, a data generation framework with zero-shot curation that expands a single demonstration into a compact dataset of physically plausible, successful, and diverse recovery behaviors without additional human labeling. PGDG iterates between a physics-grounded sampler and a dataset curator, where the curator selects informative, non-redundant, and recoverable behaviors to update the sampling distribution toward under-covered recovery modes, and the sampler draws physically plausible rollout candidates from this updated distribution and retains successful trajectories. To further improve data quality, PGDG applies short-horizon sampling-based control to relabel selected risky states with corrective actions. Across four bimanual manipulation tasks, PGDG consistently outperforms spatial-only augmentation in both simulation and zero-shot real-world transfer. On RotateBox-Pitch, success improves from 38% to 93% in simulation and from 35% to 82% in the real world. PGDG also enables effective foundation models fine-tuning such as GR00T, increasing success from 46% to 77%. Additional results are available in our website: https://cunxid.github.io/PGDG/.
Abstract（参考訳）: 多様なデモンストレーションの収集が高価であり、小さな障害でさえ、回復の監督ができないオフマンド状態にシステムを押し上げることができるため、コンタクトリッチなバイマニュアル操作のための行動クローニングは依然として困難である。本稿では,ゼロショットキュレーションを施したデータ生成フレームワークPGDGを提案する。 PGDGは、物理接地されたサンプルラーとデータセットキュレーターの間を反復し、キュレーターは、サンプリング分布を未発見のリカバリモードに向けて更新するために、情報、非冗長、回復可能な振る舞いを選択し、サンプリングは、この更新された分布から物理的に妥当なロールアウト候補を引き出し、軌道を成功させる。データ品質をさらに改善するため、PGDGは短水平サンプリングに基づく制御を、修正作用のある選択されたリスク状態の緩和に適用した。 4つの双方向操作タスクの中で、PGDGは、シミュレーションとゼロショットの実世界転送の両方において、空間のみの増大を一貫して上回っている。 RotateBox-Pitchでは、シミュレーションでは38%から93%、現実世界では35%から82%に改善されている。 PGDGはまた、GR00Tのような効果的な基礎モデルの微調整を可能にし、成功率は46%から77%に増加した。追加の結果は、私たちのWebサイト(https://cunxid.github.io/PGDG/)で公開されています。

論文の概要: PGDG: Physically Grounded Data Generation for Robust Bimanual Policy Learning from a Single Demonstration

関連論文リスト