Fugu-MT 論文翻訳(概要): VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction

論文の概要: VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction

arxiv url: http://arxiv.org/abs/2605.27114v2
Date: Thu, 28 May 2026 19:27:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-01 13:54:20.989137
Title: VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction
Title（参考訳）: VR-DAgger:不確実なデータ収集と不確実なオンライン補正のための没入型VR
Authors: René Zurbrügg, Tifanny Portela, Arjun Bhardwaj, Aravind Elanjimattathil Vijayan, Maximum Wilder-Smith, Marco Hutter,
Abstract要約: 提案するVR-DAggerは,遠隔操作,デモコレクション,選択的なポリシー修正のためのヒューマン・イン・ザ・ループ・フレームワークである。 VR-DAggerは、完全なロールアウトではなく、選択したセグメントをレビューすることで、サンプル単位のコレクション時間を約40%削減する。
参考スコア（独自算出の注目度）: 5.847492700915662
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Learning from demonstrations is effective for robotic manipulation, but collecting sufficient task-specific data remains a major bottleneck. Under distribution shift, small errors compound, performance degrades, and expert time is often spent on redundant, low-value corrections instead of the few critical failure cases. We present VR-DAgger, a human-in-the-loop framework centered on an immersive VR application for dexterous teleoperation, demonstration collection, and selective policy correction. The VR client provides intuitive hand control with synchronized scene visualization, while a backend workstation runs simulation and learning, enabling autonomous rollouts without continuous operator oversight. We use Monte Carlo (MC) dropout to score uncertainty during Isaac Lab rollouts of a diffusion policy and select informative failure segments for correction. These segments are replayed in VR as clips, where the operator selectively labels and corrects the policy's behavior, concentrating supervision where uncertainty is highest without full-rollout monitoring or a separate intervention classifier. We evaluate on three dexterous manipulation tasks (Pan pick-and-place, Drawer opening, Valve turning) with a 10-DoF XHand under standard and challenging initial configurations. Active labeling consistently improves over behavioral cloning across all tasks, with gains of up to 23 percentage points. Compared to unguided human-in-the-loop inspection, VR-DAgger reduces per-sample collection time by approximately 40% by focusing review on selected segments rather than full rollouts.
Abstract（参考訳）: デモから学ぶことはロボット操作に有効だが、十分なタスク固有のデータを集めることは大きなボトルネックである。分散シフトの下では、小さなエラーが複雑になり、パフォーマンスが低下し、専門家の時間は、少数の重大な障害ケースではなく、冗長で低い値の修正に費やされることが多い。本稿では,没入型VRアプリケーションを中心としたVR-DAggerについて紹介する。バックエンドのワークステーションはシミュレーションと学習を実行し、継続的なオペレータの監視なしに自律的なロールアウトを可能にする。我々は、Isaac Labの拡散ポリシーのロールアウト中に不確実性を評価するためにMonte Carlo(MC)のドロップアウトを使用し、修正のために情報的障害セグメントを選択する。これらのセグメントはVRでクリップとして再生され、オペレータがポリシーの動作を選択的にラベル付けして修正し、フルロールアウト監視や個別の介入分類器なしで不確実性が最も高い監視に集中する。我々は,10-DoF XHand の3つの操作タスク (Pan Pick-and-place, Drawer Open, Valve Turn) を,標準的かつ困難な初期設定の下で評価した。アクティブなラベル付けは、すべてのタスクにおける行動的クローンよりも一貫して改善され、最大23ポイントまで上昇する。イン・ザ・ループ検査と比較して、VR-DAggerは全ロールアウトではなく、選択したセグメントをレビューすることで、サンプルごとの収集時間を約40%短縮する。

論文の概要: VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction

関連論文リスト