Fugu-MT 論文翻訳(概要): Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis

論文の概要: Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis

arxiv url: http://arxiv.org/abs/2603.13528v1
Date: Fri, 13 Mar 2026 19:02:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:10.92132
Title: Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis
Title（参考訳）: 擬似故障合成による動作可能なマニピュレーションの学習
Authors: Dayou Li, Jiuzhou Lei, Hao Wang, Lulin Liu, Yunhao Yang, Zihan Wang, Bangya Liu, Minghui Zheng, Zhiwen Fan,
Abstract要約: 現在の障害学習パラダイムは、コストと安全性の低い実世界のデータ収集か、シミュレータベースの摂動に依存している。このフレームワークは,実世界の実演を成功させるのから直接,フォトリアリスティックでファクトファクトファクトのロールアウトを合成するものだ。生成の世界モデル内でアクションを摂動させることで、Dream2Fixはシミュレータに頼ることなくペアの失敗言語データを生成する。
参考スコア（独自算出の注目度）: 21.197844940385725
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While recent foundation models have significantly advanced robotic manipulation, these systems still struggle to autonomously recover from execution errors. Current failure-learning paradigms rely on either costly and unsafe real-world data collection or simulator-based perturbations, which introduce a severe sim-to-real gap. Furthermore, existing visual analyzers predominantly output coarse, binary diagnoses rather than the executable, trajectory-level corrections required for actual recovery. To bridge the gap between failure diagnosis and actionable recovery, we introduce Dream2Fix, a framework that synthesizes photorealistic, counterfactual failure rollouts directly from successful real-world demonstrations. By perturbing actions within a generative world model, Dream2Fix creates paired failure-correction data without relying on simulators. To ensure the generated data is physically viable for robot learning, we implement a structured verification mechanism that strictly filters rollouts for task validity, visual coherence, and kinematic safety. This engine produces a high-fidelity dataset of over 120k paired samples. Using this dataset, we fine-tune a vision-language model to jointly predict failure types and precise recovery trajectories, mapping visual anomalies directly to corrective actions. Extensive real-world robotic experiments show our approach achieves state-of-the-art correction accuracy, improving from 19.7% to 81.3% over prior baselines, and successfully enables zero-shot closed-loop failure recovery in physical deployments.
Abstract（参考訳）: 最近の基礎モデルではロボット操作が大幅に進歩しているが、これらのシステムは実行エラーから自律的に回復するのに苦戦している。現在の障害学習パラダイムは、コストと安全性の低い実世界のデータ収集か、シミュレータベースの摂動に依存している。さらに、既存のビジュアルアナライザは、実際の回復に必要な軌道レベルの実行可能な補正よりも、粗いバイナリ診断を主に出力する。故障診断と動作可能回復のギャップを埋めるために,実世界の実演成功から直接,フォトリアリスティックなフォールトロールアウトを合成するフレームワークDream2Fixを紹介した。生成的世界モデル内でのアクションの摂動により、Dream2Fixはシミュレータに頼ることなくペアの失敗補正データを生成する。ロボット学習において生成したデータが物理的に有効であることを保証するため,タスクの妥当性,視覚的コヒーレンス,運動安全のためのロールアウトを厳格にフィルタリングする構造的検証機構を実装した。このエンジンは120k以上のサンプルからなる高忠実度データセットを生成する。このデータセットを用いて視覚言語モデルを微調整し、障害タイプと正確な回復軌跡を共同で予測し、視覚異常を直接修正行動にマッピングする。大規模な実世界のロボット実験により、我々のアプローチは最先端の精度を達成し、以前のベースラインよりも19.7%から81.3%向上し、物理的展開においてゼロショットクローズループ障害の回復を成功させた。

論文の概要: Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis

関連論文リスト