Fugu-MT 論文翻訳(概要): Failure Prediction at Runtime for Generative Robot Policies

論文の概要: Failure Prediction at Runtime for Generative Robot Policies

arxiv url: http://arxiv.org/abs/2510.09459v1
Date: Fri, 10 Oct 2025 15:09:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 04:53:46.94975
Title: Failure Prediction at Runtime for Generative Robot Policies
Title（参考訳）: 生成ロボットポリシー実行時の故障予測
Authors: Ralf Römer, Adrian Kobras, Luca Worbis, Angela P. Schoellig,
Abstract要約: 実行中の早期の障害予測は、人間中心で安全クリティカルな環境でロボットをデプロイするために不可欠である。本稿では,フェールデータを必要としない生成ロボットポリシーの故障予測フレームワークであるFIPERを提案する。その結果、FIPERは実際の障害と良質なOOD状況とをよく区別し、既存の手法よりも正確に早期に障害を予測できることがわかった。
参考スコア（独自算出の注目度）: 6.375597233389154
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Imitation learning (IL) with generative models, such as diffusion and flow matching, has enabled robots to perform complex, long-horizon tasks. However, distribution shifts from unseen environments or compounding action errors can still cause unpredictable and unsafe behavior, leading to task failure. Early failure prediction during runtime is therefore essential for deploying robots in human-centered and safety-critical environments. We propose FIPER, a general framework for Failure Prediction at Runtime for generative IL policies that does not require failure data. FIPER identifies two key indicators of impending failure: (i) out-of-distribution (OOD) observations detected via random network distillation in the policy's embedding space, and (ii) high uncertainty in generated actions measured by a novel action-chunk entropy score. Both failure prediction scores are calibrated using a small set of successful rollouts via conformal prediction. A failure alarm is triggered when both indicators, aggregated over short time windows, exceed their thresholds. We evaluate FIPER across five simulation and real-world environments involving diverse failure modes. Our results demonstrate that FIPER better distinguishes actual failures from benign OOD situations and predicts failures more accurately and earlier than existing methods. We thus consider this work an important step towards more interpretable and safer generative robot policies. Code, data and videos are available at https://tum-lsy.github.io/fiper_website.
Abstract（参考訳）: 拡散やフローマッチングなどの生成モデルを用いた模倣学習(IL)により、ロボットは複雑な長距離タスクを実行できるようになった。しかしながら、予期せぬ環境や複雑なアクションエラーからの分散シフトは、予測不可能で安全でない振る舞いを引き起こす可能性があるため、タスクの失敗につながる。したがって、実行中の早期の障害予測は、人間中心で安全クリティカルな環境にロボットを配置するために不可欠である。本稿では,障害データを必要としない生成ILポリシに対して,実行時の障害予測のための一般的なフレームワークであるFIPERを提案する。 FIPERは、差し迫った失敗の2つの重要な指標を特定します。 (i)政策の埋め込み空間におけるランダムネットワーク蒸留により検出されたアウト・オブ・ディストリビューション(OOD)観測、及び (II)新しいアクションチャンクエントロピースコアによって測定された生成行動の不確実性が高い。両方の故障予測スコアは、共形予測を通じて小さな成功ロールアウトセットを使用して校正される。両方のインジケータが短時間のウィンドウに集約され、しきい値を超えると、障害アラームがトリガーされる。各種故障モードを含む5つのシミュレーション環境および実環境におけるFIPERの評価を行った。その結果、FIPERは実際の障害と良質なOOD状況とをよく区別し、既存の手法よりも正確に早期に障害を予測できることがわかった。そこで本研究は,より解釈可能な,より安全な生成ロボットポリシーに向けた重要なステップであると考えている。コード、データ、ビデオはhttps://tum-lsy.github.io/fiper_website.orgで公開されている。

論文の概要: Failure Prediction at Runtime for Generative Robot Policies

関連論文リスト