Fugu-MT 論文翻訳(概要): SkiP: When to Skip and When to Refine for Efficient Robot Manipulation

論文の概要: SkiP: When to Skip and When to Refine for Efficient Robot Manipulation

arxiv url: http://arxiv.org/abs/2605.15536v1
Date: Fri, 15 May 2026 02:16:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-18 17:44:16.29509
Title: SkiP: When to Skip and When to Refine for Efficient Robot Manipulation
Title（参考訳）: SkiP: 効率的なロボット操作のために、いつスキップするか、いつリファインするか
Authors: Mingtong Dai, Guanqi Peng, Yongjie Bai, Feng Yan, Chunjie Chen, Lingbo Liu, Liang Lin, Xinyu Wu,
Abstract要約: textbfSkip Policy (SkiP)は、スキップセグメントを動的に跳躍し、キーセグメントのアクションを集中的に洗練する。 emphMotion Spectrum Keying (MSK) は、動作信号から局所的な動きの複雑さを検出する高速でタスクに依存しない手順である。実験によると、SkiPは実行ステップを15ドルから40%ほど削減し、さまざまなポリシーバックボーン間での成功率のマッチングや改善を実現している。
参考スコア（独自算出の注目度）: 50.03201984643502
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Previous imitation learning policies predict future actions at every control step, whether in smooth motion phases or precise, contact-rich operation phases. This uniform treatment is wasteful: most steps in a manipulation trajectory traverse free space and carry little task-relevant information, while a small fraction of \emph{key} steps around contacts, grasps, and alignment demand dense, high-resolution prediction. We propose a novel \emph{action relabeling} mechanism: at each timestep in a skip segment, we replace the behavior cloning target with the action at the entrance of the next key segment, enabling the policy to leap over redundant steps in a single decision. The resulting \textbf{Skip Policy (SkiP)} dynamically leaps over skip segments and intensively refines actions in key segments, within a single unified network requiring no learned skip planner or hierarchical structure. To automatically partition demonstrations into key and skip segments without manual annotation, we introduce \emph{Motion Spectrum Keying} (MSK), a fast, task-agnostic procedure that detects local motion complexity from action signals. Extensive experiments across 72 simulated manipulation tasks and three real-robot tasks show that SkiP reduces executed steps by $15$--$40\%$ while matching or improving success rates across various policy backbones. Project page: \texttt{https://pgq18.github.io/SkiP-page/}.
Abstract（参考訳）: 従来の模倣学習ポリシーは、スムーズな動作フェーズでも、接触豊富な操作フェーズであっても、制御ステップ毎に将来の動作を予測する。この一様処理は無駄であり、操作軌跡の自由空間のほとんどのステップはタスク関連情報をほとんど持たず、一方、接点、グリップ、アライメントの要求密度、高解像度の予測はごくわずかである。スキップセグメントの各時間ステップにおいて、次のキーセグメントの入り口のアクションに振舞いクローンターゲットを置き換え、冗長なステップを1つの決定で跳躍できるように、新しい「emph{action relabeling}」機構を提案する。結果として生じる‘textbf{Skip Policy(SkiP)’は、スキップセグメントを動的に跳躍し、キーセグメントでのアクションを、学習されたスキッププランナーや階層構造を必要としない単一の統一ネットワーク内で、集中的に洗練する。手動のアノテーションを使わずに、デモをキーに分割し、セグメントをスキップするために、アクション信号から局所的な動きの複雑さを検出する高速なタスク非依存の手順である 'emph{Motion Spectrum Keying} (MSK) を導入する。 72のシミュレートされた操作タスクと3つの実ロボットタスクにわたる大規模な実験によると、SkiPは、さまざまなポリシーバックボーン間の成功率のマッチングや改善をしながら、実行ステップを15ドル～40ドル%削減する。プロジェクトページ: \texttt{https://pgq18.github.io/SkiP-page/}。

論文の概要: SkiP: When to Skip and When to Refine for Efficient Robot Manipulation

関連論文リスト