Fugu-MT Paper Translation (Abstract): Action-Prior Denoising for Smooth Real-Time Chunking

Paper overview: Action-Prior Denoising for Smooth Real-Time Chunking

arxiv url: http://arxiv.org/abs/2605.25537v1
Date: Mon, 25 May 2026 07:49:25 GMT
Status: Translation complete
Last updated in system: 2026-05-26 19:50:19.449281
Title: Action-Prior Denoising for Smooth Real-Time Chunking
Authors: Dongyang Liu, Zhaowen Zheng, Yu Sun, Longxu Zhang, Yixuan Liu, Hao Wan,
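Abstract summary: Real-time chunking (RTC) lets chunked action policies operate under inference delay by conditioning a newly generated action chunk on actions already committed by the previous chunk. Training-time RTC simulates this delay during learning and avoids expensive runtime guidance at deployment. This paper proposes Soft RTC, a generalization of training-time RTC based on action-prior denoising.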
Reference score (independently computed attention score): 12.956533987402054
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Real-time chunking (RTC) lets chunked action policies operate under inference delay by conditioning a newly generated action chunk on actions already committed by the previous chunk. Training-time RTC simulates this delay during learning and avoids expensive guidance at deployment, but its binary prefix mask treats all non-prefix tokens as fully unconstrained. This under-models asynchronous execution: early overlap actions are fixed, while later overlap actions remain editable but should still stay close to the previous plan. We propose Soft RTC, a training-time RTC generalization based on action-prior denoising. Soft RTC constructs corrupted overlap tokens from partially denoised states instead of pure noise and injects the aligned previous chunk as the same prior during inference through a lightweight token-wise blending rule. On the 12 released large Kinetix levels, a short soft window nearly matches hard training-time RTC in overall solve rate (0.809 vs. 0.815), while a medium window reduces high-delay action delta and jerk by 9.1% and 9.6% relative to hard RTC. Both variants keep near-naive runtime, unlike inference-time RTC baselines. A small preliminary real-robot sorting study provides additional evidence that training-time RTC can improve completion and that Soft RTC gives the lowest commanded-action finite-difference metrics among the tested policies.
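The abstract names a "lightweight token-wise blending rule" but does not spell it out. The following is a minimal sketch of what such a rule might look like, not the authors' implementation. It assumes scalar actions, a convex blend between the aligned previous chunk and noise, and a linear weight ramp over the soft window. The names `blend_weights`, `blend_tokens`, `hard_prefix`, and `soft_window` are hypothetical.

```python
import random


def blend_weights(chunk_len, hard_prefix, soft_window):
    """Per-token weight on the previous chunk: 1 = frozen, 0 = unconstrained."""
    weights = []
    for i in range(chunk_len):
        if i < hard_prefix:
            weights.append(1.0)  # already committed by the previous chunk
        elif i < hard_prefix + soft_window:
            # Linear ramp strictly between 1 and 0 across the soft window
            # (an assumed schedule, not taken from the paper).
            step = i - hard_prefix + 1
            weights.append(1.0 - step / (soft_window + 1))
        else:
            weights.append(0.0)  # beyond the overlap: no prior applied
    return weights


def blend_tokens(prev_chunk, noise, weights):
    """Token-wise convex blend of the aligned previous chunk and noise."""
    return [w * p + (1.0 - w) * n for p, n, w in zip(prev_chunk, noise, weights)]


rng = random.Random(0)
prev_chunk = [0.1 * i for i in range(8)]         # aligned previous plan
noise = [rng.gauss(0.0, 1.0) for _ in range(8)]  # pure-noise starting tokens
weights = blend_weights(chunk_len=8, hard_prefix=2, soft_window=3)
start = blend_tokens(prev_chunk, noise, weights)
```

Under these assumptions, a hard binary prefix mask is the special case `soft_window=0`, where every weight is either 1 or 0. A longer soft window pulls more of the overlap toward the previous plan, which is the trade-off the abstract reports between solve rate and smoothness.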

Related papers