Fugu-MT 論文翻訳(概要): Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention

論文の概要: Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention

arxiv url: http://arxiv.org/abs/2605.15157v2
Date: Wed, 20 May 2026 14:31:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 14:55:44.198176
Title: Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention
Title（参考訳）: Hand-in-the-Loop:Seamless Hand-ArmインターベンションによるデクサラスマニピュレーションのためのVLAポリシーの改善
Authors: Zhuohang Li, Liqun Huang, Wei Xu, Zhengming Zhu, Nie Lin, Xiao Ma, Xinjun Sheng, Ruoshi Wen,
Abstract要約: Hand-in-the-Loop (HandITL) は、人間の修正意図と自律的な政策実行をブレンドする。 HandITLは介入ジッタを99.8%削減し、介入後の堅牢な操作を維持する。政策改善のための修正データ収集に使用される場合、HandITLは標準遠隔操作データで訓練された者より19%優れるポリシーを出力する。
参考スコア（独自算出の注目度）: 15.34162270431179
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vision-Language-Action (VLA) models are prone to compounding errors in dexterous manipulation, where high-dimensional action spaces and contact-rich dynamics amplify small policy deviations over long horizons. While Interactive Imitation Learning (IIL) can refine policies through human correction data, applying it to high-degree-of-freedom (DoF) robotic hands remains challenging due to a command mismatch between human teleoperation and policy execution at the intervention moment, which causes abrupt robot-hand configuration changes, or "gesture jumps". We present Hand-in-the-Loop (HandITL), a seamless human-in-the-loop intervention method that blends human corrective intent with autonomous policy execution to avoid gesture jumps during bimanual dexterous manipulation. Compared with taking over control using direct teleoperation, HandITL reduces intervention jitter by 99.8% and preserves robust post-intervention manipulation, reducing grasp failures by 87.5% and mean completion time by 19.1%. We validate HandITL on tasks requiring bimanual coordination, tool use, and fine-grained long-horizon manipulation. When used to collect correction data for policy refinement, HandITL yields policies that outperform those trained with standard teleoperation data by 19% on average across three long-horizon dexterous tasks.
Abstract（参考訳）: VLA(Vision-Language-Action)モデルは、高次元のアクション空間とコンタクトリッチなダイナミクスが長い地平線上の小さな政策偏差を増幅するデキスタラスな操作において、エラーを複雑化する傾向にある。インタラクティブ・イミテーション・ラーニング (Interactive Imitation Learning, IIL) は、人間の修正データを通じてポリシーを洗練できるが、人間の遠隔操作と介入時の政策実行のコマンドミスマッチにより、ロボットの手の急激な構成変更や「ジェスチャージャンプ」を引き起こすため、ロボットハンドを高自由度(DoF)に応用することは依然として困難である。 HandITL (Hand-in-the-Loop) は、人間の矯正意図と自律的なポリシー実行をブレンドし、両眼的な操作時のジェスチャージャンプを回避する。直接遠隔操作による制御の引き継ぎに比べ、HandITLは介入ジッタを99.8%減らし、干渉後の堅牢な操作を維持し、把握障害を87.5%減らし、平均完了時間を19.1%減らした。両面調整,ツール使用,細粒度長水平操作を必要とするタスクに対してHandITLを検証した。政策改善のための修正データ収集に使用される場合、HandITLは、標準的な遠隔操作データで訓練された人より19%上回るポリシーを3つの長期的デキスタラスタスクで取得する。

論文の概要: Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention

関連論文リスト