Fugu-MT Paper Translation (Abstract): Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language

Paper overview: Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language

arxiv url: http://arxiv.org/abs/2605.27886v1
Date: Wed, 27 May 2026 03:08:21 GMT
Status: Translation complete
Last updated in system: 2026-05-28 17:38:55.704684
Title: Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language
Title (reference translation): Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language
Authors: Qiwei Wu, Rui Zhang, Xin Xiang, Tao Li, Weihua Zhang, Junjie Lai, Renjing Xu
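Abstract summary: We introduce Tabero, a benchmark and model suite for gentle, language-conditioned robotic manipulation. We propose Tabero-VTLA, an architecture with a decoupled force-position command interface. The model maintains high task success while reducing average grip force by over 70% under gentle instructions.
Reference score (independently computed attention score): 21.997523369157093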
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Tactile sensing is essential for robots to achieve human-like gentle manipulation. However, existing Vision-Language-Action (VLA) models struggle to exploit tactile feedback for gentle manipulation due to scarce aligned vision-tactile-language data and the lack of effective closed-loop force feedback mechanisms. To address these challenges, we introduce Tabero, a benchmark and model suite for gentle, language-conditioned robotic manipulation that demands fine-grained contact force perception. First, the Tabero benchmark addresses the scarcity of tactile data by presenting a data-efficient pipeline that repurposes open-source robot manipulation trajectories to generate diverse vision-tactile-language tasks, and establishes a multidimensional evaluation protocol that measures task success alongside physical interaction quality. Second, we propose Tabero-VTLA, an architecture with a decoupled force-position command interface; the resulting force-position commands are executed by a fixed hybrid controller to enable real-time, force-aware manipulation. Evaluated on Tabero, our model maintains high task success while reducing average grip force by over 70% under gentle instructions, demonstrating its ability to modulate interaction forces based on multimodal experience. Our code is publicly available at https://github.com/NathanWu7/Tabero.
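Abstract (reference translation): Tactile sensing is essential for robots to achieve human-like gentle manipulation. However, existing Vision-Language-Action (VLA) models struggle to use tactile feedback for gentle manipulation because aligned vision-tactile-language data is scarce and effective closed-loop force feedback mechanisms are lacking. To address these challenges, we introduce Tabero, a benchmark and model suite for gentle, language-conditioned robotic manipulation that requires fine-grained contact force perception. First, the Tabero benchmark addresses the scarcity of tactile data by presenting a data-efficient pipeline that repurposes open-source robot manipulation trajectories to generate diverse vision-tactile-language tasks, and by establishing a multidimensional evaluation protocol that measures task success together with physical interaction quality. Second, we propose Tabero-VTLA, an architecture with a decoupled force-position command interface. The proposed model maintains high task success while reducing average grip force by over 70% under gentle instructions, demonstrating its ability to modulate interaction forces based on multimodal experience. Our code is publicly available at https://github.com/NathanWu7/Tabero.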

Related papers