Fugu-MT 論文翻訳(概要): FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation

論文の概要: FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation

arxiv url: http://arxiv.org/abs/2604.20689v2
Date: Thu, 23 Apr 2026 14:32:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.060627
Title: FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation
Title（参考訳）: FingerEye: ディクサラスマニピュレーションのための連続的かつ統一された視覚触覚センシング
Authors: Zhixuan Xu, Yichen Li, Xuanye Wu, Tianyu Qiu, Lin Shao,
Abstract要約: 有害なロボット操作は、あらゆる段階の相互作用を包括的に知覚する必要がある。我々は,対話プロセスを通じて連続的な視覚触覚フィードバックを提供する,コンパクトで費用効率のよいセンサであるFingerEyeを紹介する。我々は、複数のFingerEyeセンサーからの信号を融合して、限られた実世界のデータから巧妙な操作行動を学習する視覚触覚模倣学習ポリシーを開発した。
参考スコア（独自算出の注目度）: 8.809778152652724
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Dexterous robotic manipulation requires comprehensive perception across all phases of interaction: pre-contact, contact initiation, and post-contact. Such continuous feedback allows a robot to adapt its actions throughout interaction. However, many existing tactile sensors, such as GelSight and its variants, only provide feedback after contact is established, limiting a robot's ability to precisely initiate contact. We introduce FingerEye, a compact and cost-effective sensor that provides continuous vision-tactile feedback throughout the interaction process. FingerEye integrates binocular RGB cameras to provide close-range visual perception with implicit stereo depth. Upon contact, external forces and torques deform a compliant ring structure; these deformations are captured via marker-based pose estimation and serve as a proxy for contact wrench sensing. This design enables a perception stream that smoothly transitions from pre-contact visual cues to post-contact tactile feedback. Building on this sensing capability, we develop a vision-tactile imitation learning policy that fuses signals from multiple FingerEye sensors to learn dexterous manipulation behaviors from limited real-world data. We further develop a digital twin of our sensor and robot platform to improve policy generalization. By combining real demonstrations with visually augmented simulated observations for representation learning, the learned policies become more robust to object appearance variations. Together, these design aspects enable dexterous manipulation across diverse object properties and interaction regimes, including coin standing, chip picking, letter retrieving, and syringe manipulation. The hardware design, code, appendix, and videos are available on our project website: https://nus-lins-lab.github.io/FingerEyeWeb/
Abstract（参考訳）: 有害なロボット操作は、事前接触、接触開始、後接触など、あらゆる段階の相互作用の包括的認識を必要とする。このような継続的なフィードバックにより、ロボットは相互作用を通してその行動に適応することができる。しかし、GelSightなどの既存の触覚センサーの多くは、接触が確立した後にのみフィードバックを提供し、ロボットの接触を正確に開始する能力を制限する。我々は,対話プロセスを通じて連続的な視覚触覚フィードバックを提供する,コンパクトで費用効率のよいセンサであるFingerEyeを紹介する。 FingerEyeは両眼のRGBカメラを統合し、暗黙のステレオ深度で近距離の視覚知覚を提供する。これらの変形はマーカーベースのポーズ推定によって捉えられ、接触レンチセンシングのプロキシとして機能する。この設計により、事前に接触した視覚的手がかりから触覚後のフィードバックへ円滑に遷移する知覚ストリームが実現される。このセンシング機能に基づいて,複数のFingerEyeセンサからの信号を融合した視覚触覚模倣学習ポリシーを開発し,実世界の限られたデータから巧妙な操作動作を学習する。我々はさらに、ポリシーの一般化を改善するために、センサとロボットプラットフォームのデジタルツインを開発する。実演と視覚的に拡張された表現学習の観察を組み合わせることで、学習されたポリシーは、オブジェクトの外観の変化に対してより堅牢になる。これらのデザインの側面は、コインスタンド、チップピック、レター検索、注射器操作など、さまざまな物体の性質や相互作用の仕組みをまたいだ巧妙な操作を可能にする。ハードウェア設計、コード、付録、ビデオは、プロジェクトのWebサイト(https://nus-lins-lab.github.io/FingerEyeWeb/)で入手できる。

論文の概要: FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation

関連論文リスト