Fugu-MT Paper Translation (Abstract): Continual Hand-Eye Calibration for Open-world Robotic Manipulation

Paper overview: Continual Hand-Eye Calibration for Open-world Robotic Manipulation

arxiv url: http://arxiv.org/abs/2604.15814v1
Date: Fri, 17 Apr 2026 08:11:52 GMT
Status: Translation complete
Last updated in system: 2026-04-20 22:00:19.813692
Title: Continual Hand-Eye Calibration for Open-world Robotic Manipulation
Title (reference translation): Continual Hand-Eye Calibration for Open-World Robotic Manipulation
Authors: Fazeng Li, Gan Sun, Chenxi Liu, Yao He, Wei Cong, Yang Cong
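Abstract summary: Hand-eye calibration through visual localization is important for robotic manipulation in open-world environments. Most deep learning-based calibration models suffer from catastrophic forgetting when they adapt to unseen data amid open-world scene changes. This work proposes a continual hand-eye calibration framework that lets robots adapt to sequentially encountered open-world manipulation scenes.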
Reference score (independently computed attention score): 37.99491137671598
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Hand-eye calibration through visual localization is a critical capability for robotic manipulation in open-world environments. However, most deep learning-based calibration models suffer from catastrophic forgetting when adapting to unseen data amid open-world scene changes, and simple rehearsal-based continual learning strategies do not mitigate this issue well. To overcome this challenge, we propose a continual hand-eye calibration framework that enables robots to adapt to sequentially encountered open-world manipulation scenes through a spatial-aware replay strategy and structure-preserving distillation. Specifically, a Spatial-Aware Replay Strategy (SARS) constructs a geometrically uniform replay buffer that ensures comprehensive coverage of each scene's pose space, replacing redundant adjacent frames with maximally informative viewpoints. Meanwhile, Structure-Preserving Dual Distillation (SPDD) decomposes localization knowledge into coarse scene layout and fine pose precision and distills them separately to alleviate both types of forgetting during continual adaptation. When a new manipulation scene arrives, SARS provides geometrically representative replay samples from all prior scenes, and SPDD applies structured distillation on these samples to retain previously learned knowledge. After training on the new scene, SARS incorporates selected samples from the new scene into the replay buffer for future rehearsal, allowing the model to continuously accumulate multi-scene calibration capability. Experiments on multiple public datasets show significant resistance to scene forgetting: the framework maintains accuracy on past scenes while preserving adaptation to new scenes, confirming its effectiveness.
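Abstract (reference translation): Hand-eye calibration through visual localization is an important capability for robotic manipulation in open-world environments. However, most deep learning-based calibration models suffer from catastrophic forgetting when adapting to unseen data amid open-world scene changes, and simple rehearsal-based continual learning strategies cannot sufficiently mitigate the problem. To overcome this challenge, the paper proposes a continual hand-eye calibration framework in which a spatial replay strategy and structure-preserving distillation let robots adapt to sequentially encountered open-world manipulation scenes. Specifically, the Spatial-Aware Replay Strategy (SARS) builds a geometrically uniform replay buffer that guarantees comprehensive coverage of each scene's pose space, replacing redundant adjacent frames with maximally informative viewpoints. Structure-Preserving Dual Distillation (SPDD), in turn, decomposes localization knowledge into coarse scene layout and fine pose precision and distills the two separately to alleviate both kinds of forgetting during continual adaptation. When a new manipulation scene arrives, SARS supplies geometrically representative replay samples from all previous scenes, and SPDD applies structured distillation to those samples to retain previously learned knowledge. After training on the new scene, SARS adds selected samples from that scene to the replay buffer for future rehearsal, so the model keeps accumulating multi-scene calibration capability. Experiments on multiple public datasets show significant resistance to scene forgetting, maintaining accuracy on past scenes while preserving adaptation to new scenes and confirming the effectiveness of the framework.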
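
The abstract describes SARS as building a geometrically uniform replay buffer that covers each scene's pose space and drops redundant adjacent frames, but it does not say how samples are selected. The sketch below uses greedy farthest-point sampling over camera positions as an assumed stand-in for that selection step. The function names, the per-scene budget parameter, and the use of position only (ignoring orientation) are all hypothetical choices made for illustration, not the paper's method.

```python
import math
from typing import Dict, List, Sequence


def farthest_point_sample(
    positions: Sequence[Sequence[float]], k: int, start: int = 0
) -> List[int]:
    """Greedily pick up to k frame indices whose positions are spread apart.

    Assumption: farthest-point sampling stands in for the (unspecified)
    SARS selection rule; only camera positions are compared.
    """
    n = len(positions)
    if n == 0 or k <= 0:
        return []
    k = min(k, n)
    selected = [start]
    # min_dist[i] = distance from frame i to its nearest selected frame.
    min_dist = [math.dist(p, positions[start]) for p in positions]
    while len(selected) < k:
        nxt = max(range(n), key=min_dist.__getitem__)
        if min_dist[nxt] == 0.0:
            break  # every remaining frame coincides with a selected one
        selected.append(nxt)
        for i, p in enumerate(positions):
            d = math.dist(p, positions[nxt])
            if d < min_dist[i]:
                min_dist[i] = d
    return selected


def update_replay_buffer(
    buffer: Dict[str, List[int]],
    scene_id: str,
    positions: Sequence[Sequence[float]],
    per_scene_budget: int,
) -> Dict[str, List[int]]:
    """Store the selected frame indices of a newly trained scene (hypothetical helper)."""
    buffer[scene_id] = farthest_point_sample(positions, per_scene_budget)
    return buffer


# Three nearly identical adjacent frames followed by three distinct viewpoints.
trajectory = [
    (0.0, 0.0, 0.0), (0.01, 0.0, 0.0), (0.02, 0.0, 0.0),
    (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0),
]
replay = update_replay_buffer({}, "scene_1", trajectory, per_scene_budget=4)
print(replay["scene_1"])
```

With a budget of four, the near-duplicate frames at indices 1 and 2 are skipped in favour of the widely separated viewpoints, which is the kind of coverage the abstract attributes to SARS. A fuller version would presumably use a distance over the full pose (rotation as well as translation).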

Related papers