Fugu-MT 論文翻訳(概要): Mind the Shape Gap: A Benchmark and Baseline for Deformation-Aware 6D Pose Estimation of Agricultural Produce

論文の概要: Mind the Shape Gap: A Benchmark and Baseline for Deformation-Aware 6D Pose Estimation of Agricultural Produce

arxiv url: http://arxiv.org/abs/2603.27429v1
Date: Sat, 28 Mar 2026 22:06:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:44.953654
Title: Mind the Shape Gap: A Benchmark and Baseline for Deformation-Aware 6D Pose Estimation of Agricultural Produce
Title（参考訳）: マインド・ザ・シェイプ・ギャップ:農業生産物の変形を考慮した6Dポス推定のためのベンチマークとベースライン
Authors: Nikolas Chatzis, Angeliki Tsinouka, Katerina Papadimitriou, Niki Efthymiou, Marios Glytsos, George Retsinas, Paris Oikonomou, Gerasimos Potamianos, Petros Maragos, Panagiotis Paraskevas Filntisis,
Abstract要約: PEARは,8つのカテゴリーにまたがって,関節6次元ポーズと3次元変形基底真理を提供する最初のベンチマークである。実世界の生産物に固有の幾何学的偏差に直面すると,最先端の手法が最大6倍の性能劣化に悩まされることを示す。単一画像からの6次元ポーズと明示的な格子変形を共同で予測する統合RGBのみのフレームワークSEEDを提案する。
参考スコア（独自算出の注目度）: 32.03751544524548
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Accurate 6D pose estimation for robotic harvesting is fundamentally hindered by the biological deformability and high intra-class shape variability of agricultural produce. Instance-level methods fail in this setting, as obtaining exact 3D models for every unique piece of produce is practically infeasible, while category-level approaches that rely on a fixed template suffer significant accuracy degradation when the prior deviates from the true instance geometry. To bridge such lack of robustness to deformation, we introduce PEAR (Pose and dEformation of Agricultural pRoduce), the first benchmark providing joint 6D pose and per-instance 3D deformation ground truth across 8 produce categories, acquired via a robotic manipulator for high annotation accuracy. Using PEAR, we show that state-of-the-art methods suffer up to 6x performance degradation when faced with the inherent geometric deviations of real-world produce. Motivated by this finding, we propose SEED (Simultaneous Estimation of posE and Deformation), a unified RGB-only framework that jointly predicts 6D pose and explicit lattice deformations from a single image across multiple produce categories. Trained entirely on synthetic data with generative texture augmentation applied at the UV level, SEED outperforms MegaPose on 6 out of 8 categories under identical RGB-only conditions, demonstrating that explicit shape modeling is a critical step toward reliable pose estimation in agricultural robotics.
Abstract（参考訳）: ロボット収穫における高精度な6次元ポーズ推定は, 生物変形性や農作物のクラス内形状のばらつきによって根本的に阻害される。インスタンスレベルの手法はこの設定では失敗し、各ユニークな生産物に対して正確な3Dモデルを取得することは事実上不可能であり、一方、固定テンプレートに依存するカテゴリレベルのアプローチは、前のインスタンスが真のインスタンス形状から逸脱した場合、かなり精度が低下する。 PEAR(Pose and dEformation of Agricultural pRoduce)は,ロボットマニピュレータを用いて高精度な精度で取得した8つのカテゴリーにまたがる関節6Dポーズと3次元変形グラウンド真実を提供する最初のベンチマークである。 PEARを用いて、実世界の生産物に固有の幾何学的偏差に直面すると、最先端の手法が最大6倍の性能劣化に悩まされることを示す。この発見を動機として,複数生成カテゴリにまたがる単一画像からの6次元ポーズと明示的な変形を共同で予測する統一RGB専用フレームワークSEED(Simultaneous Estimation of posE and deformation)を提案する。 SEEDは、UVレベルに適用された生成的テクスチャ増強による合成データに基づいて、同一のRGBのみの条件下で8つのカテゴリのうち6つでMegaPoseを上回り、農業ロボット工学において、明確な形状モデリングが信頼性の高いポーズ推定への重要なステップであることを実証した。

論文の概要: Mind the Shape Gap: A Benchmark and Baseline for Deformation-Aware 6D Pose Estimation of Agricultural Produce

関連論文リスト