Fugu-MT 論文翻訳(概要): BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation

論文の概要: BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation

arxiv url: http://arxiv.org/abs/2504.02812v1
Date: Thu, 03 Apr 2025 17:55:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-04-11 15:25:21.318063
Title: BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation
Title（参考訳）: BOP Challenge 2024 on Model-based and Model-free 6D Object Pose Estimation
Authors: Van Nguyen Nguyen, Stephen Tyree, Andrew Guo, Mederic Fourmy, Anas Gouda, Taeyeop Lee, Sungphill Moon, Hyeontae Son, Lukas Ranftl, Jonathan Tremblay, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Stan Birchfield, Jiri Matas, Yann Labbe, Martin Sundermeyer, Tomas Hodan,
Abstract要約: 一連のパブリックコンペティションの第6回は、6Dオブジェクトでアートの状態をキャプチャするために組織された。 2024年、我々は3Dオブジェクトモデルが利用できず、提供された参照ビデオからのみオブジェクトをオンボードする必要がある新しいモデルフリータスクを導入した。我々は、テスト画像で見える物体の同一性が入力として提供されない、より実用的な6Dオブジェクト検出タスクを定義した。
参考スコア（独自算出の注目度）: 55.13521733366838
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2024, the sixth in a series of public competitions organized to capture the state of the art in 6D object pose estimation and related tasks. In 2024, our goal was to transition BOP from lab-like setups to real-world scenarios. First, we introduced new model-free tasks, where no 3D object models are available and methods need to onboard objects just from provided reference videos. Second, we defined a new, more practical 6D object detection task where identities of objects visible in a test image are not provided as input. Third, we introduced new BOP-H3 datasets recorded with high-resolution sensors and AR/VR headsets, closely resembling real-world scenarios. BOP-H3 include 3D models and onboarding videos to support both model-based and model-free tasks. Participants competed on seven challenge tracks, each defined by a task, object onboarding setup, and dataset group. Notably, the best 2024 method for model-based 6D localization of unseen objects (FreeZeV2.1) achieves 22% higher accuracy on BOP-Classic-Core than the best 2023 method (GenFlow), and is only 4% behind the best 2023 method for seen objects (GPose2023) although being significantly slower (24.9 vs 2.7s per image). A more practical 2024 method for this task is Co-op which takes only 0.8s per image and is 25X faster and 13% more accurate than GenFlow. Methods have a similar ranking on 6D detection as on 6D localization but higher run time. On model-based 2D detection of unseen objects, the best 2024 method (MUSE) achieves 21% relative improvement compared to the best 2023 method (CNOS). However, the 2D detection accuracy for unseen objects is still noticealy (-53%) behind the accuracy for seen objects (GDet2023). The online evaluation system stays open and is available at http://bop.felk.cvut.cz/
Abstract（参考訳）: BOPチャレンジ2024は、6Dオブジェクトにおける最先端のポーズ推定と関連するタスクを捉えるために組織された一連の公開コンペティションの6番目である。 2024年、私たちの目標は、BOPをラボのようなセットアップから現実のシナリオに移行することです。まず、3Dオブジェクトモデルが利用できず、提供された参照ビデオからのみオブジェクトをオンボードする必要がある新しいモデルフリータスクを紹介した。第2に、テスト画像に写っている物体の身元が入力として提供されない、より実用的な6Dオブジェクト検出タスクを定義した。第3に、高解像度センサーとAR/VRヘッドセットで記録された新しいBOP-H3データセットを導入しました。 BOP-H3には3Dモデルと、モデルベースとモデルフリーの両方をサポートするビデオが同梱されている。参加者は7つのチャレンジトラックで競い、それぞれがタスク、オブジェクトのオンボーディング設定、データセットグループによって定義される。特に、未確認物体のモデルベースの6Dローカライズ法(FreeZeV2.1)では、2023法(GenFlow)よりもBOP-Classic-Coreの方が22%高い精度を実現し(GPose2023)、画像当たり24.9対2.7秒)、観察対象に対する2023法(GPose2023)では4%しか遅れない(画像あたり24.9対2.7秒)。このタスクのより実用的な2024の手法はCo-opであり、画像あたり0.8秒しか必要とせず、GenFlowよりも25倍高速で13%精度がある。方法は6Dローカライゼーションと同様に6D検出に類似しているが、実行時間も高い。モデルに基づく未知物体の2次元検出では、2024法(MUSE)が2023法(CNOS)と比較して21%改善されている。しかし、未確認物体の2次元検出精度は、まだ目に見える物体の精度より53%遅れている(GDet2023)。オンライン評価システムはオープンであり、http://bop.felk.cvut.cz/で利用可能である。

論文の概要: BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation

関連論文リスト