Fugu-MT 論文翻訳(概要): CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild

論文の概要: CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild

arxiv url: http://arxiv.org/abs/2203.03089v1
Date: Mon, 7 Mar 2022 01:36:22 GMT
ステータス: 翻訳完了
システム内更新日: 2022-03-08 17:42:46.643065
Title: CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild
Title（参考訳）: CPPF:野生におけるロバストなカテゴリーレベル9Dポス推定に向けて
Authors: Yang You, Ruoxi Shi, Weiming Wang, Cewu Lu
Abstract要約: カテゴリーレベルのPPF投票法は、野生における正確で堅牢で一般化可能な9Dポーズ推定を実現する。ノイズの多い点対のサンプルを排除し、個体群から最終的な予測を生成するために、新しい粗大な投票アルゴリズムを提案する。我々の手法は実世界のトレーニングデータと同等である。
参考スコア（独自算出の注目度）: 45.93626858034774
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper, we tackle the problem of category-level 9D pose estimation in the wild, given a single RGB-D frame. Using supervised data of real-world 9D poses is tedious and erroneous, and also fails to generalize to unseen scenarios. Besides, category-level pose estimation requires a method to be able to generalize to unseen objects at test time, which is also challenging. Drawing inspirations from traditional point pair features (PPFs), in this paper, we design a novel Category-level PPF (CPPF) voting method to achieve accurate, robust and generalizable 9D pose estimation in the wild. To obtain robust pose estimation, we sample numerous point pairs on an object, and for each pair our model predicts necessary SE(3)-invariant voting statistics on object centers, orientations and scales. A novel coarse-to-fine voting algorithm is proposed to eliminate noisy point pair samples and generate final predictions from the population. To get rid of false positives in the orientation voting process, an auxiliary binary disambiguating classification task is introduced for each sampled point pair. In order to detect objects in the wild, we carefully design our sim-to-real pipeline by training on synthetic point clouds only, unless objects have ambiguous poses in geometry. Under this circumstance, color information is leveraged to disambiguate these poses. Results on standard benchmarks show that our method is on par with current state of the arts with real-world training data. Extensive experiments further show that our method is robust to noise and gives promising results under extremely challenging scenarios. Our code is available on https://github.com/qq456cvb/CPPF.
Abstract（参考訳）: 本稿では,1つのRGB-Dフレームを考慮し,カテゴリーレベルの9Dポーズ推定の問題に取り組む。実世界の9dポーズの教師付きデータの使用は、退屈で誤ったものであり、目に見えないシナリオに一般化することができない。さらに、カテゴリレベルのポーズ推定には、テスト時に見えないオブジェクトに一般化できるメソッドが必要であるが、これも難しい。本稿では,従来の点対特徴(ppfs)から着想を得た新しいカテゴリーレベルのppf(cppf)投票法を考案し,実環境において高精度でロバストで一般化可能な9次元ポーズ推定を実現する。頑健なポーズ推定を得るためには,オブジェクト上に多数の点対をサンプリングし,各ペアに対して,オブジェクト中心,向き,スケールに関するSE(3)不変投票統計量を予測する。ノイズの多い点対サンプルを除去し, 個体群から最終予測を生成するため, 新たな粗粒間投票アルゴリズムを提案する。配向投票プロセスにおいて偽陽性を除去するために、各サンプル点対に対して補助的二項曖昧化分類タスクを導入する。野生の物体を検出するため、オブジェクトが幾何にあいまいなポーズをとらない限り、合成ポイントクラウドのみをトレーニングすることで、sim-to-realパイプラインを慎重に設計します。この状況下では、色情報はこれらのポーズを曖昧にするために利用される。標準ベンチマークの結果,本手法は実世界のトレーニングデータと同等であることが判明した。さらに広範な実験により,提案手法は雑音に対して頑健であり,極めて困難なシナリオにおいて有望な結果が得られることを示した。私たちのコードはhttps://github.com/qq456cvb/CPPFで利用可能です。

論文の概要: CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild

関連論文リスト