Fugu-MT 論文翻訳(概要): Exploring Active 3D Object Detection from a Generalization Perspective

論文の概要: Exploring Active 3D Object Detection from a Generalization Perspective

arxiv url: http://arxiv.org/abs/2301.09249v1
Date: Mon, 23 Jan 2023 02:43:03 GMT
ステータス: 翻訳完了
システム内更新日: 2023-01-24 14:21:23.992582
Title: Exploring Active 3D Object Detection from a Generalization Perspective
Title（参考訳）: 一般化の視点からのアクティブ3次元物体検出の探索
Authors: Yadan Luo, Zhuoxiao Chen, Zijian Wang, Xin Yu, Zi Huang, Mahsa Baktashmotlagh
Abstract要約: 不確実性に基づくアクティブな学習ポリシーは、ポイントクラウドの情報性とボックスレベルのアノテーションコストの間のトレードオフのバランスを取れません。冗長な3次元境界ボックスラベルの点群を階層的にフィルタリングするtextscCrbを提案する。実験により,提案手法が既存のアクティブラーニング戦略より優れていることが示された。
参考スコア（独自算出の注目度）: 58.597942380989245
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: To alleviate the high annotation cost in LiDAR-based 3D object detection, active learning is a promising solution that learns to select only a small portion of unlabeled data to annotate, without compromising model performance. Our empirical study, however, suggests that mainstream uncertainty-based and diversity-based active learning policies are not effective when applied in the 3D detection task, as they fail to balance the trade-off between point cloud informativeness and box-level annotation costs. To overcome this limitation, we jointly investigate three novel criteria in our framework Crb for point cloud acquisition - label conciseness}, feature representativeness and geometric balance, which hierarchically filters out the point clouds of redundant 3D bounding box labels, latent features and geometric characteristics (e.g., point cloud density) from the unlabeled sample pool and greedily selects informative ones with fewer objects to annotate. Our theoretical analysis demonstrates that the proposed criteria align the marginal distributions of the selected subset and the prior distributions of the unseen test set, and minimizes the upper bound of the generalization error. To validate the effectiveness and applicability of \textsc{Crb}, we conduct extensive experiments on the two benchmark 3D object detection datasets of KITTI and Waymo and examine both one-stage (\textit{i.e.}, \textsc{Second}) and two-stage 3D detectors (i.e., Pv-rcnn). Experiments evidence that the proposed approach outperforms existing active learning strategies and achieves fully supervised performance requiring $1\%$ and $8\%$ annotations of bounding boxes and point clouds, respectively. Source code: https://github.com/Luoyadan/CRB-active-3Ddet.
Abstract（参考訳）: LiDARベースの3Dオブジェクト検出における高アノテーションコストを軽減するため、アクティブラーニングは、モデル性能を損なうことなく、少量のラベルなしデータをアノテートするように選択する、有望なソリューションである。しかし,本研究では,不確実性と多様性に基づくアクティブ・ラーニング・ポリシーは,ポイント・クラウド・インフォメーションとボックスレベルのアノテーションコストのトレードオフのバランスが取れないため,3d検出タスクに適用しても有効ではないことが示唆された。この制限を克服するために,我々のフレームワーク Crb for point cloud acquisition - label conciseness}, feature representativeness and geometry balance の3つの新しい基準を共同で検討した。理論的解析により,提案基準は選択された部分集合の限界分布と未確認テスト集合の先行分布とを一致させ,一般化誤差の上限を最小化することを示した。 KITTI と Waymo の2つのベンチマーク 3D オブジェクト検出データセットにおいて,その有効性と適用性を検証するため,1段目 (\textit{i.e.}, \textsc{Second}) と2段目 (Pv-rcnn) の2つの3D検出器 (Pv-rcnn) の2つの実験を行った。実験によると、提案手法は既存のアクティブな学習戦略より優れており、それぞれ境界ボックスと点クラウドのアノテーションを1\%と8\%で完全に教師付きのパフォーマンスを達成する。ソースコード:https://github.com/Luoyadan/CRB-active-3Ddet。

論文の概要: Exploring Active 3D Object Detection from a Generalization Perspective

関連論文リスト