Fugu-MT 論文翻訳(概要): Learning 3D Perception from Others' Predictions

論文の概要: Learning 3D Perception from Others' Predictions

arxiv url: http://arxiv.org/abs/2410.02646v2
Date: Fri, 4 Oct 2024 16:35:32 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-04 01:52:35.812088
Title: Learning 3D Perception from Others' Predictions
Title（参考訳）: 他人の予測から3次元知覚を学ぶ
Authors: Jinsu Yoo, Zhenyang Feng, Tai-Yu Pan, Yihong Sun, Cheng Perng Phoo, Xiangyu Chen, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao,
Abstract要約: 本研究では,3次元物体検出装置を構築するための新たなシナリオについて検討する。例えば、自動運転車が新しいエリアに入ると、その領域に最適化された検出器を持つ他の交通参加者から学ぶことができる。
参考スコア（独自算出の注目度）: 64.09115694891679
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Accurate 3D object detection in real-world environments requires a huge amount of annotated data with high quality. Acquiring such data is tedious and expensive, and often needs repeated effort when a new sensor is adopted or when the detector is deployed in a new environment. We investigate a new scenario to construct 3D object detectors: learning from the predictions of a nearby unit that is equipped with an accurate detector. For example, when a self-driving car enters a new area, it may learn from other traffic participants whose detectors have been optimized for that area. This setting is label-efficient, sensor-agnostic, and communication-efficient: nearby units only need to share the predictions with the ego agent (e.g., car). Naively using the received predictions as ground-truths to train the detector for the ego car, however, leads to inferior performance. We systematically study the problem and identify viewpoint mismatches and mislocalization (due to synchronization and GPS errors) as the main causes, which unavoidably result in false positives, false negatives, and inaccurate pseudo labels. We propose a distance-based curriculum, first learning from closer units with similar viewpoints and subsequently improving the quality of other units' predictions via self-training. We further demonstrate that an effective pseudo label refinement module can be trained with a handful of annotated data, largely reducing the data quantity necessary to train an object detector. We validate our approach on the recently released real-world collaborative driving dataset, using reference cars' predictions as pseudo labels for the ego car. Extensive experiments including several scenarios (e.g., different sensors, detectors, and domains) demonstrate the effectiveness of our approach toward label-efficient learning of 3D perception from other units' predictions.
Abstract（参考訳）: 実環境における高精度な3Dオブジェクト検出には,高品質な大量の注釈付きデータが必要である。このようなデータを取得するのは面倒で費用がかかるため、新しいセンサーが採用されたり、検出器が新しい環境にデプロイされたりする際には、繰り返し作業が必要になることが多い。本研究では,3次元物体検出装置を構築するための新たなシナリオについて検討する。例えば、自動運転車が新しいエリアに入ると、その領域に最適化された検出器を持つ他の交通参加者から学ぶことができる。この設定はラベル効率、センサ非依存、通信効率が高い:近くのユニットは予測をエゴエージェント(例えば車)と共有するだけでよい。しかし、受信した予測を地絡として、エゴ車の検知器を訓練することは、性能の低下につながる。本研究は, 疑似陽性, 偽陰性, 不正確な擬似ラベルが生じる主な原因として, 問題を体系的に検討し, 視点ミスマッチと(同期やGPSエラーによる)位置ずれを同定する。距離に基づくカリキュラムを提案し、まず、類似した視点で近接した単位から学習し、その後、自己学習によって他の単位の予測の質を向上させる。さらに、有効な擬似ラベルリファインメントモジュールを少数の注釈付きデータでトレーニングできることを示し、オブジェクト検出器のトレーニングに必要なデータ量を大幅に削減する。我々は、エゴカーの擬似ラベルとして参照車の予測を用いて、最近リリースされた実世界の協調運転データセットに対するアプローチを検証する。いくつかのシナリオ(センサ、検出器、ドメインなど)を含む広範囲な実験は、他のユニットの予測から3D知覚をラベル効率よく学習するアプローチの有効性を実証している。

論文の概要: Learning 3D Perception from Others' Predictions

関連論文リスト