Fugu-MT 論文翻訳(概要): UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection

論文の概要: UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection

arxiv url: http://arxiv.org/abs/2603.27995v1
Date: Mon, 30 Mar 2026 03:35:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:45.212772
Title: UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection
Title（参考訳）: UniDA3D:マルチビュー3Dオブジェクト検出のための統一ドメイン適応フレームワーク
Authors: Hongjing Wu, Cheng Chi, Jinlin Wu, Yanzhao Su, Zhen Lei, Wenqi Ren,
Abstract要約: カメラのみの3Dオブジェクト検出は自動運転にとって重要であり、LiDARベースの方法に代わる費用対効果を提供する。既存の手法は、夜間、霧、雨などの複雑な環境条件下では大きな性能劣化を経験することが多い。種々の有害条件下での堅牢な認識を目的としたドメイン適応型多視点3Dオブジェクト検出器UniDA3Dを提案する。
参考スコア（独自算出の注目度）: 59.35818507180513
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Camera-only 3D object detection is critical for autonomous driving, offering a cost-effective alternative to LiDAR based methods. In particular, multi-view 3D object detection has emerged as a promising direction due to its balanced trade-off between performance and cost. However, existing methods often suffer significant performance degradation under complex environmental conditions such as nighttime, fog, and rain, primarily due to their reliance on training data collected mostly in ideal conditions. To address this challenge, we propose UniDA3D, a unified domain-adaptive multi-view 3D object detector designed for robust perception under diverse adverse conditions. UniDA3D formulates nighttime, rainy, and foggy scenes as a unified multi target domain adaptation problem and leverages a novel query guided domain discrepancy mitigation (QDDM) module to align object features between source and target domains at both batch and global levels via query-centric adversarial and contrastive learning. Furthermore, we introduce a domain-adaptive teacher student training pipeline with an exponential-moving-average teacher and dynamically updated high-quality pseudo labels to enhance consistency learning and suppress background noise in unlabeled target domains. In contrast to prior approaches that require separate training for each condition, UniDA3D performs a single unified training process across multiple domains, enabling robust all-weather 3D perception. On a synthesized multi-view 3D benchmark constructed by generating nighttime, rainy, and foggy counterparts from nuScenes (nuScenes-Night, nuScenes-Rain, and nuScenes-Haze), UniDA3D consistently outperforms state of-the-art camera-only multi-view 3D detectors under extreme conditions, achieving substantial gains in mAP and NDS while maintaining real-time inference efficiency.
Abstract（参考訳）: カメラのみの3Dオブジェクト検出は自動運転にとって重要であり、LiDARベースの方法に代わる費用対効果を提供する。特に、多視点3Dオブジェクト検出は、性能とコストのバランスの取れたトレードオフのため、将来性のある方向として現れている。しかし, 従来手法では, 夜間, 霧, 雨などの複雑な環境条件下では, 主に理想的な条件で収集したトレーニングデータに依存するため, 顕著な性能劣化がみられがちである。この課題に対処するために、様々な有害条件下での堅牢な認識を目的としたドメイン適応型多視点3Dオブジェクト検出器UniDA3Dを提案する。 UniDA3Dは、一貫したマルチターゲットドメイン適応問題として、夜間、雨、霧のシーンを定式化し、新しいクエリガイド付きドメイン離散化(QDDM)モジュールを利用して、クエリ中心の対角学習とコントラスト学習を通じて、ソースドメインとターゲットドメインのオブジェクト特徴をバッチレベルとグローバルレベルで整列させる。さらに、指数関数的な移動平均教師と動的に更新された高品質な擬似ラベルを備えたドメイン適応型教員養成パイプラインを導入し、一貫性学習を強化し、ラベルなし対象ドメインの背景雑音を抑制する。それぞれの条件に対する個別のトレーニングを必要とする従来のアプローチとは対照的に、UniDA3Dは複数のドメインにまたがる単一の統一的なトレーニングプロセスを実行し、堅牢な全天候の3D知覚を可能にする。 NUScenes-Night, nuScenes-Rain, nuScenes-Hazeから夜間, 雨, 霧を発生させた合成マルチビュー3Dベンチマークにおいて、UniDA3Dは、極端条件下での最先端カメラのみのマルチビュー3D検出器の状態を一貫して上回り、リアルタイム推論効率を維持しつつ、mAPとNDSの大幅な向上を実現している。

論文の概要: UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection

関連論文リスト