Investigating Domain Gaps for Indoor 3D Object Detection
- URL: http://arxiv.org/abs/2508.17439v2
- Date: Mon, 01 Sep 2025 07:43:03 GMT
- Title: Investigating Domain Gaps for Indoor 3D Object Detection
- Authors: Zijing Zhao, Zhu Xu, Qingchao Chen, Yuxin Peng, Yang Liu
- Abstract summary: We consider the task of adapting indoor 3D object detectors from one dataset to another. In this paper, we present a benchmark with ScanNet, SUN RGB-D and 3D Front datasets, as well as our newly proposed large-scale datasets ProcTHOR-OD and ProcFront. We conduct experiments on different adaptation scenarios, including synthetic-to-real adaptation, point cloud quality adaptation, layout adaptation and instance feature adaptation, analyzing the impact of different domain gaps on 3D object detectors.
- Score: 60.55242233729081
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As a fundamental task for indoor scene understanding, 3D object detection has been extensively studied, and accuracy on indoor point cloud data has improved substantially. However, existing research has been conducted on limited datasets, where the training and testing sets share the same distribution. In this paper, we consider the task of adapting indoor 3D object detectors from one dataset to another, presenting a comprehensive benchmark with the ScanNet, SUN RGB-D and 3D Front datasets, as well as our newly proposed large-scale datasets ProcTHOR-OD and ProcFront generated by a 3D simulator. Since indoor point cloud datasets are collected and constructed in different ways, object detectors are likely to overfit to specific factors within each dataset, such as point cloud quality, bounding box layout and instance features. We conduct experiments across datasets on different adaptation scenarios, including synthetic-to-real adaptation, point cloud quality adaptation, layout adaptation and instance feature adaptation, analyzing the impact of different domain gaps on 3D object detectors. We also introduce several approaches to improve adaptation performance, providing baselines for domain adaptive indoor 3D object detection, in the hope that future work will propose detectors with stronger generalization ability across domains. Our project homepage can be found at https://jeremyzhao1998.github.io/DAVoteNet-release/.
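The cross-dataset setting the abstract describes can be pictured as a train-on-source, test-on-target matrix over the benchmark's datasets. The sketch below is purely illustrative (not the paper's code): the dataset names are real, but the detector and mAP values are stand-ins used only to show how the diagonal (in-domain) and off-diagonal (cross-domain) entries expose the domain gap.

```python
# Illustrative cross-dataset evaluation protocol: train a detector on each
# source dataset, then evaluate it on every target dataset.
DATASETS = ["ScanNet", "SUN RGB-D", "3D Front", "ProcTHOR-OD", "ProcFront"]

def train(source):
    """Placeholder: returns a 'detector' tagged with its training domain."""
    return {"trained_on": source}

def evaluate(detector, target):
    """Placeholder mAP: in-domain scores are typically higher than
    cross-domain ones, which is exactly the gap the benchmark measures."""
    return 0.60 if detector["trained_on"] == target else 0.35

def benchmark():
    results = {}
    for source in DATASETS:
        detector = train(source)
        for target in DATASETS:
            results[(source, target)] = evaluate(detector, target)
    return results

results = benchmark()
# The diagonal (source == target) is the in-domain reference;
# off-diagonal entries quantify the domain gap.
gap = results[("ScanNet", "ScanNet")] - results[("ScanNet", "SUN RGB-D")]
```

A real run would substitute an actual detector (e.g. a VoteNet-style model) and mAP evaluation for the two placeholders, leaving the double loop unchanged.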
Related papers
- Domain Adaptation for Different Sensor Configurations in 3D Object Detection [1.4566410781522745]
We address domain adaptation across different sensor configurations in 3D object detection. We propose two techniques: Downstream Fine-tuning and Partial Layer Fine-tuning. Our findings provide a practical and scalable solution for adapting 3D object detection models to diverse vehicle platforms.
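"Partial Layer Fine-tuning" as summarized above amounts to updating only a subset of a pretrained detector's layers on the new sensor configuration. A minimal framework-free sketch, assuming a head-only update (the layer names and the dict-based "model" are illustrative, not the authors' code):

```python
# Hypothetical partial layer fine-tuning: freeze the backbone of a
# pretrained detector and mark only the detection head as trainable.
model = {
    "backbone.conv1": {"trainable": True},
    "backbone.conv2": {"trainable": True},
    "head.cls": {"trainable": True},
    "head.box": {"trainable": True},
}

def partial_layer_finetune(model, trainable_prefix="head."):
    """Mark only layers whose name starts with `trainable_prefix` as
    trainable; everything else is frozen."""
    for name, layer in model.items():
        layer["trainable"] = name.startswith(trainable_prefix)
    return [name for name, layer in model.items() if layer["trainable"]]

trainable = partial_layer_finetune(model)
# Only head parameters would receive gradient updates; the backbone keeps
# the representation learned on the original sensor configuration.
```

In a deep learning framework the same idea is expressed by setting `requires_grad` (or the equivalent flag) per parameter before constructing the optimizer.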
arXiv Detail & Related papers (2025-09-04T23:54:25Z) - Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection [50.448520056844885]
We propose a novel framework for syn-to-real unsupervised domain adaptation in indoor 3D object detection.
Our adaptation results from synthetic dataset 3D-FRONT to real-world datasets ScanNetV2 and SUN RGB-D demonstrate remarkable mAP25 improvements of 9.7% and 9.1% over Source-Only baselines.
arXiv Detail & Related papers (2024-06-17T08:18:41Z) - M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection [2.5158048364984564]
I propose a network structure for multi-view 3D object detection using camera-only data and a Bird's-Eye-View map.
My work addresses the key challenges of domain adaptation and visual data transfer.
My study utilizes 3D information as semantic cues and blends 2D multi-view image features into the visual-language transfer design.
arXiv Detail & Related papers (2023-11-02T04:28:51Z) - Uni3DETR: Unified 3D Detection Transformer [75.35012428550135]
We propose a unified 3D detector that addresses indoor and outdoor detection within the same framework.
Specifically, we employ the detection transformer with point-voxel interaction for object prediction.
We then propose the mixture of query points, which sufficiently exploits global information for dense small-range indoor scenes and local information for large-range sparse outdoor ones.
arXiv Detail & Related papers (2023-10-09T13:20:20Z) - MDT3D: Multi-Dataset Training for LiDAR 3D Object Detection Generalization [3.8243923744440926]
3D object detection models trained on a source dataset with a specific point distribution have shown difficulties in generalizing to unseen datasets.
We leverage the information available from several annotated source datasets with our Multi-Dataset Training for 3D Object Detection (MDT3D) method.
We show how we managed the mix of datasets during training and finally introduce a new cross-dataset augmentation method: cross-dataset object injection.
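The "cross-dataset object injection" augmentation mentioned above can be sketched as copying annotated object instances from one dataset's bank into another dataset's scenes during training, so the detector sees mixed point distributions. Everything below is a simplified stand-in (the scene/object dicts and dataset names are illustrative, not the MDT3D implementation):

```python
import random

def inject_objects(scene, object_bank, n=2, rng=None):
    """Copy up to `n` labeled objects from a foreign dataset's object bank
    into `scene`, tagging them with an `injected` flag for bookkeeping."""
    rng = rng or random.Random(0)  # seeded for reproducibility
    injected = rng.sample(object_bank, k=min(n, len(object_bank)))
    return scene + [dict(obj, injected=True) for obj in injected]

# Toy stand-ins: a scene from one dataset, an object bank from another.
source_scene = [{"label": "car", "injected": False}]
foreign_bank = [{"label": "truck"}, {"label": "cyclist"}, {"label": "bus"}]

augmented = inject_objects(source_scene, foreign_bank, n=2)
```

In a real pipeline the injected entries would be cropped point clusters with 3D boxes, placed at collision-free locations in the target scene; here only the bookkeeping structure is shown.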
arXiv Detail & Related papers (2023-08-02T08:20:00Z) - SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud [125.9472454212909]
We present a novel Semi-Supervised Domain Adaptation method for 3D object detection (SSDA3D).
SSDA3D includes an Inter-domain Adaptation stage and an Intra-domain Generalization stage.
Experiments show that, with only 10% labeled target data, our SSDA3D can surpass the fully-supervised oracle model with 100% target label.
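The "10% labeled target data" setting quoted above defines the semi-supervised split: a small labeled pool and a large unlabeled pool drawn from the target domain. A trivial sketch of that split (scene names and the fraction handling are illustrative, not SSDA3D code):

```python
def split_target(scenes, labeled_fraction=0.1):
    """Split target-domain scenes into a small labeled pool and a large
    unlabeled pool, as in the semi-supervised adaptation setting."""
    n_labeled = max(1, int(len(scenes) * labeled_fraction))
    return scenes[:n_labeled], scenes[n_labeled:]

target_scenes = [f"scene_{i:03d}" for i in range(100)]
labeled, unlabeled = split_target(target_scenes)
# The inter-domain stage would mix source data with `labeled`; the
# intra-domain stage would exploit `unlabeled` via pseudo-labeling.
```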
arXiv Detail & Related papers (2022-12-06T09:32:44Z) - Viewer-Centred Surface Completion for Unsupervised Domain Adaptation in 3D Object Detection [7.489722641968593]
3D detectors tend to overfit datasets they are trained on. This causes a drastic decrease in accuracy when the detectors are trained on one dataset and tested on another.
We address this in our approach, SEE-VCN, by designing a novel viewer-centred surface completion network (VCN).
With SEE-VCN, we obtain a unified representation of objects across datasets, allowing the network to focus on learning geometry, rather than overfitting on scan patterns.
arXiv Detail & Related papers (2022-09-14T04:22:20Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - ST3D: Self-training for Unsupervised Domain Adaptation on 3D
ObjectDetection [78.71826145162092]
We present a new domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds.
Our ST3D achieves state-of-the-art performance on all evaluated datasets and even surpasses fully supervised results on KITTI 3D object detection benchmark.
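The self-training pipeline summarized above follows a familiar loop: a source-pretrained detector predicts on unlabeled target scenes, high-confidence predictions are kept as pseudo-labels, and the detector is retrained on them. The sketch below shows that loop only in skeleton form, with toy confidence scores and placeholder predict/retrain steps (it is not the ST3D implementation, which adds components such as quality-aware pseudo-label refinement):

```python
def predict(detector, scene):
    """Placeholder: returns (box, confidence) pairs for a target scene."""
    return [("box_a", 0.92), ("box_b", 0.40), ("box_c", 0.75)]

def select_pseudo_labels(predictions, threshold=0.7):
    """Keep only confident predictions as pseudo ground truth."""
    return [box for box, score in predictions if score >= threshold]

def retrain(detector, pseudo_labels):
    """Placeholder update: a real step would fine-tune on pseudo-labels."""
    return {"pseudo_boxes": sum(len(v) for v in pseudo_labels.values())}

def self_train(detector, target_scenes, rounds=2):
    # Alternate pseudo-label generation and retraining for a few rounds.
    for _ in range(rounds):
        pseudo = {s: select_pseudo_labels(predict(detector, s))
                  for s in target_scenes}
        detector = retrain(detector, pseudo)
    return detector

detector = self_train({"pretrained_on": "source"}, ["scene1", "scene2"])
```

The confidence threshold is the critical knob: too low admits noisy labels that the loop then amplifies, too high starves retraining of examples.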
arXiv Detail & Related papers (2021-03-09T10:51:24Z) - 1st Place Solution for Waymo Open Dataset Challenge -- 3D Detection and
Domain Adaptation [7.807118356899879]
We propose a one-stage, anchor-free and NMS-free 3D point cloud object detector AFDet.
AFDet serves as a strong baseline in our winning solution.
We design stronger networks and enhance the point cloud data using densification and point painting.
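"Point painting" as mentioned above decorates each LiDAR point with class scores from an image segmentation network, looked up at the point's projected pixel. The sketch below fakes the projection and segmentation lookup (both stand-ins, not the authors' code) and shows only the feature concatenation that gives the technique its name:

```python
def paint_points(points, class_scores_for):
    """Return each xyz point with per-class semantic scores appended."""
    return [tuple(p) + tuple(class_scores_for(p)) for p in points]

def fake_scores(point):
    # Stand-in for the segmentation scores at the point's projected pixel:
    # (vehicle, pedestrian, background). A real pipeline would project the
    # 3D point into the camera image and sample the segmentation output.
    x, y, z = point
    return (0.8, 0.1, 0.1) if z < 1.0 else (0.1, 0.2, 0.7)

painted = paint_points([(1.0, 2.0, 0.5), (3.0, 1.0, 2.0)], fake_scores)
# Each painted point now carries 3 coordinates + 3 semantic channels,
# which the 3D detector consumes as enriched input features.
```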
arXiv Detail & Related papers (2020-06-28T04:49:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of the information it provides and is not responsible for any consequences of its use.