Toward unlabeled multi-view 3D pedestrian detection by generalizable AI:
techniques and performance analysis
- URL: http://arxiv.org/abs/2308.04515v1
- Date: Tue, 8 Aug 2023 18:24:53 GMT
- Title: Toward unlabeled multi-view 3D pedestrian detection by generalizable AI:
techniques and performance analysis
- Authors: Jo\~ao Paulo Lima, Diego Thomas, Hideaki Uchiyama, Veronica Teichrieb
- Abstract summary: Generalizable AI can be used to improve multi-view 3D pedestrian detection in unlabeled target scenes.
We investigate two approaches for automatically labeling target data: pseudo-labeling using a supervised detector and automatic labeling using an untrained detector.
We show that, by using the automatic labeling approach based on an untrained detector, we can obtain superior results than directly using the untrained detector or a detector trained with an existing labeled source dataset.
- Score: 7.414308466976969
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We unveil how generalizable AI can be used to improve multi-view 3D
pedestrian detection in unlabeled target scenes. One way to increase
generalization to new scenes is to automatically label target data, which can
then be used for training a detector model. In this context, we investigate two
approaches for automatically labeling target data: pseudo-labeling using a
supervised detector and automatic labeling using an untrained detector (that
can be applied out of the box without any training). We adopt a training
framework for optimizing detector models using automatic labeling procedures.
This framework encompasses different training sets/modes and multi-round
automatic labeling strategies. We conduct our analyses on the
publicly-available WILDTRACK and MultiviewX datasets. We show that, by using
the automatic labeling approach based on an untrained detector, we can obtain
superior results than directly using the untrained detector or a detector
trained with an existing labeled source dataset. It achieved a MODA about 4%
and 1% better than the best existing unlabeled method when using WILDTRACK and
MultiviewX as target datasets, respectively.
Related papers
- TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection [59.498894868956306]
Pseudo-labeling approaches to semi-supervised learning adopt a teacher-student framework.
We leverage pre-trained motion-forecasting models to generate object trajectories on pseudo-labeled data.
Our approach improves pseudo-label quality in two distinct manners.
arXiv Detail & Related papers (2024-09-17T05:35:00Z) - Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection [16.09503890891102]
We propose an unsupervised 3D detection approach that operates exclusively on LiDAR point clouds.
We exploit the inherent CLI-temporal knowledge of LiDAR point clouds for clustering, tracking, as well as boxtext and label refinement.
Our approach outperforms state-of-the-art unsupervised 3D object detectors on the Open dataset.
arXiv Detail & Related papers (2024-08-07T14:14:53Z) - Multi-Class 3D Object Detection with Single-Class Supervision [34.216636233945856]
Training multi-class 3D detectors with fully labeled datasets can be expensive.
An alternative approach is to have targeted single-class labels on disjoint data samples.
In this paper, we are interested in training a multi-class 3D object detection model, while using single-class labeled data.
arXiv Detail & Related papers (2022-05-11T18:00:05Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D
Object Detection [78.71826145162092]
We present a self-training method, named ST3D++, with a holistic pseudo label denoising pipeline for unsupervised domain adaptation on 3D object detection.
We equip the pseudo label generation process with a hybrid quality-aware triplet memory to improve the quality and stability of generated pseudo labels.
In the model training stage, we propose a source data assisted training strategy and a curriculum data augmentation policy.
arXiv Detail & Related papers (2021-08-15T07:49:06Z) - Self-Supervised Person Detection in 2D Range Data using a Calibrated
Camera [83.31666463259849]
We propose a method to automatically generate training labels (called pseudo-labels) for 2D LiDAR-based person detectors.
We show that self-supervised detectors, trained or fine-tuned with pseudo-labels, outperform detectors trained using manual annotations.
Our method is an effective way to improve person detectors during deployment without any additional labeling effort.
arXiv Detail & Related papers (2020-12-16T12:10:04Z) - Move to See Better: Self-Improving Embodied Object Detection [35.461141354989714]
We propose a method for improving object detection in testing environments.
Our agent collects multi-view data, generates 2D and 3D pseudo-labels, and fine-tunes its detector in a self-supervised manner.
arXiv Detail & Related papers (2020-11-30T19:16:51Z) - Manual-Label Free 3D Detection via An Open-Source Simulator [50.74299948748722]
We propose a manual-label free 3D detection algorithm that leverages the CARLA simulator to generate a large amount of self-labeled training samples.
Domain Adaptive VoxelNet (DA-VoxelNet) can cross the distribution gap from the synthetic data to the real scenario.
Experimental results show that the proposed unsupervised DA 3D detector can achieve 76.66% and 56.64% mAP on KITTI evaluation set.
arXiv Detail & Related papers (2020-11-16T08:29:01Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.