Resolving Class Imbalance for LiDAR-based Object Detector by Dynamic
Weight Average and Contextual Ground Truth Sampling
- URL: http://arxiv.org/abs/2210.03331v1
- Date: Fri, 7 Oct 2022 05:23:25 GMT
- Title: Resolving Class Imbalance for LiDAR-based Object Detector by Dynamic
Weight Average and Contextual Ground Truth Sampling
- Authors: Daeun Lee, Jongwon Park, Jinkyu Kim
- Abstract summary: Real-world driving datasets often suffer from the problem of data imbalance.
We propose a method to address this data imbalance problem.
Our experiment with KITTI and nuScenes datasets confirms our proposed method's effectiveness.
- Score: 7.096611243139798
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: An autonomous driving system requires a 3D object detector, which must
perceive all present road agents reliably to navigate an environment safely.
However, real-world driving datasets often suffer from the problem of data
imbalance, which causes difficulties in training a model that works well across
all classes, resulting in an undesired imbalanced sub-optimal performance. In
this work, we propose a method to address this data imbalance problem. Our
method consists of two main components: (i) a LiDAR-based 3D object detector
with per-class multiple detection heads where losses from each head are
modified by dynamic weight average to be balanced. (ii) Contextual ground truth
(GT) sampling, where we improve conventional GT sampling techniques by
leveraging semantic information to augment point cloud with sampled ground
truth GT objects. Our experiment with KITTI and nuScenes datasets confirms our
proposed method's effectiveness in dealing with the data imbalance problem,
producing better detection accuracy compared to existing approaches.
Related papers
- TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection [59.498894868956306]
Pseudo-labeling approaches to semi-supervised learning adopt a teacher-student framework.
We leverage pre-trained motion-forecasting models to generate object trajectories on pseudo-labeled data.
Our approach improves pseudo-label quality in two distinct manners.
arXiv Detail & Related papers (2024-09-17T05:35:00Z) - Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene [22.297964850282177]
We propose LiDAR-2D Self-paced Learning (LiSe) for unsupervised 3D detection.
RGB images serve as a valuable complement to LiDAR data, offering precise 2D localization cues.
Our framework devises a self-paced learning pipeline that incorporates adaptive sampling and weak model aggregation strategies.
arXiv Detail & Related papers (2024-07-11T14:58:49Z) - Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection [111.0991686509715]
We study the class imbalance problem for semi-supervised object detection (SSOD) under more challenging scenarios.
We propose a simple yet effective gradient-based sampling framework that tackles the class imbalance problem from the perspective of two types of confirmation biases.
Experiments on three proposed sub-tasks, namely MS-COCO, MS-COCO to Object365 and LVIS, suggest that our method outperforms current class imbalanced object detectors by clear margins.
arXiv Detail & Related papers (2024-03-22T11:30:10Z) - Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection [59.41026558455904]
We focus on multi-modal anomaly detection. Specifically, we investigate early multi-modal approaches that attempted to utilize models pre-trained on large-scale visual datasets.
We propose a Local-to-global Self-supervised Feature Adaptation (LSFA) method to finetune the adaptors and learn task-oriented representation toward anomaly detection.
arXiv Detail & Related papers (2024-01-06T07:30:41Z) - Real-Time Object Detection in Occluded Environment with Background
Cluttering Effects Using Deep Learning [0.8192907805418583]
We concentrate on deep learning models for real-time detection of cars and tanks in an occluded environment with a cluttered background.
The developed method makes the custom dataset and employs a preprocessing technique to clean the noisy dataset.
The accuracy and frame per second of the SSD-Mobilenet v2 model are higher than YOLO V3 and YOLO V4.
arXiv Detail & Related papers (2024-01-02T01:30:03Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - DuEqNet: Dual-Equivariance Network in Outdoor 3D Object Detection for
Autonomous Driving [4.489333751818157]
We propose DuEqNet, which first introduces the concept of equivariance into 3D object detection network.
The dual-equivariant of our model can extract the equivariant features at both local and global levels.
Our model presents higher accuracy on orientation and better prediction efficiency.
arXiv Detail & Related papers (2023-02-27T08:30:02Z) - Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object
Detection [58.81316192862618]
Two critical sensors for 3D perception in autonomous driving are the camera and the LiDAR.
fusing these two modalities can significantly boost the performance of 3D perception models.
We benchmark the state-of-the-art fusion methods for the first time.
arXiv Detail & Related papers (2022-05-30T09:35:37Z) - SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural
Networks [81.64530401885476]
We propose a self-supervised LiDAR odometry method, dubbed SelfVoxeLO, to tackle these two difficulties.
Specifically, we propose a 3D convolution network to process the raw LiDAR data directly, which extracts features that better encode the 3D geometric patterns.
We evaluate our method's performances on two large-scale datasets, i.e., KITTI and Apollo-SouthBay.
arXiv Detail & Related papers (2020-10-19T09:23:39Z) - Leveraging Uncertainties for Deep Multi-modal Object Detection in
Autonomous Driving [12.310862288230075]
This work presents a probabilistic deep neural network that combines LiDAR point clouds and RGB camera images for robust, accurate 3D object detection.
We explicitly model uncertainties in the classification and regression tasks, and leverage uncertainties to train the fusion network via a sampling mechanism.
arXiv Detail & Related papers (2020-02-01T14:24:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.