COLA: COarse LAbel pre-training for 3D semantic segmentation of sparse
LiDAR datasets
- URL: http://arxiv.org/abs/2202.06884v3
- Date: Tue, 21 Mar 2023 07:12:46 GMT
- Title: COLA: COarse LAbel pre-training for 3D semantic segmentation of sparse
LiDAR datasets
- Authors: Jules Sanchez, Jean-Emmanuel Deschaud and Fran\c{c}ois Goulette
- Abstract summary: Transfer learning is a proven technique in 2D computer vision to leverage the large amount of data available and achieve high performance.
In this work, we tackle the case of real-time 3D semantic segmentation of sparse autonomous driving LiDAR scans.
We introduce a new pre-training task: coarse label pre-training, also called COLA.
- Score: 3.8243923744440926
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Transfer learning is a proven technique in 2D computer vision to leverage the
large amount of data available and achieve high performance with datasets
limited in size due to the cost of acquisition or annotation. In 3D, annotation
is known to be a costly task; nevertheless, pre-training methods have only
recently been investigated. Due to this cost, unsupervised pre-training has
been heavily favored. In this work, we tackle the case of real-time 3D semantic
segmentation of sparse autonomous driving LiDAR scans. Such datasets have been
increasingly released, but each has a unique label set. We propose here an
intermediate-level label set called coarse labels, which can easily be used on
any existing and future autonomous driving datasets, thus allowing all the data
available to be leveraged at once without any additional manual labeling. This
way, we have access to a larger dataset, alongside a simple task of semantic
segmentation. With it, we introduce a new pre-training task: coarse label
pre-training, also called COLA. We thoroughly analyze the impact of COLA on
various datasets and architectures and show that it yields a noticeable
performance improvement, especially when only a small dataset is available for
the finetuning task.
Related papers
- Discwise Active Learning for LiDAR Semantic Segmentation [92.92714645987083]
Active learning (AL) provides a solution that can iteratively and intelligently label a dataset.
We propose a discwise approach (DiAL) where in each iteration, we query the region a single frame covers on global coordinates, labeling all frames simultaneously.
arXiv Detail & Related papers (2023-09-23T06:08:22Z) - SPOT: Scalable 3D Pre-training via Occupancy Prediction for Learning Transferable 3D Representations [76.45009891152178]
Pretraining-finetuning approach can alleviate the labeling burden by fine-tuning a pre-trained backbone across various downstream datasets as well as tasks.
We show, for the first time, that general representations learning can be achieved through the task of occupancy prediction.
Our findings will facilitate the understanding of LiDAR points and pave the way for future advancements in LiDAR pre-training.
arXiv Detail & Related papers (2023-09-19T11:13:01Z) - Label Name is Mantra: Unifying Point Cloud Segmentation across
Heterogeneous Datasets [17.503843467554592]
We propose a principled approach that supports learning from heterogeneous datasets with different label sets.
Our idea is to utilize a pre-trained language model to embed discrete labels to a continuous latent space with the help of their label names.
Our model outperforms the state-of-the-art by a large margin.
arXiv Detail & Related papers (2023-03-19T06:14:22Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds [62.49198183539889]
We propose a label-efficient semantic segmentation pipeline for outdoor scenes with LiDAR point clouds.
Our method co-designs an efficient labeling process with semi/weakly supervised learning.
Our proposed method is even highly competitive compared to the fully supervised counterpart with 100% labels.
arXiv Detail & Related papers (2022-10-14T19:13:36Z) - Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled
Primitives [44.03149443379618]
We propose a cost-effective method for automatically generating a large amount of 3D objects with annotations.
These objects are auto-annotated with part labels originating from primitives.
Considering the large overhead of learning on the generated dataset, we propose a dataset distillation strategy.
arXiv Detail & Related papers (2022-05-25T10:07:07Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - PointContrast: Unsupervised Pre-training for 3D Point Cloud
Understanding [107.02479689909164]
In this work, we aim at facilitating research on 3D representation learning.
We measure the effect of unsupervised pre-training on a large source set of 3D scenes.
arXiv Detail & Related papers (2020-07-21T17:59:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.