PointSSC: A Cooperative Vehicle-Infrastructure Point Cloud Benchmark for
Semantic Scene Completion
- URL: http://arxiv.org/abs/2309.12708v2
- Date: Thu, 7 Mar 2024 02:50:04 GMT
- Title: PointSSC: A Cooperative Vehicle-Infrastructure Point Cloud Benchmark for
Semantic Scene Completion
- Authors: Yuxiang Yan, Boda Liu, Jianfei Ai, Qinbu Li, Ru Wan, Jian Pu
- Abstract summary: Semantic Scene Completion aims to jointly generate space occupancies and semantic labels for complex 3D scenes.
PointSSC is the first cooperative vehicle-infrastructure point cloud benchmark for semantic scene completion.
- Score: 4.564209472726044
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic Scene Completion (SSC) aims to jointly generate space occupancies
and semantic labels for complex 3D scenes. Most existing SSC models focus on
volumetric representations, which are memory-inefficient for large outdoor
spaces. Point clouds provide a lightweight alternative but existing benchmarks
lack outdoor point cloud scenes with semantic labels. To address this, we
introduce PointSSC, the first cooperative vehicle-infrastructure point cloud
benchmark for semantic scene completion. These scenes exhibit long-range
perception and minimal occlusion. We develop an automated annotation pipeline
leveraging Semantic Segment Anything to efficiently assign semantics. To
benchmark progress, we propose a LiDAR-based model with a Spatial-Aware
Transformer for global and local feature extraction and a Completion and
Segmentation Cooperative Module for joint completion and segmentation. PointSSC
provides a challenging testbed to drive advances in semantic point cloud
completion for real-world navigation. The code and datasets are available at
https://github.com/yyxssm/PointSSC.
Related papers
- 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation [21.300636683882338]
We propose a new method to generate cluster labels that reflect the complete spatial structure and temporal information of objects.
We achieve state-of-the-art results on the multi-scan semantic and moving object segmentation on Semantic KITTI and nuScenes datasets.
arXiv Detail & Related papers (2025-01-06T11:23:13Z) - 3D Part Segmentation via Geometric Aggregation of 2D Visual Features [57.20161517451834]
Supervised 3D part segmentation models are tailored for a fixed set of objects and parts, limiting their transferability to open-set, real-world scenarios.
Recent works have explored vision-language models (VLMs) as a promising alternative, using multi-view rendering and textual prompting to identify object parts.
To address these limitations, we propose COPS, a COmprehensive model for Parts that blends semantics extracted from visual concepts and 3D geometry to effectively identify object parts.
arXiv Detail & Related papers (2024-12-05T15:27:58Z) - ZeroReg: Zero-Shot Point Cloud Registration with Foundation Models [77.84408427496025]
State-of-the-art 3D point cloud registration methods rely on labeled 3D datasets for training.
We introduce ZeroReg, a zero-shot registration approach that utilizes 2D foundation models to predict 3D correspondences.
arXiv Detail & Related papers (2023-12-05T11:33:16Z) - Collect-and-Distribute Transformer for 3D Point Cloud Analysis [82.03517861433849]
We propose a new transformer network equipped with a collect-and-distribute mechanism to communicate short- and long-range contexts of point clouds.
Results show the effectiveness of the proposed CDFormer, delivering several new state-of-the-art performances on point cloud classification and segmentation tasks.
arXiv Detail & Related papers (2023-06-02T03:48:45Z) - FreePoint: Unsupervised Point Cloud Instance Segmentation [72.64540130803687]
We propose FreePoint, for underexplored unsupervised class-agnostic instance segmentation on point clouds.
We represent point features by combining coordinates, colors, and self-supervised deep features.
Based on the point features, we segment point clouds into coarse instance masks as pseudo labels, which are used to train a point cloud instance segmentation model.
arXiv Detail & Related papers (2023-05-11T16:56:26Z) - CasFusionNet: A Cascaded Network for Point Cloud Semantic Scene
Completion by Dense Feature Fusion [14.34344002500153]
We present CasFusionNet, a novel cascaded network for point cloud semantic scene completion by dense feature fusion.
We organize the above three modules via dense feature fusion in each level, and cascade a total of four levels, where we also employ feature fusion between each level for sufficient information usage.
arXiv Detail & Related papers (2022-11-24T16:36:42Z) - Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds [9.489733900529204]
We propose an end-to-end semantic segmentation-assisted scene completion network.
The network takes a raw point cloud as input, and merges the features from the segmentation branch into the completion branch hierarchically.
Our method achieves competitive performance on Semantic KITTI dataset with low latency.
arXiv Detail & Related papers (2021-09-23T15:55:45Z) - SCSS-Net: Superpoint Constrained Semi-supervised Segmentation Network
for 3D Indoor Scenes [6.3364439467281315]
We propose a superpoint constrained semi-supervised segmentation network for 3D point clouds, named as SCSS-Net.
Specifically, we use the pseudo labels predicted from unlabeled point clouds for self-training, and the superpoints produced by geometry-based and color-based Region Growing algorithms are combined to modify and delete pseudo labels with low confidence.
arXiv Detail & Related papers (2021-07-08T04:43:21Z) - Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning
Contextual Shape Priors from Scene Completion [43.86692068523167]
LiDAR point cloud analysis is a core task for 3D computer vision, especially for autonomous driving.
We propose a novel sparse LiDAR point cloud semantic segmentation framework assisted by learned contextual shape priors.
Our JS3C-Net achieves superior performance on both SemanticKITTI and SemanticPOSS benchmarks.
arXiv Detail & Related papers (2020-12-07T14:58:25Z) - LiDAR-based Panoptic Segmentation via Dynamic Shifting Network [56.71765153629892]
LiDAR-based panoptic segmentation aims to parse both objects and scenes in a unified manner.
We propose the Dynamic Shifting Network (DS-Net), which serves as an effective panoptic segmentation framework in the point cloud realm.
Our proposed DS-Net achieves superior accuracies over current state-of-the-art methods.
arXiv Detail & Related papers (2020-11-24T08:44:46Z) - SoftPoolNet: Shape Descriptor for Point Cloud Completion and
Classification [93.54286830844134]
We propose a method for 3D object completion and classification based on point clouds.
For the decoder stage, we propose regional convolutions, a novel operator aimed at maximizing the global activation entropy.
We evaluate our approach on different 3D tasks such as object completion and classification, achieving state-of-the-art accuracy.
arXiv Detail & Related papers (2020-08-17T14:32:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.