PCPNet: An Efficient and Semantic-Enhanced Transformer Network for Point
Cloud Prediction
- URL: http://arxiv.org/abs/2304.07773v1
- Date: Sun, 16 Apr 2023 13:12:33 GMT
- Title: PCPNet: An Efficient and Semantic-Enhanced Transformer Network for Point
Cloud Prediction
- Authors: Zhen Luo, Junyi Ma, Zijie Zhou and Guangming Xiong
- Abstract summary: We propose a novel efficient Transformer-based network to predict the future LiDAR point clouds.
We also design a semantic auxiliary training strategy to make the predicted LiDAR point cloud sequence semantically similar to the ground truth.
Our method outperforms the state-of-the-art PCP methods on the prediction results and semantic similarity, and has a good real-time performance.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The ability to predict future structure features of environments based on
past perception information is extremely needed by autonomous vehicles, which
helps to make the following decision-making and path planning more reasonable.
Recently, point cloud prediction (PCP) is utilized to predict and describe
future environmental structures by the point cloud form. In this letter, we
propose a novel efficient Transformer-based network to predict the future LiDAR
point clouds exploiting the past point cloud sequences. We also design a
semantic auxiliary training strategy to make the predicted LiDAR point cloud
sequence semantically similar to the ground truth and thus improves the
significance of the deployment for more tasks in real-vehicle applications. Our
approach is completely self-supervised, which means it does not require any
manual labeling and has a solid generalization ability toward different
environments. The experimental results show that our method outperforms the
state-of-the-art PCP methods on the prediction results and semantic similarity,
and has a good real-time performance. Our open-source code and pre-trained
models are available at https://github.com/Blurryface0814/PCPNet.
Related papers
- Explainable time-series forecasting with sampling-free SHAP for Transformers [3.515829683606796]
We introduce SHAPformer, an accurate, fast and sampling-free explainable time-series forecasting model based on the Transformer architecture.<n>It generates explanations in under one second, several orders of magnitude faster than the SHAP Permutation Explainer.<n>On synthetic data with ground truth explanations, SHAPformer provides explanations that are true to the data.
arXiv Detail & Related papers (2025-12-23T17:02:35Z) - VFMF: World Modeling by Forecasting Vision Foundation Model Features [67.09340259579761]
We introduce a generative forecaster that performs autoregressive flow matching in vision foundation models feature space.<n>We show that this latent information more effectively than previously used PCA-based alternatives, both for forecasting and other applications.<n>With matched architecture and compute, our method produces sharper and more accurate predictions than regression across all modalities.
arXiv Detail & Related papers (2025-12-12T02:10:05Z) - OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
We present a framework to simultaneously predict occupied locations and classes using a set of learnable queries.
OPUS incorporates a suite of non-trivial strategies to enhance model performance.
Our lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at near 2x FPS, while our heaviest model surpasses previous best results by 6.1 RayIoU.
arXiv Detail & Related papers (2024-09-14T07:44:22Z) - Robust Load Prediction of Power Network Clusters Based on Cloud-Model-Improved Transformer [3.8801337536879505]
The Transformer model, a leading method for load prediction, faces challenges modeling historical data due to variables like weather, events, festivals, and data volatility.
To tackle this, the cloud model's fuzzy feature is utilized to manage uncertainties effectively.
The Cloud Model Improved Transformer (CMIT) method integrates the Transformer model with the cloud model utilizing the particle swarm optimization algorithm.
arXiv Detail & Related papers (2024-07-30T13:32:26Z) - GVFs in the Real World: Making Predictions Online for Water Treatment [23.651798878534635]
We investigate the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant.
We first describe this dataset, and highlight challenges with seasonality, nonstationarity, partial observability.
We show the importance of learning in deployment, by comparing a TD agent trained purely offline with no online updating to a TD agent that learns online.
arXiv Detail & Related papers (2023-12-04T04:49:10Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point
Cloud Pre-Training [16.825524577372473]
We introduce a point cloud representation learning framework, based on geometric feature reconstruction.
We identify three self-supervised learning objectives to peculiar point clouds, namely centroid prediction, normal estimation, and curvature prediction.
Our pipeline is conceptually simple and it consists of two major steps: first, it randomly masks out groups of points, followed by a Transformer-based point cloud encoder.
arXiv Detail & Related papers (2023-05-15T17:14:55Z) - Interpretable Self-Aware Neural Networks for Robust Trajectory
Prediction [50.79827516897913]
We introduce an interpretable paradigm for trajectory prediction that distributes the uncertainty among semantic concepts.
We validate our approach on real-world autonomous driving data, demonstrating superior performance over state-of-the-art baselines.
arXiv Detail & Related papers (2022-11-16T06:28:20Z) - LOPR: Latent Occupancy PRediction using Generative Models [49.15687400958916]
LiDAR generated occupancy grid maps (L-OGMs) offer a robust bird's eye-view scene representation.
We propose a framework that decouples occupancy prediction into: representation learning and prediction within the learned latent space.
arXiv Detail & Related papers (2022-10-03T22:04:00Z) - Conditioned Human Trajectory Prediction using Iterative Attention Blocks [70.36888514074022]
We present a simple yet effective pedestrian trajectory prediction model aimed at pedestrians positions prediction in urban-like environments.
Our model is a neural-based architecture that can run several layers of attention blocks and transformers in an iterative sequential fashion.
We show that without explicit introduction of social masks, dynamical models, social pooling layers, or complicated graph-like structures, it is possible to produce on par results with SoTA models.
arXiv Detail & Related papers (2022-06-29T07:49:48Z) - SLPC: a VRNN-based approach for stochastic lidar prediction and
completion in autonomous driving [63.87272273293804]
We propose a new LiDAR prediction framework that is based on generative models namely Variational Recurrent Neural Networks (VRNNs)
Our algorithm is able to address the limitations of previous video prediction frameworks when dealing with sparse data by spatially inpainting the depth maps in the upcoming frames.
We present a sparse version of VRNNs and an effective self-supervised training method that does not require any labels.
arXiv Detail & Related papers (2021-02-19T11:56:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.