PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction
- URL: http://arxiv.org/abs/2308.16477v2
- Date: Fri, 1 Sep 2023 03:14:03 GMT
- Title: PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction
- Authors: Wenjie Ding, Limeng Qiao, Xi Qiu, Chi Zhang
- Abstract summary: We propose a simple yet effective architecture named PivotNet, which adopts unified pivot-based map representations.
PivotNet is remarkably superior to other SOTAs by 5.9 mAP at least.
- Score: 10.936405710245625
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Vectorized high-definition map online construction has garnered considerable
attention in the field of autonomous driving research. Most existing approaches
model changeable map elements using a fixed number of points, or predict local
maps in a two-stage autoregressive manner, which may miss essential details and
lead to error accumulation. Towards precise map element learning, we propose a
simple yet effective architecture named PivotNet, which adopts unified
pivot-based map representations and is formulated as a direct set prediction
paradigm. Concretely, we first propose a novel point-to-line mask module to
encode both the subordinate and geometrical point-line priors in the network.
Then, a well-designed pivot dynamic matching module is proposed to model the
topology in dynamic point sequences by introducing the concept of sequence
matching. Furthermore, to supervise the position and topology of the vectorized
point predictions, we propose a dynamic vectorized sequence loss. Extensive
experiments and ablations show that PivotNet is remarkably superior to other
SOTAs by 5.9 mAP at least. The code will be available soon.
Related papers
- TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior [70.84644266024571]
We propose to train a perception model to "see" standard definition maps (SDMaps)
We encode SDMap elements into neural spatial map representations and instance tokens, and then incorporate such complementary features as prior information.
Based on the lane segment representation framework, the model simultaneously predicts lanes, centrelines and their topology.
arXiv Detail & Related papers (2024-11-22T06:13:42Z) - PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors [15.475364300374403]
We introduce PriorMapNet to enhance online vectorized HD map construction with priors.
Our proposed PriorMapNet achieves state-of-the-art performance in the online vectorized HD map construction task on nuScenes and Argoverse2 datasets.
arXiv Detail & Related papers (2024-08-16T15:26:23Z) - PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction [9.32290307534907]
PrevPredMap is a pioneering temporal modeling framework that leverages previous predictions for constructing online vectorized HD maps.
The framework achieves state-of-the-art performance on the nuScenes and Argoverse2 datasets.
arXiv Detail & Related papers (2024-07-24T15:58:24Z) - ADMap: Anti-disturbance framework for reconstructing online vectorized
HD map [9.218463154577616]
This paper proposes the Anti-disturbance Map reconstruction framework (ADMap)
To mitigate point-order jitter, the framework consists of three modules: Multi-Scale Perception Neck, Instance Interactive Attention (IIA), and Vector Direction Difference Loss (VDDL)
arXiv Detail & Related papers (2024-01-24T01:37:27Z) - Minimally Supervised Learning using Topological Projections in
Self-Organizing Maps [55.31182147885694]
We introduce a semi-supervised learning approach based on topological projections in self-organizing maps (SOMs)
Our proposed method first trains SOMs on unlabeled data and then a minimal number of available labeled data points are assigned to key best matching units (BMU)
Our results indicate that the proposed minimally supervised model significantly outperforms traditional regression techniques.
arXiv Detail & Related papers (2024-01-12T22:51:48Z) - InsMapper: Exploring Inner-instance Information for Vectorized HD
Mapping [41.59891369655983]
InsMapper harnesses inner-instance information for vectorized high-definition mapping through transformers.
InsMapper surpasses the previous state-of-the-art method, demonstrating its effectiveness and generality.
arXiv Detail & Related papers (2023-08-16T17:58:28Z) - Online Map Vectorization for Autonomous Driving: A Rasterization
Perspective [58.71769343511168]
We introduce a newization-based evaluation metric, which has superior sensitivity and is better suited to real-world autonomous driving scenarios.
We also propose MapVR (Map Vectorization via Rasterization), a novel framework that applies differentiableization to preciseized outputs and then performs geometry-aware supervision on HD maps.
arXiv Detail & Related papers (2023-06-18T08:51:14Z) - End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve [9.129634919566026]
HD-map construction has attracted significant research interest in the autonomous driving community.
We introduce a simple yet effective architecture, named Piecewise Bezier HD-map Network (BeMapNet), which is formulated as a direct set prediction paradigm and postprocessing-free.
In addition, based on the progressively restoration of Bezier curve, we also present an efficient Point-Curve-Region Loss for supervising more robust and precise HD-map modeling.
arXiv Detail & Related papers (2023-06-16T09:05:52Z) - Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [54.29252286561449]
We propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN.
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
In the second stage, for each guided point, different visual feature is extracted by the localization.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
arXiv Detail & Related papers (2020-07-21T04:59:15Z) - Pre-Trained Models for Heterogeneous Information Networks [57.78194356302626]
We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network.
PF-HIN consistently and significantly outperforms state-of-the-art alternatives on each of these tasks, on four datasets.
arXiv Detail & Related papers (2020-07-07T03:36:28Z) - Gated Path Selection Network for Semantic Segmentation [72.44994579325822]
We develop a novel network named Gated Path Selection Network (GPSNet), which aims to learn adaptive receptive fields.
In GPSNet, we first design a two-dimensional multi-scale network - SuperNet, which densely incorporates features from growing receptive fields.
To dynamically select desirable semantic context, a gate prediction module is further introduced.
arXiv Detail & Related papers (2020-01-19T12:32:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.