MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction
- URL: http://arxiv.org/abs/2306.10301v1
- Date: Sat, 17 Jun 2023 09:06:48 GMT
- Title: MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction
- Authors: Limeng Qiao, Yongchao Zheng, Peng Zhang, Wenjie Ding, Xi Qiu, Xing
Wei, Chi Zhang
- Abstract summary: This report introduces the 1st place winning solution for the Autonomous Driving Challenge 2023 - Online HD-map Construction.
We elaborate an effective architecture, termed as MachMap, which formulates the task of HD-map construction as the point detection paradigm.
- Score: 24.517848530666907
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This report introduces the 1st place winning solution for the Autonomous
Driving Challenge 2023 - Online HD-map Construction. By delving into the
vectorization pipeline, we elaborate an effective architecture, termed as
MachMap, which formulates the task of HD-map construction as the point
detection paradigm in the bird-eye-view space with an end-to-end manner.
Firstly, we introduce a novel map-compaction scheme into our framework, leading
to reducing the number of vectorized points by 93% without any expression
performance degradation. Build upon the above process, we then follow the
general query-based paradigm and propose a strong baseline with integrating a
powerful CNN-based backbone like InternImage, a temporal-based instance decoder
and a well-designed point-mask coupling head. Additionally, an extra optional
ensemble stage is utilized to refine model predictions for better performance.
Our MachMap-tiny with IN-1K initialization achieves a mAP of 79.1 on the
Argoverse2 benchmark and the further improved MachMap-huge reaches the best mAP
of 83.5, outperforming all the other online HD-map construction approaches on
the final leaderboard with a distinct performance margin (> 9.8 mAP at least).
Related papers
- TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior [70.84644266024571]
We propose to train a perception model to "see" standard definition maps (SDMaps)
We encode SDMap elements into neural spatial map representations and instance tokens, and then incorporate such complementary features as prior information.
Based on the lane segment representation framework, the model simultaneously predicts lanes, centrelines and their topology.
arXiv Detail & Related papers (2024-11-22T06:13:42Z) - PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors [15.475364300374403]
We introduce PriorMapNet to enhance online vectorized HD map construction with priors.
Our proposed PriorMapNet achieves state-of-the-art performance in the online vectorized HD map construction task on nuScenes and Argoverse2 datasets.
arXiv Detail & Related papers (2024-08-16T15:26:23Z) - ADMap: Anti-disturbance framework for reconstructing online vectorized
HD map [9.218463154577616]
This paper proposes the Anti-disturbance Map reconstruction framework (ADMap)
To mitigate point-order jitter, the framework consists of three modules: Multi-Scale Perception Neck, Instance Interactive Attention (IIA), and Vector Direction Difference Loss (VDDL)
arXiv Detail & Related papers (2024-01-24T01:37:27Z) - MapNeXt: Revisiting Training and Scaling Practices for Online Vectorized
HD Map Construction [0.0]
We present a full-scale upgrade of MapTR and propose MapNeXt, the next generation of HD map learning architecture.
MapNeXt-Huge achieves state-of-the-art performance on the challenging nuScenes benchmark.
arXiv Detail & Related papers (2024-01-14T16:14:36Z) - Online Vectorized HD Map Construction using Geometry [17.33973935325903]
We propose GeMap, which learns Euclidean shapes and relations of map instances beyond basic perception.
Our method achieves new state-of-the-art performance on the NuScenes and Argoverse 2 datasets.
arXiv Detail & Related papers (2023-12-06T08:26:26Z) - ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD
Map Construction [42.874195888422584]
We propose a novel end-to-end pipeline for online long-range vectorized high-definition (HD) map construction using on-board camera sensors.
We exploit the properties of map elements to improve the performance of map construction.
arXiv Detail & Related papers (2023-10-20T09:46:24Z) - Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving
Applications [53.553924052102126]
We present a complete pipeline for 3D semantic mapping solely based on a stereo camera system.
The pipeline comprises a direct visual odometry front-end as well as a back-end for global temporal integration.
We propose a simple but effective voting scheme which improves the quality and consistency of the 3D point labels.
arXiv Detail & Related papers (2022-03-02T13:18:38Z) - HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps [81.86923212296863]
HD maps are maps with precise definitions of road lanes with rich semantics of the traffic rules.
There are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack.
We propose HDMapGen, a hierarchical graph generation model capable of producing high-quality and diverse HD maps.
arXiv Detail & Related papers (2021-06-28T17:59:30Z) - When Liebig's Barrel Meets Facial Landmark Detection: A Practical Model [87.25037167380522]
We propose a model that is accurate, robust, efficient, generalizable, and end-to-end trainable.
In order to achieve a better accuracy, we propose two lightweight modules.
DQInit dynamically initializes the queries of decoder from the inputs, enabling the model to achieve as good accuracy as the ones with multiple decoder layers.
QAMem is designed to enhance the discriminative ability of queries on low-resolution feature maps by assigning separate memory values to each query rather than a shared one.
arXiv Detail & Related papers (2021-05-27T13:51:42Z) - HDNET: Exploiting HD Maps for 3D Object Detection [99.49035895393934]
We show that High-Definition (HD) maps provide strong priors that can boost the performance and robustness of modern 3D object detectors.
We design a single stage detector that extracts geometric and semantic features from the HD maps.
As maps might not be available everywhere, we also propose a map prediction module that estimates the map on the fly from raw LiDAR data.
arXiv Detail & Related papers (2020-12-21T21:59:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.