Related papers: MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction

MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction

URL: http://arxiv.org/abs/2306.10301v1
Date: Sat, 17 Jun 2023 09:06:48 GMT
Title: MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction
Authors: Limeng Qiao, Yongchao Zheng, Peng Zhang, Wenjie Ding, Xi Qiu, Xing Wei, Chi Zhang
Abstract summary: This report introduces the 1st place winning solution for the Autonomous Driving Challenge 2023 - Online HD-map Construction. We elaborate an effective architecture, termed as MachMap, which formulates the task of HD-map construction as the point detection paradigm.
Score: 24.517848530666907
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This report introduces the 1st place winning solution for the Autonomous Driving Challenge 2023 - Online HD-map Construction. By delving into the vectorization pipeline, we elaborate an effective architecture, termed as MachMap, which formulates the task of HD-map construction as the point detection paradigm in the bird-eye-view space with an end-to-end manner. Firstly, we introduce a novel map-compaction scheme into our framework, leading to reducing the number of vectorized points by 93% without any expression performance degradation. Build upon the above process, we then follow the general query-based paradigm and propose a strong baseline with integrating a powerful CNN-based backbone like InternImage, a temporal-based instance decoder and a well-designed point-mask coupling head. Additionally, an extra optional ensemble stage is utilized to refine model predictions for better performance. Our MachMap-tiny with IN-1K initialization achieves a mAP of 79.1 on the Argoverse2 benchmark and the further improved MachMap-huge reaches the best mAP of 83.5, outperforming all the other online HD-map construction approaches on the final leaderboard with a distinct performance margin (> 9.8 mAP at least).

Related papers

TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior [70.84644266024571]
We propose to train a perception model to "see" standard definition maps (SDMaps) We encode SDMap elements into neural spatial map representations and instance tokens, and then incorporate such complementary features as prior information. Based on the lane segment representation framework, the model simultaneously predicts lanes, centrelines and their topology.
arXiv Detail & Related papers (2024-11-22T06:13:42Z)
PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors [15.475364300374403]
We introduce PriorMapNet to enhance online vectorized HD map construction with priors. Our proposed PriorMapNet achieves state-of-the-art performance in the online vectorized HD map construction task on nuScenes and Argoverse2 datasets.
arXiv Detail & Related papers (2024-08-16T15:26:23Z)
ADMap: Anti-disturbance framework for reconstructing online vectorized HD map [9.218463154577616]
This paper proposes the Anti-disturbance Map reconstruction framework (ADMap) To mitigate point-order jitter, the framework consists of three modules: Multi-Scale Perception Neck, Instance Interactive Attention (IIA), and Vector Direction Difference Loss (VDDL)
arXiv Detail & Related papers (2024-01-24T01:37:27Z)
MapNeXt: Revisiting Training and Scaling Practices for Online Vectorized HD Map Construction [0.0]
We present a full-scale upgrade of MapTR and propose MapNeXt, the next generation of HD map learning architecture. MapNeXt-Huge achieves state-of-the-art performance on the challenging nuScenes benchmark.
arXiv Detail & Related papers (2024-01-14T16:14:36Z)
Online Vectorized HD Map Construction using Geometry [17.33973935325903]
We propose GeMap, which learns Euclidean shapes and relations of map instances beyond basic perception. Our method achieves new state-of-the-art performance on the NuScenes and Argoverse 2 datasets.
arXiv Detail & Related papers (2023-12-06T08:26:26Z)
ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map Construction [42.874195888422584]
We propose a novel end-to-end pipeline for online long-range vectorized high-definition (HD) map construction using on-board camera sensors. We exploit the properties of map elements to improve the performance of map construction.
arXiv Detail & Related papers (2023-10-20T09:46:24Z)
Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications [53.553924052102126]
We present a complete pipeline for 3D semantic mapping solely based on a stereo camera system. The pipeline comprises a direct visual odometry front-end as well as a back-end for global temporal integration. We propose a simple but effective voting scheme which improves the quality and consistency of the 3D point labels.
arXiv Detail & Related papers (2022-03-02T13:18:38Z)
HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps [81.86923212296863]
HD maps are maps with precise definitions of road lanes with rich semantics of the traffic rules. There are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack. We propose HDMapGen, a hierarchical graph generation model capable of producing high-quality and diverse HD maps.
arXiv Detail & Related papers (2021-06-28T17:59:30Z)
When Liebig's Barrel Meets Facial Landmark Detection: A Practical Model [87.25037167380522]
We propose a model that is accurate, robust, efficient, generalizable, and end-to-end trainable. In order to achieve a better accuracy, we propose two lightweight modules. DQInit dynamically initializes the queries of decoder from the inputs, enabling the model to achieve as good accuracy as the ones with multiple decoder layers. QAMem is designed to enhance the discriminative ability of queries on low-resolution feature maps by assigning separate memory values to each query rather than a shared one.
arXiv Detail & Related papers (2021-05-27T13:51:42Z)
HDNET: Exploiting HD Maps for 3D Object Detection [99.49035895393934]
We show that High-Definition (HD) maps provide strong priors that can boost the performance and robustness of modern 3D object detectors. We design a single stage detector that extracts geometric and semantic features from the HD maps. As maps might not be available everywhere, we also propose a map prediction module that estimates the map on the fly from raw LiDAR data.
arXiv Detail & Related papers (2020-12-21T21:59:54Z)

This list is automatically generated from the titles and abstracts of the papers in this site.