M3TR: Generalist HD Map Construction with Variable Map Priors
- URL: http://arxiv.org/abs/2411.10316v1
- Date: Fri, 15 Nov 2024 16:14:48 GMT
- Title: M3TR: Generalist HD Map Construction with Variable Map Priors
- Authors: Fabian Immel, Richard Fehler, Frank Bieder, Jan-Hendrik Pauls, Christoph Stiller,
- Abstract summary: We introduce M3TR, a generalist approach for HD map construction both with and without map priors.
We propose the first realistic scenarios with semantically diverse map priors.
We show that training across all prior scenarios yields a single Generalist model, whose performance is on par with previous Expert models.
- Score: 6.314412580044879
- License:
- Abstract: Autonomous vehicles require road information for their operation, usually in form of HD maps. Since offline maps eventually become outdated or may only be partially available, online HD map construction methods have been proposed to infer map information from live sensor data. A key issue remains how to exploit such partial or outdated map information as a prior. We introduce M3TR (Multi-Masking Map Transformer), a generalist approach for HD map construction both with and without map priors. We address shortcomings in ground truth generation for Argoverse 2 and nuScenes and propose the first realistic scenarios with semantically diverse map priors. Examining various query designs, we use an improved method for integrating prior map elements into a HD map construction model, increasing performance by +4.3 mAP. Finally, we show that training across all prior scenarios yields a single Generalist model, whose performance is on par with previous Expert models that can handle only one specific type of map prior. M3TR thus is the first model capable of leveraging variable map priors, making it suitable for real-world deployment. Code is available at https://github.com/immel-f/m3tr
Related papers
- TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior [70.84644266024571]
We propose to train a perception model to "see" standard definition maps (SDMaps)
We encode SDMap elements into neural spatial map representations and instance tokens, and then incorporate such complementary features as prior information.
Based on the lane segment representation framework, the model simultaneously predicts lanes, centrelines and their topology.
arXiv Detail & Related papers (2024-11-22T06:13:42Z) - ExelMap: Explainable Element-based HD-Map Change Detection and Update [2.79552147676281]
We propose a novel task of explainable element-based HD map change detection and update.
We present ExelMap, an explainable element-based map updating strategy that specifically identifies changed map elements.
This is the first comprehensive problem investigation of real-world end-to-end element-based HD map change detection and update.
arXiv Detail & Related papers (2024-09-16T11:17:33Z) - Enhancing Vectorized Map Perception with Historical Rasterized Maps [37.48510990922406]
We propose HRMapNet, leveraging a low-cost Historical Rasterized Map to enhance online vectorized map perception.
The historicalized map can be easily constructed from past predicted vectorized results and provides valuable complementary information.
HRMapNet can be integrated with most online vectorized map perception methods.
arXiv Detail & Related papers (2024-09-01T05:22:33Z) - P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors [33.73570824028748]
We present P-MapNet, in which the letter P highlights the fact that we focus on incorporating map priors to improve model performance.
Our attention-based architecture adaptively attends to relevant SDMap skeletons and significantly improves performance.
P-MapNet is a far-seeing solution that brings larger improvements on longer ranges.
arXiv Detail & Related papers (2024-03-15T17:59:53Z) - Augmenting Lane Perception and Topology Understanding with Standard
Definition Navigation Maps [51.24861159115138]
Standard Definition (SD) maps are more affordable and have worldwide coverage, offering a scalable alternative.
We propose a novel framework to integrate SD maps into online map prediction and propose a Transformer-based encoder, SD Map Representations from transFormers.
This enhancement consistently and significantly boosts (by up to 60%) lane detection and topology prediction on current state-of-the-art online map prediction methods.
arXiv Detail & Related papers (2023-11-07T15:42:22Z) - HDMapNet: An Online HD Map Construction and Evaluation Framework [23.19001503634617]
HD map construction is a crucial problem for autonomous driving.
Traditional HD maps are coupled with centimeter-level accurate localization which is unreliable in many scenarios.
Online map learning is a more scalable way to provide semantic and geometry priors to self-driving vehicles.
arXiv Detail & Related papers (2021-07-13T18:06:46Z) - HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps [81.86923212296863]
HD maps are maps with precise definitions of road lanes with rich semantics of the traffic rules.
There are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack.
We propose HDMapGen, a hierarchical graph generation model capable of producing high-quality and diverse HD maps.
arXiv Detail & Related papers (2021-06-28T17:59:30Z) - MP3: A Unified Model to Map, Perceive, Predict and Plan [84.07678019017644]
MP3 is an end-to-end approach to mapless driving where the input is raw sensor data and a high-level command.
We show that our approach is significantly safer, more comfortable, and can follow commands better than the baselines in challenging long-term closed-loop simulations.
arXiv Detail & Related papers (2021-01-18T00:09:30Z) - HDNET: Exploiting HD Maps for 3D Object Detection [99.49035895393934]
We show that High-Definition (HD) maps provide strong priors that can boost the performance and robustness of modern 3D object detectors.
We design a single stage detector that extracts geometric and semantic features from the HD maps.
As maps might not be available everywhere, we also propose a map prediction module that estimates the map on the fly from raw LiDAR data.
arXiv Detail & Related papers (2020-12-21T21:59:54Z) - OpenREALM: Real-time Mapping for Unmanned Aerial Vehicles [62.997667081978825]
OpenREALM is a real-time mapping framework for Unmanned Aerial Vehicles (UAVs)
Different modes of operation allow OpenREALM to perform simple stitching assuming an approximate plane ground.
In all modes incremental progress of the resulting map can be viewed live by an operator on the ground.
arXiv Detail & Related papers (2020-09-22T12:28:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.