Related papers: RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors

URL: http://arxiv.org/abs/2507.21567v1
Date: Tue, 29 Jul 2025 07:58:52 GMT
Title: RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors
Authors: Tianhui Cai, Yun Zhang, Zewei Zhou, Zhiyu Huang, Jiaqi Ma
Abstract summary: We propose an end-to-end framework that enhances online map construction by incorporating spatial relations and semantic priors. Our method is compatible with both single-frame and temporal perception backbones, achieving state-of-the-art performance on both the nuScenes and Argoverse 2 datasets.
Score: 13.26774106477855
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Online high-definition (HD) map construction plays an increasingly important role in scaling autonomous driving systems. Transformer-based methods have become prevalent in online HD map construction; however, existing approaches often neglect the inherent spatial and semantic relationships among map elements, which limits their accuracy and generalization. To address this, we propose RelMap, an end-to-end framework that enhances online map construction by incorporating spatial relations and semantic priors. We introduce a Class-aware Spatial Relation Prior, which explicitly encodes relative positional dependencies between map elements using a learnable class-aware relation encoder. Additionally, we propose a Mixture-of-Experts (MoE)-based Semantic Prior, which routes features to class-specific experts based on predicted class probabilities, refining instance feature decoding. Our method is compatible with both single-frame and temporal perception backbones, achieving state-of-the-art performance on both the nuScenes and Argoverse 2 datasets.
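The MoE-based Semantic Prior described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not specify the architecture, so everything here is an assumption made for illustration: the function name `moe_semantic_prior`, the use of one linear expert per class, soft (probability-weighted) routing rather than top-1 routing, and the residual connection. It is not the authors' implementation.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def moe_semantic_prior(features, class_logits, expert_weights):
    """Refine instance features with class-specific experts (hypothetical sketch).

    features:       (N, D) instance query features
    class_logits:   (N, C) predicted class scores per instance
    expert_weights: (C, D, D) one linear expert per class (assumed form)
    """
    # Routing weights are the predicted class probabilities.
    probs = softmax(class_logits, axis=-1)                              # (N, C)
    # Apply every class expert to every instance feature.
    expert_out = np.einsum("nd,cde->nce", features, expert_weights)    # (N, C, D)
    # Mix expert outputs according to the class probabilities.
    mixed = np.einsum("nc,nce->ne", probs, expert_out)                 # (N, D)
    # Residual connection (an assumption, not stated in the abstract).
    return features + mixed


rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 8))        # 4 map-element queries, 8-dim features
logits = rng.normal(size=(4, 3))       # 3 hypothetical map classes
experts = rng.normal(size=(3, 8, 8))
refined = moe_semantic_prior(feats, logits, experts)
print(refined.shape)
```

When the class prediction is nearly one-hot, the mixture collapses to the single expert for that class, which is the sense in which the routing is class-specific.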

Related papers

Coherent Online Road Topology Estimation and Reasoning with Standard-Definition Maps [26.036008442130587]
Most autonomous cars rely on the availability of high-definition (HD) maps. Current research aims to address this constraint by directly predicting HD map elements from onboard sensors. We propose a coherent approach to predict lane segments and their corresponding topology, as well as road boundaries.
arXiv Detail & Related papers (2025-07-02T06:26:17Z)
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction [0.4551615447454768]
State-of-the-art map vectorization methods are mainly based on DETR-like frameworks to generate HD maps in an end-to-end manner. In this paper, we propose InteractionMap, which improves previous map vectorization methods by fully leveraging local-to-global information interaction.
arXiv Detail & Related papers (2025-03-27T16:23:15Z)
IC-Mapper: Instance-Centric Spatio-Temporal Modeling for Online Vectorized Map Construction [18.975185033472968]
IC-Mapper is an instance-centric online mapping framework, which comprises two primary components. We perform point sampling on the historical global map from a spatial dimension and integrate it with the detection results of instances corresponding to the current frame to achieve real-time expansion and update of the map.
arXiv Detail & Related papers (2025-03-05T20:28:34Z)
TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior [70.84644266024571]
We propose to train a perception model to "see" standard definition maps (SDMaps). We encode SDMap elements into neural spatial map representations and instance tokens, and then incorporate such complementary features as prior information. Based on the lane segment representation framework, the model simultaneously predicts lanes, centrelines and their topology.
arXiv Detail & Related papers (2024-11-22T06:13:42Z)
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction [20.1127163541618]
We have designed a universal map generation framework, GenMapping. The framework is established with a triadic synergy architecture, including principal and dual auxiliary branches. A thorough array of experimental results shows that the proposed model surpasses current state-of-the-art methods in both semantic mapping and vectorized mapping, while also maintaining a rapid inference speed.
arXiv Detail & Related papers (2024-09-13T10:15:28Z)
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction [40.07726377230152]
High-definition (HD) map provides abundant and precise static environmental information of the driving scene. We present Map TRansformer, an end-to-end framework for online vectorized HD map construction.
arXiv Detail & Related papers (2023-08-10T17:56:53Z)
How To Not Train Your Dragon: Training-free Embodied Object Goal Navigation with Semantic Frontiers [94.46825166907831]
We present a training-free solution to tackle the object goal navigation problem in Embodied AI. Our method builds a structured scene representation based on the classic visual simultaneous localization and mapping (V-SLAM) framework. Our method propagates semantics on the scene graphs based on language priors and scene statistics to introduce semantic knowledge to the geometric frontiers.
arXiv Detail & Related papers (2023-05-26T13:38:33Z)
BEVBert: Multimodal Map Pre-training for Language-guided Navigation [75.23388288113817]
We propose a new map-based pre-training paradigm that is spatial-aware for use in vision-and-language navigation (VLN). We build a local metric map to explicitly aggregate incomplete observations and remove duplicates, while modeling navigation dependency in a global topological map. Based on the hybrid map, we devise a pre-training framework to learn a multimodal map representation, which enhances spatial-aware cross-modal reasoning, thereby facilitating the language-guided navigation goal.
arXiv Detail & Related papers (2022-12-08T16:27:54Z)
Learning Implicit Feature Alignment Function for Semantic Segmentation [51.36809814890326]
Implicit Feature Alignment function (IFA) is inspired by the rapidly expanding topic of implicit neural representations. We show that IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions. Our method can be combined with improvement on various architectures, and it achieves state-of-the-art accuracy trade-off on common benchmarks.
arXiv Detail & Related papers (2022-06-17T09:40:14Z)
Temporally-Consistent Surface Reconstruction using Metrically-Consistent Atlases [131.50372468579067]
We propose a method for unsupervised reconstruction of a temporally-consistent sequence of surfaces from a sequence of time-evolving point clouds. We represent the reconstructed surfaces as atlases computed by a neural network, which enables us to establish correspondences between frames. Our approach outperforms state-of-the-art ones on several challenging datasets.
arXiv Detail & Related papers (2021-11-12T17:48:25Z)
Learning Lane Graph Representations for Motion Forecasting [92.88572392790623]
We construct a lane graph from raw map data to preserve the map structure. We exploit a fusion network consisting of four types of interactions, actor-to-lane, lane-to-lane, lane-to-actor and actor-to-actor. Our approach significantly outperforms the state-of-the-art on the large scale Argoverse motion forecasting benchmark.
arXiv Detail & Related papers (2020-07-27T17:59:49Z)

This list is automatically generated from the titles and abstracts of the papers on this site.