EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor
Neighborhoods
- URL: http://arxiv.org/abs/2402.18278v2
- Date: Fri, 8 Mar 2024 04:33:06 GMT
- Title: EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor
Neighborhoods
- Authors: Huiyuan Xiong, Jun Shen, Taohong Zhu, Yuelong Pan
- Abstract summary: We propose EAN-MapNet for Efficiently constructing HD map using Anchor Neighborhoods.
On the nuScenes dataset, EAN-MapNet achieves a state-of-the-art performance with 63.0 mAP after training for 24 epochs, surpassing MapTR by 12.7 mAP.
- Score: 3.699463628959233
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: High-definition (HD) map is crucial for autonomous driving systems. Most
existing works design map elements detection heads based on the DETR decoder.
However, the initial queries lack explicit incorporation of physical positional
information, and vanilla self-attention entails high computational complexity.
Therefore, we propose EAN-MapNet for Efficiently constructing HD map using
Anchor Neighborhoods. Firstly, we design query units based on the anchor
neighborhoods, allowing non-neighborhood central anchors to effectively assist
in fitting the neighborhood central anchors to the target points representing
map elements. Then, we propose grouped local self-attention (GL-SA) by
leveraging the relative instance relationship among the queries. This
facilitates direct feature interaction among queries of the same instances,
while innovatively employing local queries as intermediaries for interaction
among queries from different instances. Consequently, GL-SA significantly
reduces the computational complexity of self-attention while ensuring ample
feature interaction among queries. On the nuScenes dataset, EAN-MapNet achieves
a state-of-the-art performance with 63.0 mAP after training for 24 epochs,
surpassing MapTR by 12.7 mAP. Furthermore, it considerably reduces memory
consumption by 8198M compared to MapTRv2.
Related papers
- TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior [70.84644266024571]
We propose to train a perception model to "see" standard definition maps (SDMaps)
We encode SDMap elements into neural spatial map representations and instance tokens, and then incorporate such complementary features as prior information.
Based on the lane segment representation framework, the model simultaneously predicts lanes, centrelines and their topology.
arXiv Detail & Related papers (2024-11-22T06:13:42Z) - MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction [75.93907511203317]
We propose MGMapNet (Multi-Granularity Map Network) to model map element with a multi-granularity representation.
The proposed MGMapNet achieves state-of-the-art performance, surpassing MapTRv2 by 5.3 mAP on nuScenes and 4.4 mAP on Argoverse2 respectively.
arXiv Detail & Related papers (2024-10-10T09:05:23Z) - Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction [15.324464723174533]
This paper introduces MapQR, an end-to-end method with an emphasis on enhancing query capabilities for constructing online vectorized maps.
MapQR utilizes a novel query design, called scatter-and-gather query, which is modelled by separate content and position parts explicitly.
The proposed MapQR achieves the best mean average precision (mAP) and maintains good efficiency on both nuScenes and Argoverse 2.
arXiv Detail & Related papers (2024-02-27T11:43:09Z) - ADMap: Anti-disturbance framework for reconstructing online vectorized
HD map [9.218463154577616]
This paper proposes the Anti-disturbance Map reconstruction framework (ADMap)
To mitigate point-order jitter, the framework consists of three modules: Multi-Scale Perception Neck, Instance Interactive Attention (IIA), and Vector Direction Difference Loss (VDDL)
arXiv Detail & Related papers (2024-01-24T01:37:27Z) - Decoupled DETR: Spatially Disentangling Localization and Classification
for Improved End-to-End Object Detection [48.429555904690595]
We introduce spatially decoupled DETR, which includes a task-aware query generation module and a disentangled feature learning process.
We demonstrate that our approach achieves a significant improvement in MSCOCO datasets compared to previous work.
arXiv Detail & Related papers (2023-10-24T15:54:11Z) - InsMapper: Exploring Inner-instance Information for Vectorized HD
Mapping [41.59891369655983]
InsMapper harnesses inner-instance information for vectorized high-definition mapping through transformers.
InsMapper surpasses the previous state-of-the-art method, demonstrating its effectiveness and generality.
arXiv Detail & Related papers (2023-08-16T17:58:28Z) - DETR Doesn't Need Multi-Scale or Locality Design [69.56292005230185]
This paper presents an improved DETR detector that maintains a "plain" nature.
It uses a single-scale feature map and global cross-attention calculations without specific locality constraints.
We show that two simple technologies are surprisingly effective within a plain design to compensate for the lack of multi-scale feature maps and locality constraints.
arXiv Detail & Related papers (2023-08-03T17:59:04Z) - MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction [24.517848530666907]
This report introduces the 1st place winning solution for the Autonomous Driving Challenge 2023 - Online HD-map Construction.
We elaborate an effective architecture, termed as MachMap, which formulates the task of HD-map construction as the point detection paradigm.
arXiv Detail & Related papers (2023-06-17T09:06:48Z) - DQnet: Cross-Model Detail Querying for Camouflaged Object Detection [54.82390534024954]
A convolutional neural network (CNN) for camouflaged object detection tends to activate local discriminative regions while ignoring complete object extent.
In this paper, we argue that partial activation is caused by the intrinsic characteristics of CNN.
In order to obtain feature maps that could activate full object extent, a novel framework termed Cross-Model Detail Querying network (DQnet) is proposed.
arXiv Detail & Related papers (2022-12-16T06:23:58Z) - MapTR: Structured Modeling and Learning for Online Vectorized HD Map
Construction [33.30177029735497]
MapTR is a structured end-to-end framework for efficient online vectorized HD map construction.
MapTR achieves the best performance and efficiency among existing vectorized map construction approaches.
arXiv Detail & Related papers (2022-08-30T17:55:59Z) - Rethinking Localization Map: Towards Accurate Object Perception with
Self-Enhancement Maps [78.2581910688094]
This work introduces a novel self-enhancement method to harvest accurate object localization maps and object boundaries with only category labels as supervision.
In particular, the proposed Self-Enhancement Maps achieve the state-of-the-art localization accuracy of 54.88% on ILSVRC.
arXiv Detail & Related papers (2020-06-09T12:35:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.