MapTRv2: An End-to-End Framework for Online Vectorized HD Map
Construction
- URL: http://arxiv.org/abs/2308.05736v1
- Date: Thu, 10 Aug 2023 17:56:53 GMT
- Title: MapTRv2: An End-to-End Framework for Online Vectorized HD Map
Construction
- Authors: Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu
Liu, Chang Huang, Xinggang Wang
- Abstract summary: High-definition (HD) map provides abundant and precise static environmental information of the driving scene.
We present textbfMap textbfTRansformer, an end-to-end framework for online vectorized HD map construction.
- Score: 32.74879918300096
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: High-definition (HD) map provides abundant and precise static environmental
information of the driving scene, serving as a fundamental and indispensable
component for planning in autonomous driving system. In this paper, we present
\textbf{Map} \textbf{TR}ansformer, an end-to-end framework for online
vectorized HD map construction. We propose a unified permutation-equivalent
modeling approach, \ie, modeling map element as a point set with a group of
equivalent permutations, which accurately describes the shape of map element
and stabilizes the learning process. We design a hierarchical query embedding
scheme to flexibly encode structured map information and perform hierarchical
bipartite matching for map element learning. To speed up convergence, we
further introduce auxiliary one-to-many matching and dense supervision. The
proposed method well copes with various map elements with arbitrary shapes. It
runs at real-time inference speed and achieves state-of-the-art performance on
both nuScenes and Argoverse2 datasets. Abundant qualitative results show stable
and robust map construction quality in complex and various driving scenes. Code
and more demos are available at \url{https://github.com/hustvl/MapTR} for
facilitating further studies and applications.
Related papers
- Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction [15.324464723174533]
This paper introduces MapQR, an end-to-end method with an emphasis on enhancing query capabilities for constructing online vectorized maps.
MapQR utilizes a novel query design, called scatter-and-gather query, which is modelled by separate content and position parts explicitly.
The proposed MapQR achieves the best mean average precision (mAP) and maintains good efficiency on both nuScenes and Argoverse 2.
arXiv Detail & Related papers (2024-02-27T11:43:09Z) - ADMap: Anti-disturbance framework for reconstructing online vectorized
HD map [9.218463154577616]
This paper proposes the Anti-disturbance Map reconstruction framework (ADMap)
To mitigate point-order jitter, the framework consists of three modules: Multi-Scale Perception Neck, Instance Interactive Attention (IIA), and Vector Direction Difference Loss (VDDL)
arXiv Detail & Related papers (2024-01-24T01:37:27Z) - ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD
Map Construction [42.874195888422584]
We propose a novel end-to-end pipeline for online long-range vectorized high-definition (HD) map construction using on-board camera sensors.
We exploit the properties of map elements to improve the performance of map construction.
arXiv Detail & Related papers (2023-10-20T09:46:24Z) - TAPIR: Tracking Any Point with per-frame Initialization and temporal
Refinement [64.11385310305612]
We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried point on any physical surface throughout a video sequence.
Our approach employs two stages: (1) a matching stage, which independently locates a suitable candidate point match for the query point on every other frame, and (2) a refinement stage, which updates both the trajectory and query features based on local correlations.
The resulting model surpasses all baseline methods by a significant margin on the TAP-Vid benchmark, as demonstrated by an approximate 20% absolute average Jaccard (AJ) improvement on DAVIS.
arXiv Detail & Related papers (2023-06-14T17:07:51Z) - G-MSM: Unsupervised Multi-Shape Matching with Graph-based Affinity
Priors [52.646396621449]
G-MSM is a novel unsupervised learning approach for non-rigid shape correspondence.
We construct an affinity graph on a given set of training shapes in a self-supervised manner.
We demonstrate state-of-the-art performance on several recent shape correspondence benchmarks.
arXiv Detail & Related papers (2022-12-06T12:09:24Z) - MapTR: Structured Modeling and Learning for Online Vectorized HD Map
Construction [33.30177029735497]
MapTR is a structured end-to-end framework for efficient online vectorized HD map construction.
MapTR achieves the best performance and efficiency among existing vectorized map construction approaches.
arXiv Detail & Related papers (2022-08-30T17:55:59Z) - Learning Implicit Feature Alignment Function for Semantic Segmentation [51.36809814890326]
Implicit Feature Alignment function (IFA) is inspired by the rapidly expanding topic of implicit neural representations.
We show that IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.
Our method can be combined with improvement on various architectures, and it achieves state-of-the-art accuracy trade-off on common benchmarks.
arXiv Detail & Related papers (2022-06-17T09:40:14Z) - Bending Graphs: Hierarchical Shape Matching using Gated Optimal
Transport [80.64516377977183]
Shape matching has been a long-studied problem for the computer graphics and vision community.
We investigate a hierarchical learning design, to which we incorporate local patch-level information and global shape-level structures.
We propose a novel optimal transport solver by recurrently updating features on non-confident nodes to learn globally consistent correspondences between the shapes.
arXiv Detail & Related papers (2022-02-03T11:41:46Z) - HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps [81.86923212296863]
HD maps are maps with precise definitions of road lanes with rich semantics of the traffic rules.
There are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack.
We propose HDMapGen, a hierarchical graph generation model capable of producing high-quality and diverse HD maps.
arXiv Detail & Related papers (2021-06-28T17:59:30Z) - Dual Geometric Graph Network (DG2N) -- Iterative network for deformable
shape alignment [8.325327265120283]
We provide a novel new approach for aligning geometric models using a dual graph structure where local features are mapping probabilities.
We report state of the art results on stretchable domains alignment in a rapid and stable solution for meshes and cloud of points.
arXiv Detail & Related papers (2020-11-30T12:03:28Z) - Learning Lane Graph Representations for Motion Forecasting [92.88572392790623]
We construct a lane graph from raw map data to preserve the map structure.
We exploit a fusion network consisting of four types of interactions, actor-to-lane, lane-to-lane, lane-to-actor and actor-to-actor.
Our approach significantly outperforms the state-of-the-art on the large scale Argoverse motion forecasting benchmark.
arXiv Detail & Related papers (2020-07-27T17:59:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.