CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries
and Attention
- URL: http://arxiv.org/abs/2209.07989v1
- Date: Fri, 16 Sep 2022 14:54:57 GMT
- Title: CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries
and Attention
- Authors: Yifeng Bai, Zhirong Chen, Zhangjie Fu, Lang Peng, Pengpeng Liang,
Erkang Cheng
- Abstract summary: 3D lane detection is an integral part of autonomous driving systems.
Previous CNN and Transformer-based methods usually first generate a bird's-eye-view (BEV) feature map from the front view image.
We propose CurveFormer, a single-stage Transformer-based method that directly calculates 3D lane parameters.
- Score: 3.330270927081078
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: 3D lane detection is an integral part of autonomous driving systems. Previous
CNN and Transformer-based methods usually first generate a bird's-eye-view
(BEV) feature map from the front view image, and then use a sub-network with
BEV feature map as input to predict 3D lanes. Such approaches require an
explicit view transformation between BEV and front view, which itself is still
a challenging problem. In this paper, we propose CurveFormer, a single-stage
Transformer-based method that directly calculates 3D lane parameters and can
circumvent the difficult view transformation step. Specifically, we formulate
3D lane detection as a curve propagation problem by using curve queries. A 3D
lane query is represented by a dynamic and ordered anchor point set. In this
way, queries with curve representation in Transformer decoder iteratively
refine the 3D lane detection results. Moreover, a curve cross-attention module
is introduced to compute the similarities between curve queries and image
features. Additionally, a context sampling module that can capture more
relative image features of a curve query is provided to further boost the 3D
lane detection performance. We evaluate our method for 3D lane detection on
both synthetic and real-world datasets, and the experimental results show that
our method achieves promising performance compared with the state-of-the-art
approaches. The effectiveness of each component is validated via ablation
studies as well.
Related papers
- NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows [60.291277312569285]
We present a method for automatically modifying a NeRF representation based on a single observation.
Our method defines the transformation as a 3D flow, specifically as a weighted linear blending of rigid transformations.
We also introduce a new dataset for exploring the problem of modifying a NeRF scene through a single observation.
arXiv Detail & Related papers (2024-06-15T07:58:08Z) - CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal
Curve Queries and Attention [6.337799395191661]
We present CurveFormer++, a single-stage Transformer-based method that does not require the image feature view transform module.
By employing a Transformer decoder, the model can iteratively refine the 3D lane detection results.
We evaluate our approach for the 3D lane detection task on two publicly available real-world datasets.
arXiv Detail & Related papers (2024-02-09T14:13:40Z) - 3D Lane Detection from Front or Surround-View using Joint-Modeling & Matching [27.588395086563978]
We propose a joint modeling approach that combines Bezier curves and methods.
We also introduce a novel 3D Spatial, representing an exploration of 3D surround-view lane detection research.
This innovative method establishes a new benchmark in front-view 3D lane detection on the Openlane dataset.
arXiv Detail & Related papers (2024-01-16T01:12:24Z) - Decoupling the Curve Modeling and Pavement Regression for Lane Detection [67.22629246312283]
curve-based lane representation is a popular approach in many lane detection methods.
We propose a new approach to the lane detection task by decomposing it into two parts: curve modeling and ground height regression.
arXiv Detail & Related papers (2023-09-19T11:24:14Z) - An Efficient Transformer for Simultaneous Learning of BEV and Lane
Representations in 3D Lane Detection [55.281369497158515]
We propose an efficient transformer for 3D lane detection.
Different from the vanilla transformer, our model contains a cross-attention mechanism to simultaneously learn lane and BEV representations.
Our method obtains 2D and 3D lane predictions by applying the lane features to the image-view and BEV features, respectively.
arXiv Detail & Related papers (2023-06-08T04:18:31Z) - Online Lane Graph Extraction from Onboard Video [133.68032636906133]
We use the video stream from an onboard camera for online extraction of the surrounding's lane graph.
Using video, instead of a single image, as input poses both benefits and challenges in terms of combining the information from different timesteps.
A single model of this proposed simple, yet effective, method can process any number of images, including one, to produce accurate lane graphs.
arXiv Detail & Related papers (2023-04-03T12:36:39Z) - Rethinking Efficient Lane Detection via Curve Modeling [37.45243848960598]
The proposed method achieves a new state-of-the-art performance on the popular LLAMAS benchmark.
It also achieves favorable accuracy on the TuSimple and CU datasets, while retaining both low latency (> 150 FPS) and small model size ( 10M)
arXiv Detail & Related papers (2022-03-04T17:00:33Z) - DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes [54.239416488865565]
We propose a fast single-stage 3D object detection method for LIDAR data.
The core novelty of our method is a fast, single-pass architecture that both detects objects in 3D and estimates their shapes.
We find that our proposed method achieves state-of-the-art results by 5% on object detection in ScanNet scenes, and it gets top results by 3.4% in the Open dataset.
arXiv Detail & Related papers (2020-04-02T17:48:50Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.