RNGDet: Road Network Graph Detection by Transformer in Aerial Images
- URL: http://arxiv.org/abs/2202.07824v1
- Date: Wed, 16 Feb 2022 01:59:41 GMT
- Title: RNGDet: Road Network Graph Detection by Transformer in Aerial Images
- Authors: Zhenhua Xu, Yuxuan Liu, Lu Gan, Yuxiang Sun, Ming Liu and Lujia Wang
- Abstract summary: Road network graphs provide critical information for autonomous vehicle applications.
manually annotating road network graphs is inefficient and labor-intensive.
We propose a novel approach based on transformer and imitation learning named RNGDet.
- Score: 19.141279413414082
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Road network graphs provide critical information for autonomous vehicle
applications, such as motion planning on drivable areas. However, manually
annotating road network graphs is inefficient and labor-intensive.
Automatically detecting road network graphs could alleviate this issue, but
existing works are either segmentation-based approaches that could not ensure
satisfactory topology correctness, or graph-based approaches that could not
present precise enough detection results. To provide a solution to these
problems, we propose a novel approach based on transformer and imitation
learning named RNGDet (\underline{R}oad \underline{N}etwork \underline{G}raph
\underline{Det}ection by Transformer) in this paper. In view of that
high-resolution aerial images could be easily accessed all over the world
nowadays, we make use of aerial images in our approach. Taken as input an
aerial image, our approach iteratively generates road network graphs
vertex-by-vertex. Our approach can handle complicated intersection points of
various numbers of road segments. We evaluate our approach on a publicly
available dataset. The superiority of our approach is demonstrated through the
comparative experiments.
Related papers
- Learning Lane Graphs from Aerial Imagery Using Transformers [7.718401895021425]
This work introduces a novel approach to generating successor lane graphs from aerial imagery.
We frame successor lane graphs as a collection of maximal length paths and predict them using a Detection Transformer (DETR) architecture.
We demonstrate the efficacy of our method through extensive experiments on the diverse and large-scale UrbanLaneGraph dataset.
arXiv Detail & Related papers (2024-07-08T07:42:32Z) - Homography Guided Temporal Fusion for Road Line and Marking Segmentation [73.47092021519245]
Road lines and markings are frequently occluded in the presence of moving vehicles, shadow, and glare.
We propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues.
We show that exploiting available camera intrinsic data and ground plane assumption for cross-frame correspondence can lead to a light-weight network with significantly improved performances in speed and accuracy.
arXiv Detail & Related papers (2024-04-11T10:26:40Z) - Online Lane Graph Extraction from Onboard Video [133.68032636906133]
We use the video stream from an onboard camera for online extraction of the surrounding's lane graph.
Using video, instead of a single image, as input poses both benefits and challenges in terms of combining the information from different timesteps.
A single model of this proposed simple, yet effective, method can process any number of images, including one, to produce accurate lane graphs.
arXiv Detail & Related papers (2023-04-03T12:36:39Z) - RNGDet++: Road Network Graph Detection by Transformer with Instance
Segmentation and Multi-scale Features Enhancement [19.263691277963368]
The graph structure of road networks is critical for downstream tasks of autonomous driving systems, such as global planning, motion prediction and control.
In the past, the road network graph is usually manually annotated by human experts, which is time-consuming and labor-intensive.
Previous works either post-process semantic segmentation maps or propose graph-based algorithms to directly predict the road network graph.
Previous works suffer from hard-coded processing algorithms and inferior final performance.
Since the new proposed approach is improved from RNGDet, it is named RNGDet++.
arXiv Detail & Related papers (2022-09-21T07:06:46Z) - Self-Supervised Road Layout Parsing with Graph Auto-Encoding [5.45914480139453]
We present a neural network approach that takes a road- map in bird's eye view as input, and predicts a human-interpretable graph that represents the road's topological layout.
Our approach elevates the understanding of road layouts from pixel level to the level of graphs.
arXiv Detail & Related papers (2022-03-21T14:14:26Z) - Road Extraction from Overhead Images with Graph Neural Networks [18.649284163019516]
We propose a method that directly infers the final road graph in a single pass.
The key idea consists in combining a Fully Convolutional Network in charge of locating points of interest and a Graph Neural Network which predicts links between these points.
We evaluate our method against existing works on the popular RoadTracer dataset and achieve competitive results.
arXiv Detail & Related papers (2021-12-09T21:10:27Z) - Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust
Road Extraction [110.61383502442598]
We introduce a novel neural network framework termed Cross-Modal Message Propagation Network (CMMPNet)
CMMPNet is composed of two deep Auto-Encoders for modality-specific representation learning and a tailor-designed Dual Enhancement Module for cross-modal representation refinement.
Experiments on three real-world benchmarks demonstrate the effectiveness of our CMMPNet for robust road extraction.
arXiv Detail & Related papers (2021-11-30T04:30:10Z) - Road Network Guided Fine-Grained Urban Traffic Flow Inference [108.64631590347352]
Accurate inference of fine-grained traffic flow from coarse-grained one is an emerging yet crucial problem.
We propose a novel Road-Aware Traffic Flow Magnifier (RATFM) that exploits the prior knowledge of road networks.
Our method can generate high-quality fine-grained traffic flow maps.
arXiv Detail & Related papers (2021-09-29T07:51:49Z) - SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and
Interaction Space Graph Reasoning for Autonomous Driving [64.10636296274168]
Road extraction is an essential step in building autonomous navigation systems.
Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between road segments in the image.
We propose a Spatial and Interaction Space Graph Reasoning (SPIN) module which when plugged into a ConvNet performs reasoning over graphs constructed on spatial and interaction spaces projected from the feature maps.
arXiv Detail & Related papers (2021-09-16T03:52:17Z) - Graph Representation Learning for Road Type Classification [13.227651826285014]
We present a learning-based approach to graph representations of road networks employing state-of-the-art graph convolutional neural networks.
Our approach is applied to realistic road networks of 17 cities from Open Street Map.
arXiv Detail & Related papers (2021-07-16T09:32:58Z) - Convolutional Recurrent Network for Road Boundary Extraction [99.55522995570063]
We tackle the problem of drivable road boundary extraction from LiDAR and camera imagery.
We design a structured model where a fully convolutional network obtains deep features encoding the location and direction of road boundaries.
We showcase the effectiveness of our method on a large North American city where we obtain perfect topology of road boundaries 99.3% of the time.
arXiv Detail & Related papers (2020-12-21T18:59:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.