Patched Line Segment Learning for Vector Road Mapping
- URL: http://arxiv.org/abs/2309.02923v1
- Date: Wed, 6 Sep 2023 11:33:25 GMT
- Title: Patched Line Segment Learning for Vector Road Mapping
- Authors: Jiakun Xu, Bowen Xu, Gui-Song Xia, Liang Dong, Nan Xue
- Abstract summary: We build upon a well-defined Patched Line Segment representation for road graphs that holds geometric significance.
Our method achieves state-of-the-art performance with just 6 GPU hours of training, leading to a substantial 32-fold reduction in training costs.
- Score: 34.16241268436923
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a novel approach to computing vector road maps from
satellite remotely sensed images, building upon a well-defined Patched Line
Segment (PaLiS) representation for road graphs that holds geometric
significance. Unlike prevailing methods that derive road vector representations
from satellite images using binary masks or keypoints, our method employs line
segments. These segments not only convey road locations but also capture their
orientations, making them a robust choice for representation. More precisely,
given an input image, we divide it into non-overlapping patches and predict a
suitable line segment within each patch. This strategy enables us to capture
spatial and structural cues from these patch-based line segments, simplifying
the process of constructing the road network graph without the necessity of
additional neural networks for connectivity. In our experiments, we demonstrate
how an effective representation of a road graph significantly enhances the
performance of vector road mapping on established benchmarks, without requiring
extensive modifications to the neural network architecture. Furthermore, our
method achieves state-of-the-art performance with just 6 GPU hours of training,
leading to a substantial 32-fold reduction in training costs in terms of GPU
hours.
Related papers
- PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road
Extraction via Patch-Wise Keypoints Detection [12.145321599949236]
We propose a new scheme for multi-task satellite imagery road extraction, Patch-wise Road Keypoints Detection (PaRK-Detect)
Our framework predicts the position of patch-wise road keypoints and the adjacent relationships between them to construct road graphs in a single pass.
We evaluate our approach against the existing state-of-the-art methods on DeepGlobe, Massachusetts Roads, and RoadTracer datasets and achieve competitive or better results.
arXiv Detail & Related papers (2023-02-26T08:26:26Z) - RNGDet: Road Network Graph Detection by Transformer in Aerial Images [19.141279413414082]
Road network graphs provide critical information for autonomous vehicle applications.
manually annotating road network graphs is inefficient and labor-intensive.
We propose a novel approach based on transformer and imitation learning named RNGDet.
arXiv Detail & Related papers (2022-02-16T01:59:41Z) - Road Extraction from Overhead Images with Graph Neural Networks [18.649284163019516]
We propose a method that directly infers the final road graph in a single pass.
The key idea consists in combining a Fully Convolutional Network in charge of locating points of interest and a Graph Neural Network which predicts links between these points.
We evaluate our method against existing works on the popular RoadTracer dataset and achieve competitive results.
arXiv Detail & Related papers (2021-12-09T21:10:27Z) - SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and
Interaction Space Graph Reasoning for Autonomous Driving [64.10636296274168]
Road extraction is an essential step in building autonomous navigation systems.
Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between road segments in the image.
We propose a Spatial and Interaction Space Graph Reasoning (SPIN) module which when plugged into a ConvNet performs reasoning over graphs constructed on spatial and interaction spaces projected from the feature maps.
arXiv Detail & Related papers (2021-09-16T03:52:17Z) - Graph Representation Learning for Road Type Classification [13.227651826285014]
We present a learning-based approach to graph representations of road networks employing state-of-the-art graph convolutional neural networks.
Our approach is applied to realistic road networks of 17 cities from Open Street Map.
arXiv Detail & Related papers (2021-07-16T09:32:58Z) - Hierarchical Recurrent Attention Networks for Structured Online Maps [91.28820076955128]
We tackle the problem of online road network extraction from sparse 3D point clouds.
Our method is inspired by how annotator builds a lane graph, by first identifying how many lanes there are and then drawing each one in turn.
We develop a hierarchical recurrent network that attends to initial regions of a lane boundary and traces them out completely by outputting a structured polyline.
arXiv Detail & Related papers (2020-12-22T19:35:53Z) - Towards Efficient Scene Understanding via Squeeze Reasoning [71.1139549949694]
We propose a novel framework called Squeeze Reasoning.
Instead of propagating information on the spatial map, we first learn to squeeze the input feature into a channel-wise global vector.
We show that our approach can be modularized as an end-to-end trained block and can be easily plugged into existing networks.
arXiv Detail & Related papers (2020-11-06T12:17:01Z) - Scribble-based Weakly Supervised Deep Learning for Road Surface
Extraction from Remote Sensing Images [7.1577508803778045]
We propose a scribble-based weakly supervised road surface extraction method named ScRoadExtractor.
To propagate semantic information from sparse scribbles to unlabeled pixels, we introduce a road label propagation algorithm.
The proposal masks generated from the road label propagation algorithm are utilized to train a dual-branch encoder-decoder network.
arXiv Detail & Related papers (2020-10-25T12:40:30Z) - Learning Lane Graph Representations for Motion Forecasting [92.88572392790623]
We construct a lane graph from raw map data to preserve the map structure.
We exploit a fusion network consisting of four types of interactions, actor-to-lane, lane-to-lane, lane-to-actor and actor-to-actor.
Our approach significantly outperforms the state-of-the-art on the large scale Argoverse motion forecasting benchmark.
arXiv Detail & Related papers (2020-07-27T17:59:49Z) - VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized
Representation [74.56282712099274]
This paper introduces VectorNet, a hierarchical graph neural network that exploits the spatial locality of individual road components represented by vectors.
By operating on the vectorized high definition (HD) maps and agent trajectories, we avoid lossy rendering and computationally intensive ConvNet encoding steps.
We evaluate VectorNet on our in-house behavior prediction benchmark and the recently released Argoverse forecasting dataset.
arXiv Detail & Related papers (2020-05-08T19:07:03Z) - Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning [86.45526827323954]
Weakly-supervised semantic segmentation is a challenging task as no pixel-wise label information is provided for training.
We propose an iterative algorithm to learn such pairwise relations.
We show that the proposed algorithm performs favorably against the state-of-the-art methods.
arXiv Detail & Related papers (2020-02-19T10:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.