PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road
Extraction via Patch-Wise Keypoints Detection
- URL: http://arxiv.org/abs/2302.13263v1
- Date: Sun, 26 Feb 2023 08:26:26 GMT
- Title: PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road
Extraction via Patch-Wise Keypoints Detection
- Authors: Shenwei Xie, Wanfeng Zheng, Zhenglin Xian, Junli Yang, Chuang Zhang,
Ming Wu
- Abstract summary: We propose a new scheme for multi-task satellite imagery road extraction, Patch-wise Road Keypoints Detection (PaRK-Detect)
Our framework predicts the position of patch-wise road keypoints and the adjacent relationships between them to construct road graphs in a single pass.
We evaluate our approach against the existing state-of-the-art methods on DeepGlobe, Massachusetts Roads, and RoadTracer datasets and achieve competitive or better results.
- Score: 12.145321599949236
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatically extracting roads from satellite imagery is a fundamental yet
challenging computer vision task in the field of remote sensing. Pixel-wise
semantic segmentation-based approaches and graph-based approaches are two
prevailing schemes. However, prior works show the imperfections that semantic
segmentation-based approaches yield road graphs with low connectivity, while
graph-based methods with iterative exploring paradigms and smaller receptive
fields focus more on local information and are also time-consuming. In this
paper, we propose a new scheme for multi-task satellite imagery road
extraction, Patch-wise Road Keypoints Detection (PaRK-Detect). Building on top
of D-LinkNet architecture and adopting the structure of keypoint detection, our
framework predicts the position of patch-wise road keypoints and the adjacent
relationships between them to construct road graphs in a single pass.
Meanwhile, the multi-task framework also performs pixel-wise semantic
segmentation and generates road segmentation masks. We evaluate our approach
against the existing state-of-the-art methods on DeepGlobe, Massachusetts
Roads, and RoadTracer datasets and achieve competitive or better results. We
also demonstrate a considerable outperformance in terms of inference speed.
Related papers
- Patched Line Segment Learning for Vector Road Mapping [34.16241268436923]
We build upon a well-defined Patched Line Segment representation for road graphs that holds geometric significance.
Our method achieves state-of-the-art performance with just 6 GPU hours of training, leading to a substantial 32-fold reduction in training costs.
arXiv Detail & Related papers (2023-09-06T11:33:25Z) - SparseTrack: Multi-Object Tracking by Performing Scene Decomposition
based on Pseudo-Depth [84.64121608109087]
We propose a pseudo-depth estimation method for obtaining the relative depth of targets from 2D images.
Secondly, we design a depth cascading matching (DCM) algorithm, which can use the obtained depth information to convert a dense target set into multiple sparse target subsets.
By integrating the pseudo-depth method and the DCM strategy into the data association process, we propose a new tracker, called SparseTrack.
arXiv Detail & Related papers (2023-06-08T14:36:10Z) - RNGDet: Road Network Graph Detection by Transformer in Aerial Images [19.141279413414082]
Road network graphs provide critical information for autonomous vehicle applications.
manually annotating road network graphs is inefficient and labor-intensive.
We propose a novel approach based on transformer and imitation learning named RNGDet.
arXiv Detail & Related papers (2022-02-16T01:59:41Z) - Road Extraction from Overhead Images with Graph Neural Networks [18.649284163019516]
We propose a method that directly infers the final road graph in a single pass.
The key idea consists in combining a Fully Convolutional Network in charge of locating points of interest and a Graph Neural Network which predicts links between these points.
We evaluate our method against existing works on the popular RoadTracer dataset and achieve competitive results.
arXiv Detail & Related papers (2021-12-09T21:10:27Z) - Modelling Neighbor Relation in Joint Space-Time Graph for Video
Correspondence Learning [53.74240452117145]
This paper presents a self-supervised method for learning reliable visual correspondence from unlabeled videos.
We formulate the correspondence as finding paths in a joint space-time graph, where nodes are grid patches sampled from frames, and are linked by two types of edges.
Our learned representation outperforms the state-of-the-art self-supervised methods on a variety of visual tasks.
arXiv Detail & Related papers (2021-09-28T05:40:01Z) - SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and
Interaction Space Graph Reasoning for Autonomous Driving [64.10636296274168]
Road extraction is an essential step in building autonomous navigation systems.
Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between road segments in the image.
We propose a Spatial and Interaction Space Graph Reasoning (SPIN) module which when plugged into a ConvNet performs reasoning over graphs constructed on spatial and interaction spaces projected from the feature maps.
arXiv Detail & Related papers (2021-09-16T03:52:17Z) - OpenPifPaf: Composite Fields for Semantic Keypoint Detection and
Spatio-Temporal Association [90.39247595214998]
Image-based perception tasks can be formulated as detecting, associating and semantic keypoints, e.g. human body pose estimation and tracking.
We present a general framework that jointly detects semantic andtemporal keypoint associations in a single stage.
We also show that our method generalizes to any class of keypoints such as car and animal parts to provide a holistic perception framework.
arXiv Detail & Related papers (2021-03-03T14:44:14Z) - Spatiotemporal Graph Neural Network based Mask Reconstruction for Video
Object Segmentation [70.97625552643493]
This paper addresses the task of segmenting class-agnostic objects in semi-supervised setting.
We propose a novel graph neuralS network (TG-Net) which captures the local contexts by utilizing all proposals.
arXiv Detail & Related papers (2020-12-10T07:57:44Z) - Scribble-based Weakly Supervised Deep Learning for Road Surface
Extraction from Remote Sensing Images [7.1577508803778045]
We propose a scribble-based weakly supervised road surface extraction method named ScRoadExtractor.
To propagate semantic information from sparse scribbles to unlabeled pixels, we introduce a road label propagation algorithm.
The proposal masks generated from the road label propagation algorithm are utilized to train a dual-branch encoder-decoder network.
arXiv Detail & Related papers (2020-10-25T12:40:30Z) - PP-LinkNet: Improving Semantic Segmentation of High Resolution Satellite
Imagery with Multi-stage Training [4.694536172504848]
Road network and building footprint extraction is essential for many applications such as updating maps, traffic regulations, city planning, ride-hailing, disaster response textitetc.
arXiv Detail & Related papers (2020-10-14T10:23:48Z) - Tracking Road Users using Constraint Programming [79.32806233778511]
We present a constraint programming (CP) approach for the data association phase found in the tracking-by-detection paradigm of the multiple object tracking (MOT) problem.
Our proposed method was tested on a motorized vehicles tracking dataset and produces results that outperform the top methods of the UA-DETRAC benchmark.
arXiv Detail & Related papers (2020-03-10T00:04:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.