Detecting Lane and Road Markings at A Distance with Perspective
Transformer Layers
- URL: http://arxiv.org/abs/2003.08550v2
- Date: Sun, 25 Oct 2020 06:38:46 GMT
- Title: Detecting Lane and Road Markings at A Distance with Perspective
Transformer Layers
- Authors: Zhuoping Yu, Xiaozhou Ren, Yuyao Huang, Wei Tian, Junqiao Zhao
- Abstract summary: In existing approaches, the detection accuracy often degrades with the increasing distance.
This is due to the fact that distant lane and road markings occupy a small number of pixels in the image.
Inverse Perspective Mapping can be used to eliminate the perspective distortion, but the inherent can lead to artifacts.
- Score: 5.033948921121557
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate detection of lane and road markings is a task of great importance
for intelligent vehicles. In existing approaches, the detection accuracy often
degrades with the increasing distance. This is due to the fact that distant
lane and road markings occupy a small number of pixels in the image, and scales
of lane and road markings are inconsistent at various distances and
perspectives. The Inverse Perspective Mapping (IPM) can be used to eliminate
the perspective distortion, but the inherent interpolation can lead to
artifacts especially around distant lane and road markings and thus has a
negative impact on the accuracy of lane marking detection and segmentation. To
solve this problem, we adopt the Encoder-Decoder architecture in Fully
Convolutional Networks and leverage the idea of Spatial Transformer Networks to
introduce a novel semantic segmentation neural network. This approach
decomposes the IPM process into multiple consecutive differentiable homographic
transform layers, which are called "Perspective Transformer Layers".
Furthermore, the interpolated feature map is refined by subsequent
convolutional layers thus reducing the artifacts and improving the accuracy.
The effectiveness of the proposed method in lane marking detection is validated
on two public datasets: TuSimple and ApolloScape
Related papers
- Homography Guided Temporal Fusion for Road Line and Marking Segmentation [73.47092021519245]
Road lines and markings are frequently occluded in the presence of moving vehicles, shadow, and glare.
We propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues.
We show that exploiting available camera intrinsic data and ground plane assumption for cross-frame correspondence can lead to a light-weight network with significantly improved performances in speed and accuracy.
arXiv Detail & Related papers (2024-04-11T10:26:40Z) - Perspective Aware Road Obstacle Detection [104.57322421897769]
We show that road obstacle detection techniques ignore the fact that, in practice, the apparent size of the obstacles decreases as their distance to the vehicle increases.
We leverage this by computing a scale map encoding the apparent size of a hypothetical object at every image location.
We then leverage this perspective map to generate training data by injecting onto the road synthetic objects whose size corresponds to the perspective foreshortening.
arXiv Detail & Related papers (2022-10-04T17:48:42Z) - Pyramid Transformer for Traffic Sign Detection [1.933681537640272]
A novel Pyramid Transformer with locality mechanisms is proposed in this paper.
Specifically, Pyramid Transformer has several spatial pyramid reduction layers to shrink and embed the input image into tokens with rich multi-scale context.
The experiments are conducted on the German Traffic Sign Detection Benchmark (GTSDB)
arXiv Detail & Related papers (2022-07-13T09:21:19Z) - Network Traffic Anomaly Detection Method Based on Multi scale Residual
Feature [4.894147848840537]
An anomaly detection method based on mul-ti-scale residual features of network traffic is proposed.
The experimental results show that the detection performance of the proposed method for anomalous network traffic is sig-nificantly improved compared with traditional methods.
arXiv Detail & Related papers (2022-05-08T16:18:24Z) - Laneformer: Object-aware Row-Column Transformers for Lane Detection [96.62919884511287]
Laneformer is a transformer-based architecture tailored for lane detection in autonomous driving.
Inspired by recent advances of the transformer encoder-decoder architecture in various vision tasks, we move forwards to design a new end-to-end Laneformer architecture.
arXiv Detail & Related papers (2022-03-18T10:14:35Z) - LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution
Homography Estimation [52.63874513999119]
Cross-resolution image alignment is a key problem in multiscale giga photography.
Existing deep homography methods neglecting the explicit formulation of correspondences between them, which leads to degraded accuracy in cross-resolution challenges.
We propose a local transformer network embedded within a multiscale structure to explicitly learn correspondences between the multimodal inputs.
arXiv Detail & Related papers (2021-06-08T02:51:45Z) - LDNet: End-to-End Lane Marking Detection Approach Using a Dynamic Vision
Sensor [0.0]
This paper explores the novel application of lane marking detection using an event camera.
The spatial resolution of the encoded features is retained by a dense atrous spatial pyramid pooling block.
The efficacy of the proposed work is evaluated using the DVS dataset for lane extraction.
arXiv Detail & Related papers (2020-09-17T02:15:41Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery [43.91170581467171]
State-of-the-art object detection approaches have difficulties detecting dense, small targets with arbitrary orientation in large aerial images.
We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery.
arXiv Detail & Related papers (2020-05-28T19:57:34Z) - End-to-End Lane Marker Detection via Row-wise Classification [13.82948181492772]
In autonomous driving, detecting reliable and accurate lane marker positions is a crucial yet challenging task.
Conventional approaches for the lane marker detection problem perform a pixel-level dense prediction task followed by sophisticated post-processing.
We propose a method performing direct lane marker prediction in an end-to-end manner, without any post-processing step.
arXiv Detail & Related papers (2020-05-06T12:48:46Z) - Saliency Enhancement using Gradient Domain Edges Merging [65.90255950853674]
We develop a method to merge the edges with the saliency maps to improve the performance of the saliency.
This leads to our proposed saliency enhancement using edges (SEE) with an average improvement of at least 3.4 times higher on the DUT-OMRON dataset.
The SEE algorithm is split into 2 parts, SEE-Pre for preprocessing and SEE-Post pour postprocessing.
arXiv Detail & Related papers (2020-02-11T14:04:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.