Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry
- URL: http://arxiv.org/abs/2510.16790v1
- Date: Sun, 19 Oct 2025 10:59:43 GMT
- Title: Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry
- Authors: Sara Hatami Rostami, Behrooz Nasihatkon
- Abstract summary: This paper presents a fully unsupervised approach for binary road segmentation (road vs. non-road). The method leverages scene geometry and temporal cues to distinguish road from non-road regions. On the Cityscapes dataset, the model achieves an Intersection-over-Union (IoU) of 0.82, demonstrating high accuracy with a simple design.
- Score: 2.8647133890966994
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper presents a fully unsupervised approach for binary road segmentation (road vs. non-road), eliminating the reliance on costly manually labeled datasets. The method leverages scene geometry and temporal cues to distinguish road from non-road regions. Weak labels are first generated from geometric priors, marking pixels above the horizon as non-road and a predefined quadrilateral in front of the vehicle as road. In a refinement stage, temporal consistency is enforced by tracking local feature points across frames and penalizing inconsistent label assignments using mutual information maximization. This enhances both precision and temporal stability. On the Cityscapes dataset, the model achieves an Intersection-over-Union (IoU) of 0.82, demonstrating high accuracy with a simple design. These findings demonstrate the potential of combining geometric constraints and temporal consistency for scalable unsupervised road segmentation in autonomous driving.
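The weak-label step described in the abstract (pixels above the horizon marked as non-road, a predefined quadrilateral in front of the vehicle marked as road) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the label codes, the fixed horizon row, and the choice to leave all remaining pixels unlabeled are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (x, y) in pixel coordinates, y grows downward

# Hypothetical label codes; the source does not specify an encoding.
NON_ROAD, ROAD, UNKNOWN = 0, 1, -1


def inside_convex_quad(x: float, y: float, quad: Sequence[Point]) -> bool:
    """Return True if (x, y) lies inside or on a convex quadrilateral.

    Vertices must be given in a consistent winding order. The point is
    inside when all non-zero edge cross products share the same sign.
    """
    signs = []
    for i in range(4):
        x0, y0 = quad[i]
        x1, y1 = quad[(i + 1) % 4]
        cross = (x1 - x0) * (y - y0) - (y1 - y0) * (x - x0)
        if cross != 0:
            signs.append(cross > 0)
    return all(signs) or not any(signs)


def weak_labels(height: int, width: int, horizon_row: int,
                quad: Sequence[Point]) -> List[List[int]]:
    """Build a weak label map from two geometric priors.

    Rows above `horizon_row` become NON_ROAD, pixels inside `quad`
    become ROAD, and everything else stays UNKNOWN (an assumption).
    """
    labels = [[UNKNOWN] * width for _ in range(height)]
    for row in range(height):
        for col in range(width):
            if row < horizon_row:
                labels[row][col] = NON_ROAD
            elif inside_convex_quad(col, row, quad):
                labels[row][col] = ROAD
    return labels


if __name__ == "__main__":
    # Toy 6x8 image: horizon at row 2, trapezoid near the bottom center.
    demo = weak_labels(6, 8, 2, [(2, 5), (5, 5), (4, 3), (3, 3)])
    for line in demo:
        print(" ".join(f"{v:2d}" for v in line))
```

In a real pipeline the horizon row and the quadrilateral would depend on the camera mounting and calibration, which the abstract does not detail. The unlabeled pixels are the ones a later refinement stage would have to resolve.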
Related papers
- Road Obstacle Video Segmentation [71.92123495914892]
We demonstrate that the road-obstacle segmentation task is inherently temporal, since the segmentation maps for consecutive frames are strongly correlated. Our approach establishes a new state-of-the-art in road-obstacle video segmentation for long-range video sequences.
arXiv Detail & Related papers (2025-09-16T15:34:43Z) - InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method [10.561470037080177]
We present a novel LiDAR-based method for online vehicle-centric intersection localization. We detect intersection candidates in a bird's eye view (BEV) representation formed by concatenating semantic road scans. Experiments on the SemanticKITTI dataset show that our method outperforms the latest learning-based baseline in accuracy and reliability.
arXiv Detail & Related papers (2025-05-01T13:30:28Z) - Neural Semantic Map-Learning for Autonomous Vehicles [85.8425492858912]
We present a mapping system that fuses local submaps gathered from a fleet of vehicles at a central instance to produce a coherent map of the road environment.
Our method jointly aligns and merges the noisy and incomplete local submaps using a scene-specific Neural Signed Distance Field.
We leverage memory-efficient sparse feature-grids to scale to large areas and introduce a confidence score to model uncertainty in scene reconstruction.
arXiv Detail & Related papers (2024-10-10T10:10:03Z) - Homography Guided Temporal Fusion for Road Line and Marking Segmentation [73.47092021519245]
Road lines and markings are frequently occluded in the presence of moving vehicles, shadow, and glare.
We propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues.
We show that exploiting available camera intrinsic data and ground plane assumption for cross-frame correspondence can lead to a light-weight network with significantly improved performances in speed and accuracy.
arXiv Detail & Related papers (2024-04-11T10:26:40Z) - Leveraging Topology for Domain Adaptive Road Segmentation in Satellite and Aerial Imagery [9.23555285827483]
Road segmentation algorithms fail to generalize to new geographical locations.
Road skeleton is an auxiliary task to impose the topological constraints.
For self-training, we filter out the noisy pseudo-labels by using a connectivity-based pseudo-labels refinement strategy.
arXiv Detail & Related papers (2023-09-27T12:50:51Z) - Leveraging Road Area Semantic Segmentation with Auxiliary Steering Task [0.0]
We propose a CNN-based method that can leverage the steering wheel angle information to improve the road area semantic segmentation.
We demonstrate the effectiveness of the proposed approach on two challenging data sets for autonomous driving.
arXiv Detail & Related papers (2022-12-19T13:25:09Z) - Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion [51.11295961195151]
We exploit the characteristics of the foggy image sequence of driving scenes to densify the confident pseudo labels.
Based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion scheme.
Our scheme helps the adaptive model achieve 51.92% and 53.84% mean intersection-over-union (mIoU) on two publicly available natural foggy datasets.
arXiv Detail & Related papers (2022-06-10T05:16:50Z) - HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding [76.9165845362574]
We propose a backbone modelling the driving scene as a heterogeneous graph with different types of nodes and edges.
For spatial relation encoding, the coordinates of the node as well as its in-edges are in the local node-centric coordinate system.
Experimental results show that HDGT achieves state-of-the-art performance for the task of trajectory prediction.
arXiv Detail & Related papers (2022-04-30T07:08:30Z) - OffRoadTranSeg: Semi-Supervised Segmentation using Transformers on OffRoad environments [0.0]
We present OffRoadTranSeg, the first end-to-end framework for semi-supervised segmentation in unstructured outdoor environment.
The proposed method is validated on RELLIS-3D and RUGD offroad datasets.
arXiv Detail & Related papers (2021-06-26T08:05:09Z) - Detecting 32 Pedestrian Attributes for Autonomous Vehicles [103.87351701138554]
In this paper, we address the problem of jointly detecting pedestrians and recognizing 32 pedestrian attributes.
We introduce a Multi-Task Learning (MTL) model relying on a composite field framework, which achieves both goals in an efficient way.
We show competitive detection and attribute recognition results, as well as a more stable MTL training.
arXiv Detail & Related papers (2020-12-04T15:10:12Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle-to-curb distance in real time with a mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.