Related papers: OffRoadTranSeg: Semi-Supervised Segmentation using Transformers on OffRoad environments

OffRoadTranSeg: Semi-Supervised Segmentation using Transformers on OffRoad environments

URL: http://arxiv.org/abs/2106.13963v1
Date: Sat, 26 Jun 2021 08:05:09 GMT
Title: OffRoadTranSeg: Semi-Supervised Segmentation using Transformers on OffRoad environments
Authors: Anukriti Singh, Kartikeya Singh, and P.B. Sujit
Abstract summary: We present OffRoadTranSeg, the first end-to-end framework for semi-supervised segmentation in unstructured outdoor environment. The proposed method is validated on RELLIS-3D and RUGD offroad datasets.
Score: 0.0
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: We present OffRoadTranSeg, the first end-to-end framework for semi-supervised segmentation in unstructured outdoor environment using transformers and automatic data selection for labelling. The offroad segmentation is a scene understanding approach that is widely used in autonomous driving. The popular offroad segmentation method is to use fully connected convolution layers and large labelled data, however, due to class imbalance, there will be several mismatches and also some classes may not be detected. Our approach is to do the task of offroad segmentation in a semi-supervised manner. The aim is to provide a model where self supervised vision transformer is used to fine-tune offroad datasets with self-supervised data collection for labelling using depth estimation. The proposed method is validated on RELLIS-3D and RUGD offroad datasets. The experiments show that OffRoadTranSeg outperformed other state of the art models, and also solves the RELLIS-3D class imbalance problem.

Related papers

Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction [9.833728353188132]
We release WildRoad, a global off-road road network dataset constructed efficiently with a dedicated interactive annotation tool.<n>We introduce MaGRoad, a path-centric framework that aggregates multi-scale visual evidence along candidate paths to infer connectivity robustly.<n>MaGRoad achieves state-of-the-art performance on our challenging WildRoad benchmark while generalizing well to urban datasets.
arXiv Detail & Related papers (2025-12-11T08:29:27Z)
Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry [2.8647133890966994]
This paper presents a fully unsupervised approach for binary road segmentation (road vs. non-road)<n>The method leverages scene geometry and temporal cues to distinguish road from non-road regions.<n>On the Cityscapes dataset, the model achieves an Intersection-over-Union (IoU) of 0.82, demonstrating high accuracy with a simple design.
arXiv Detail & Related papers (2025-10-19T10:59:43Z)
COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation [49.267650162344765]
COARSE is a semi-supervised domain adaptation framework for off-road semantic segmentation. We bridge domain gaps with complementary pixel-level and patch-level decoders, enhanced by a collaborative pseudo-labeling strategy.
arXiv Detail & Related papers (2025-03-05T22:25:54Z)
Efficient Data Representation for Motion Forecasting: A Scene-Specific Trajectory Set Approach [12.335528093380631]
This study introduces a novel approach for generating scene-specific trajectory sets tailored to different contexts. A deterministic goal sampling algorithm identifies relevant map regions, while our Recursive In-Distribution Subsampling (RIDS) method enhances trajectory plausibility. Experiments on the Argoverse 2 dataset demonstrate that our method achieves up to a 10% improvement in Driving Area Compliance.
arXiv Detail & Related papers (2024-07-30T11:06:39Z)
Homography Guided Temporal Fusion for Road Line and Marking Segmentation [73.47092021519245]
Road lines and markings are frequently occluded in the presence of moving vehicles, shadow, and glare. We propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues. We show that exploiting available camera intrinsic data and ground plane assumption for cross-frame correspondence can lead to a light-weight network with significantly improved performances in speed and accuracy.
arXiv Detail & Related papers (2024-04-11T10:26:40Z)
Leveraging Topology for Domain Adaptive Road Segmentation in Satellite and Aerial Imagery [9.23555285827483]
Road segmentation algorithms fail to generalize to new geographical locations. Road skeleton is an auxiliary task to impose the topological constraints. For self-training, we filter out the noisy pseudo-labels by using a connectivity-based pseudo-labels refinement strategy.
arXiv Detail & Related papers (2023-09-27T12:50:51Z)
OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping [84.65114565766596]
We present OpenLane-V2, the first dataset on topology reasoning for traffic scene structure. OpenLane-V2 consists of 2,000 annotated road scenes that describe traffic elements and their correlation to the lanes. We evaluate various state-of-the-art methods, and present their quantitative and qualitative results on OpenLane-V2 to indicate future avenues for investigating topology reasoning in traffic scenes.
arXiv Detail & Related papers (2023-04-20T16:31:22Z)
Unsupervised Adaptation from Repeated Traversals for Autonomous Driving [54.59577283226982]
Self-driving cars must generalize to the end-user's environment to operate reliably. One potential solution is to leverage unlabeled data collected from the end-users' environments. There is no reliable signal in the target domain to supervise the adaptation process. We show that this simple additional assumption is sufficient to obtain a potent signal that allows us to perform iterative self-training of 3D object detectors on the target domain.
arXiv Detail & Related papers (2023-03-27T15:07:55Z)
Leveraging Road Area Semantic Segmentation with Auxiliary Steering Task [0.0]
We propose a CNN-based method that can leverage the steering wheel angle information to improve the road area semantic segmentation. We demonstrate the effectiveness of the proposed approach on two challenging data sets for autonomous driving.
arXiv Detail & Related papers (2022-12-19T13:25:09Z)
Weakly Supervised Training of Monocular 3D Object Detectors Using Wide Baseline Multi-view Traffic Camera Data [19.63193201107591]
7DoF prediction of vehicles at an intersection is an important task for assessing potential conflicts between road users. We develop an approach using a weakly supervised method of fine tuning 3D object detectors for traffic observation cameras. Our method achieves vehicle 7DoF pose prediction accuracy on our dataset comparable to the top performing monocular 3D object detectors on autonomous vehicle datasets.
arXiv Detail & Related papers (2021-10-21T08:26:48Z)
One Million Scenes for Autonomous Driving: ONCE Dataset [91.94189514073354]
We introduce the ONCE dataset for 3D object detection in the autonomous driving scenario. The data is selected from 144 driving hours, which is 20x longer than the largest 3D autonomous driving dataset available. We reproduce and evaluate a variety of self-supervised and semi-supervised methods on the ONCE dataset.
arXiv Detail & Related papers (2021-06-21T12:28:08Z)
GANav: Group-wise Attention Network for Classifying Navigable Regions in Unstructured Outdoor Environments [54.21959527308051]
We present a new learning-based method for identifying safe and navigable regions in off-road terrains and unstructured environments from RGB images. Our approach consists of classifying groups of terrain classes based on their navigability levels using coarse-grained semantic segmentation. We show through extensive evaluations on the RUGD and RELLIS-3D datasets that our learning algorithm improves the accuracy of visual perception in off-road terrains for navigation.
arXiv Detail & Related papers (2021-03-07T02:16:24Z)
Fusion of neural networks, for LIDAR-based evidential road mapping [3.065376455397363]
We introduce RoadSeg, a new convolutional architecture that is optimized for road detection in LIDAR scans. RoadSeg is used to classify individual LIDAR points as either belonging to the road, or not. We thus secondly present an evidential road mapping algorithm, that fuses consecutive road detection results.
arXiv Detail & Related papers (2021-02-05T18:14:36Z)
Detecting 32 Pedestrian Attributes for Autonomous Vehicles [103.87351701138554]
In this paper, we address the problem of jointly detecting pedestrians and recognizing 32 pedestrian attributes. We introduce a Multi-Task Learning (MTL) model relying on a composite field framework, which achieves both goals in an efficient way. We show competitive detection and attribute recognition results, as well as a more stable MTL training.
arXiv Detail & Related papers (2020-12-04T15:10:12Z)
Self-Supervised Drivable Area and Road Anomaly Segmentation using RGB-D Data for Robotic Wheelchairs [26.110522390201094]
We develop a pipeline that can automatically generate segmentation labels for drivable areas and road anomalies. Our proposed automatic labeling pipeline achieves an impressive speed-up compared to manual labeling. Our proposed self-supervised approach exhibits more robust and accurate results than the state-of-the-art traditional algorithms.
arXiv Detail & Related papers (2020-07-12T10:12:46Z)

This list is automatically generated from the titles and abstracts of the papers in this site.