Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable
Semantic Segmentation
- URL: http://arxiv.org/abs/2104.09254v1
- Date: Mon, 19 Apr 2021 12:51:06 GMT
- Title: Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable
Semantic Segmentation
- Authors: Linara Adilova, Elena Schulz, Maram Akila, Sebastian Houben, Jan David
Schneider, Fabian Hueger, Tim Wirtz
- Abstract summary: We propose to use a partly human-designed, partly learned set of rules to describe relations between objects of a traffic scene on a high level of abstraction.
In doing so, we improve and robustify existing deep neural networks consuming low-level sensor information.
- Score: 0.7696728525672148
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data-driven sensor interpretation in autonomous driving can lead to highly
implausible predictions as can most of the time be verified with common-sense
knowledge. However, learning common knowledge only from data is hard and
approaches for knowledge integration are an active research area. We propose to
use a partly human-designed, partly learned set of rules to describe relations
between objects of a traffic scene on a high level of abstraction. In doing so,
we improve and robustify existing deep neural networks consuming low-level
sensor information. We present an initial study adapting the well-established
Probabilistic Soft Logic (PSL) framework to validate and improve on the problem
of semantic segmentation. We describe in detail how we integrate common
knowledge into the segmentation pipeline using PSL and verify our approach in a
set of experiments demonstrating the increase in robustness against several
severe image distortions applied to the A2D2 autonomous driving data set.
Related papers
- Federated Adversarial Learning for Robust Autonomous Landing Runway Detection [6.029462194041386]
In this paper, we propose a federated adversarial learning-based framework to detect landing runways.
To the best of our knowledge, this marks the first instance of federated learning work that address the adversarial sample problem in landing runway detection.
arXiv Detail & Related papers (2024-06-22T19:31:52Z) - LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces [1.1137304094345333]
We design an effective prompting framework for Large Language Models (LLMs) on high-level reasoning tasks.
We also design two strategies to enhance performance with long sensor traces, including summarization before reasoning and selective inclusion of historical traces.
Our framework can be implemented in an edge-cloud setup, running small LLMs on the edge for data summarization and performing high-level reasoning on the cloud for privacy preservation.
arXiv Detail & Related papers (2024-03-28T22:06:04Z) - Cognitive TransFuser: Semantics-guided Transformer-based Sensor Fusion
for Improved Waypoint Prediction [38.971222477695214]
RGB-LIDAR-based multi-task feature fusion network, coined Cognitive TransFuser, augments and exceeds the baseline network by a significant margin for safer and more complete road navigation.
We validate the proposed network on the Town05 Short and Town05 Long Benchmark through extensive experiments, achieving up to 44.2 FPS real-time inference time.
arXiv Detail & Related papers (2023-08-04T03:59:10Z) - ALSO: Automotive Lidar Self-supervision by Occupancy estimation [70.70557577874155]
We propose a new self-supervised method for pre-training the backbone of deep perception models operating on point clouds.
The core idea is to train the model on a pretext task which is the reconstruction of the surface on which the 3D points are sampled.
The intuition is that if the network is able to reconstruct the scene surface, given only sparse input points, then it probably also captures some fragments of semantic information.
arXiv Detail & Related papers (2022-12-12T13:10:19Z) - Federated Deep Learning Meets Autonomous Vehicle Perception: Design and
Verification [168.67190934250868]
Federated learning empowered connected autonomous vehicle (FLCAV) has been proposed.
FLCAV preserves privacy while reducing communication and annotation costs.
It is challenging to determine the network resources and road sensor poses for multi-stage training.
arXiv Detail & Related papers (2022-06-03T23:55:45Z) - Neurosymbolic hybrid approach to driver collision warning [64.02492460600905]
There are two main algorithmic approaches to autonomous driving systems.
Deep learning alone has achieved state-of-the-art results in many areas.
But sometimes it can be very difficult to debug if the deep learning model doesn't work.
arXiv Detail & Related papers (2022-03-28T20:29:50Z) - Triggering Failures: Out-Of-Distribution detection by learning from
local adversarial attacks in Semantic Segmentation [76.2621758731288]
We tackle the detection of out-of-distribution (OOD) objects in semantic segmentation.
Our main contribution is a new OOD detection architecture called ObsNet associated with a dedicated training scheme based on Local Adversarial Attacks (LAA)
We show it obtains top performances both in speed and accuracy when compared to ten recent methods of the literature on three different datasets.
arXiv Detail & Related papers (2021-08-03T17:09:56Z) - End-to-End Intersection Handling using Multi-Agent Deep Reinforcement
Learning [63.56464608571663]
Navigating through intersections is one of the main challenging tasks for an autonomous vehicle.
In this work, we focus on the implementation of a system able to navigate through intersections where only traffic signs are provided.
We propose a multi-agent system using a continuous, model-free Deep Reinforcement Learning algorithm used to train a neural network for predicting both the acceleration and the steering angle at each time step.
arXiv Detail & Related papers (2021-04-28T07:54:40Z) - DeepWORD: A GCN-based Approach for Owner-Member Relationship Detection
in Autonomous Driving [2.895229237964064]
We propose an innovative relationship prediction method, namely DeepWORD, by designing a graph convolution network (GCN)
Specifically, we utilize the feature maps with local correlation as the input of nodes to improve the information richness.
We establish an annotated owner-member relationship dataset called WORD as a large-scale benchmark, which will be available soon.
arXiv Detail & Related papers (2021-03-30T06:12:29Z) - Learning a Domain-Agnostic Visual Representation for Autonomous Driving
via Contrastive Loss [25.798361683744684]
Domain-Agnostic Contrastive Learning (DACL) is a two-stage unsupervised domain adaptation framework with cyclic adversarial training and contrastive loss.
Our proposed approach achieves better performance in the monocular depth estimation task compared to previous state-of-the-art methods.
arXiv Detail & Related papers (2021-03-10T07:06:03Z) - Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision
Action Recognition [131.6328804788164]
We propose a framework, named Semantics-aware Adaptive Knowledge Distillation Networks (SAKDN), to enhance action recognition in vision-sensor modality (videos)
The SAKDN uses multiple wearable-sensors as teacher modalities and uses RGB videos as student modality.
arXiv Detail & Related papers (2020-09-01T03:38:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.