Penalty-Based Imitation Learning With Cross Semantics Generation Sensor
Fusion for Autonomous Driving
- URL: http://arxiv.org/abs/2303.11888v4
- Date: Thu, 26 Oct 2023 13:09:24 GMT
- Title: Penalty-Based Imitation Learning With Cross Semantics Generation Sensor
Fusion for Autonomous Driving
- Authors: Hongkuan Zhou, Aifen Sui, Letian Shi, and Yinxian Li
- Abstract summary: In this paper, we provide a penalty-based imitation learning approach to integrate multiple modalities of information.
We observe a remarkable increase in the driving score by more than 12% when compared to the state-of-the-art (SOTA) model, InterFuser.
Our model achieves this performance enhancement while achieving a 7-fold increase in inference speed and reducing the model size by approximately 30%.
- Score: 1.2749527861829049
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In recent times, there has been a growing focus on end-to-end autonomous
driving technologies. This technology involves the replacement of the entire
driving pipeline with a single neural network, which has a simpler structure
and faster inference time. However, while this approach reduces the number of
components in the driving pipeline, it also presents challenges related to
interpretability and safety. For instance, the trained policy may not always
comply with traffic rules, and it is difficult to determine the reason for such
misbehavior due to the lack of intermediate outputs. Additionally, the
successful implementation of autonomous driving technology heavily depends on
the reliable and expedient processing of sensory data to accurately perceive
the surrounding environment. In this paper, we provide penalty-based imitation
learning approach combined with cross semantics generation sensor fusion
technologies (P-CSG) to efficiently integrate multiple modalities of
information and enable the autonomous agent to effectively adhere to traffic
regulations. Our model undergoes evaluation within the Town 05 Long benchmark,
where we observe a remarkable increase in the driving score by more than 12%
when compared to the state-of-the-art (SOTA) model, InterFuser. Notably, our
model achieves this performance enhancement while achieving a 7-fold increase
in inference speed and reducing the model size by approximately 30%. For more
detailed information, including code-based resources, they can be found at
https://hk-zh.github.io/p-csg/
Related papers
- RainSD: Rain Style Diversification Module for Image Synthesis
Enhancement using Feature-Level Style Distribution [5.500457283114346]
This paper presents a synthetic road dataset with sensor blockage generated from real road dataset BDD100K.
Using this dataset, the degradation of diverse multi-task networks for autonomous driving has been thoroughly evaluated and analyzed.
The tendency of the performance degradation of deep neural network-based perception systems for autonomous vehicle has been analyzed in depth.
arXiv Detail & Related papers (2023-12-31T11:30:42Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - What Matters to Enhance Traffic Rule Compliance of Imitation Learning for End-to-End Autonomous Driving [10.191916541924813]
We proposed P-CSG, a penalty-based imitation learning approach with contrastive-based cross semantics generation sensor fusion technologies.
In this paper, we introduce three penalties - red light, stop sign, and curvature speed penalty to make the agent more sensitive to traffic rules.
We conducted robustness evaluations against adversarial attacks like FGSM and Dot attacks, revealing a substantial increase in robustness compared to other baseline models.
arXiv Detail & Related papers (2023-09-14T15:54:56Z) - Integrated Decision and Control for High-Level Automated Vehicles by
Mixed Policy Gradient and Its Experiment Verification [10.393343763237452]
This paper presents a self-evolving decision-making system based on the Integrated Decision and Control (IDC)
An RL algorithm called constrained mixed policy gradient (CMPG) is proposed to consistently upgrade the driving policy of the IDC.
Experiment results show that boosting by data, the system can achieve better driving ability over model-based methods.
arXiv Detail & Related papers (2022-10-19T14:58:41Z) - Exploring Contextual Representation and Multi-Modality for End-to-End
Autonomous Driving [58.879758550901364]
Recent perception systems enhance spatial understanding with sensor fusion but often lack full environmental context.
We introduce a framework that integrates three cameras to emulate the human field of view, coupled with top-down bird-eye-view semantic data to enhance contextual representation.
Our method achieves displacement error by 0.67m in open-loop settings, surpassing current methods by 6.9% on the nuScenes dataset.
arXiv Detail & Related papers (2022-10-13T05:56:20Z) - YOLOPv2: Better, Faster, Stronger for Panoptic Driving Perception [1.6683976936678229]
Multi-tasking learning approaches have achieved promising results in solving panoptic driving perception problems.
This paper proposed an effective and efficient multi-task learning network to simultaneously perform the task of traffic object detection, drivable road area segmentation and lane detection.
Our model achieved the new state-of-the-art (SOTA) performance in terms of accuracy and speed on the challenging BDD100K dataset.
arXiv Detail & Related papers (2022-08-24T11:00:27Z) - Learning energy-efficient driving behaviors by imitating experts [75.12960180185105]
This paper examines the role of imitation learning in bridging the gap between control strategies and realistic limitations in communication and sensing.
We show that imitation learning can succeed in deriving policies that, if adopted by 5% of vehicles, may boost the energy-efficiency of networks with varying traffic conditions by 15% using only local observations.
arXiv Detail & Related papers (2022-06-28T17:08:31Z) - COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked
Vehicles [54.61668577827041]
We introduce COOPERNAUT, an end-to-end learning model that uses cross-vehicle perception for vision-based cooperative driving.
Our experiments on AutoCastSim suggest that our cooperative perception driving models lead to a 40% improvement in average success rate.
arXiv Detail & Related papers (2022-05-04T17:55:12Z) - End-to-End Intersection Handling using Multi-Agent Deep Reinforcement
Learning [63.56464608571663]
Navigating through intersections is one of the main challenging tasks for an autonomous vehicle.
In this work, we focus on the implementation of a system able to navigate through intersections where only traffic signs are provided.
We propose a multi-agent system using a continuous, model-free Deep Reinforcement Learning algorithm used to train a neural network for predicting both the acceleration and the steering angle at each time step.
arXiv Detail & Related papers (2021-04-28T07:54:40Z) - Multi-Modal Fusion Transformer for End-to-End Autonomous Driving [59.60483620730437]
We propose TransFuser, a novel Multi-Modal Fusion Transformer, to integrate image and LiDAR representations using attention.
Our approach achieves state-of-the-art driving performance while reducing collisions by 76% compared to geometry-based fusion.
arXiv Detail & Related papers (2021-04-19T11:48:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.