Multi-Agent Deep Reinforcement Learning for Dynamic Avatar Migration in
AIoT-enabled Vehicular Metaverses with Trajectory Prediction
- URL: http://arxiv.org/abs/2306.14683v1
- Date: Mon, 26 Jun 2023 13:27:11 GMT
- Title: Multi-Agent Deep Reinforcement Learning for Dynamic Avatar Migration in
AIoT-enabled Vehicular Metaverses with Trajectory Prediction
- Authors: Junlong Chen, Jiawen Kang, Minrui Xu, Zehui Xiong, Dusit Niyato, Chuan
Chen, Abbas Jamalipour, and Shengli Xie
- Abstract summary: We propose a model to predict the future trajectories of intelligent vehicles based on their historical data.
We show that our proposed algorithm can effectively reduce the latency of executing avatar tasks by around 25% without prediction and around 30% with prediction.
- Score: 70.9337170201739
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Avatars, as promising digital assistants in Vehicular Metaverses, can enable
drivers and passengers to immerse themselves in 3D virtual spaces, serving as a practical
emerging example of Artificial Intelligence of Things (AIoT) in intelligent
vehicular environments. The immersive experience is achieved through seamless
human-avatar interaction, e.g., augmented reality navigation, which requires
intensive resources that are inefficient and impractical to process on
intelligent vehicles locally. Fortunately, offloading avatar tasks to RoadSide
Units (RSUs) or cloud servers for remote execution can effectively reduce
resource consumption. However, the high mobility of vehicles, the dynamic
workload of RSUs, and the heterogeneity of RSUs pose novel challenges to making
avatar migration decisions. To address these challenges, in this paper, we
propose a dynamic migration framework for avatar tasks based on real-time
trajectory prediction and Multi-Agent Deep Reinforcement Learning (MADRL).
Specifically, we propose a model to predict the future trajectories of
intelligent vehicles based on their historical data, indicating the future
workloads of RSUs. Based on the expected workloads of RSUs, we formulate the
avatar task migration problem as a long-term mixed integer programming problem.
To tackle this problem efficiently, we transform it into a Partially Observable
Markov Decision Process (POMDP) and solve it with multiple DRL agents taking
hybrid continuous and discrete actions in a decentralized manner. Numerical
results demonstrate that our proposed algorithm can effectively reduce the
latency of executing avatar tasks by around 25% without prediction and 30% with
prediction, and enhance user immersive experiences in the AIoT-enabled
Vehicular Metaverse (AeVeM).
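To make the prediction step concrete, below is a minimal PyTorch sketch: an LSTM encodes a vehicle's historical 2-D positions and a linear head decodes a short horizon of future positions, which are then binned into RSU coverage zones to estimate upcoming workloads. The architecture, hyperparameters, and the `rsu_of` coverage lookup are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class TrajectoryPredictor(nn.Module):
    """Hypothetical seq2seq-style predictor: past (x, y) points -> future points."""

    def __init__(self, hidden_dim: int = 64, horizon: int = 5):
        super().__init__()
        self.horizon = horizon
        self.encoder = nn.LSTM(input_size=2, hidden_size=hidden_dim, batch_first=True)
        # One linear head emits all `horizon` future (x, y) positions at once.
        self.head = nn.Linear(hidden_dim, 2 * horizon)

    def forward(self, history: torch.Tensor) -> torch.Tensor:
        # history: (batch, T, 2) past positions -> (batch, horizon, 2) predictions
        _, (h, _) = self.encoder(history)
        return self.head(h[-1]).view(-1, self.horizon, 2)

def estimate_rsu_workloads(predicted: torch.Tensor, rsu_of) -> dict:
    """Bin predicted positions into RSU coverage zones to estimate future load.

    `rsu_of(x, y)` is an assumed coverage-lookup callable returning an RSU id.
    """
    workloads: dict = {}
    for x, y in predicted.reshape(-1, 2).tolist():
        rsu = rsu_of(x, y)
        workloads[rsu] = workloads.get(rsu, 0) + 1
    return workloads
```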
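Likewise, the hybrid continuous/discrete action space can be sketched as a two-headed policy: a categorical head picks the target RSU for avatar migration while a bounded continuous head allocates a resource fraction. This is a generic sketch of the hybrid-action idea under assumed observation and action definitions, not the authors' exact network or training algorithm.

```python
import torch
import torch.nn as nn

class HybridActor(nn.Module):
    """Two-headed policy: discrete migration target + continuous resource fraction."""

    def __init__(self, obs_dim: int, num_rsus: int, hidden: int = 128):
        super().__init__()
        self.backbone = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.rsu_logits = nn.Linear(hidden, num_rsus)  # discrete: which RSU to migrate to
        self.alloc = nn.Linear(hidden, 1)              # continuous: resource share in (0, 1)

    def forward(self, obs: torch.Tensor):
        z = self.backbone(obs)
        rsu_dist = torch.distributions.Categorical(logits=self.rsu_logits(z))
        target = rsu_dist.sample()                            # sampled migration decision
        fraction = torch.sigmoid(self.alloc(z)).squeeze(-1)   # bounded allocation
        return target, fraction, rsu_dist.log_prob(target)
```

In a decentralized setup like the one described above, each vehicle would run its own actor on local observations; critics, replay buffers, and reward design are omitted from this sketch.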
Related papers
- DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach [16.139481340656552]
This paper presents a joint approach that combines multi-agent reinforcement learning (MARL) and generative diffusion models (GDM).
In the second stage, we introduce a novel DNN task assignment algorithm, termed GDM-MADDPG, which utilizes the reverse denoising process of GDM to replace the actor network in multi-agent deep deterministic policy gradient (MADDPG).
Simulation results indicate that our algorithm performs favorably compared to benchmarks in terms of path planning, Age of Information (AoI), energy consumption, and task load balancing.
arXiv Detail & Related papers (2024-11-13T02:41:02Z)
- ImDy: Human Inverse Dynamics from Imitated Observations [47.994797555884325]
Inverse dynamics (ID) aims at reproducing the driven torques from human kinematic observations (a toy worked example appears after this list).
Conventional optimization-based ID requires expensive laboratory setups, restricting its availability.
We propose to exploit recent advances in human motion imitation algorithms to learn human inverse dynamics in a data-driven manner.
arXiv Detail & Related papers (2024-10-23T07:06:08Z)
- Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models [60.87795376541144]
A world model is a neural network capable of predicting an agent's next state given past states and actions (a minimal sketch appears after this list).
During end-to-end training, our policy learns how to recover from errors by aligning with states observed in human demonstrations.
We present qualitative and quantitative results, demonstrating significant improvements upon prior state of the art in closed-loop testing.
arXiv Detail & Related papers (2024-09-25T06:48:25Z)
- Bridging Language, Vision and Action: Multimodal VAEs in Robotic Manipulation Tasks [0.0]
In this work, we focus on unsupervised vision-language-action mapping in the area of robotic manipulation.
We propose a model-invariant training alternative that improves the models' performance in a simulator by up to 55%.
Our work thus also sheds light on the potential benefits and limitations of using the current multimodal VAEs for unsupervised learning of robotic motion trajectories.
arXiv Detail & Related papers (2024-04-02T13:25:16Z)
- HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving [2.807187711407621]
We propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework.
We first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM).
In this paradigm, the human expert serves as a mentor to the AI agent, while the agent is guided to minimize traffic flow disturbance.
arXiv Detail & Related papers (2024-01-06T08:30:14Z)
- Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling [96.31941517446859]
We propose PPGeo (Policy Pre-training via Geometric modeling), an intuitive and straightforward fully self-supervised framework curated for the policy pretraining in visuomotor driving.
We aim at learning policy representations as a powerful abstraction by modeling 3D geometric scenes on large-scale unlabeled and uncalibrated YouTube driving videos.
In the first stage, the geometric modeling framework generates pose and depth predictions simultaneously, with two consecutive frames as input.
In the second stage, the visual encoder learns driving policy representation by predicting the future ego-motion and optimizing with the photometric error based on current visual observation only.
arXiv Detail & Related papers (2023-01-03T08:52:49Z)
- Multitask Adaptation by Retrospective Exploration with Learned World Models [77.34726150561087]
We propose a meta-learned addressing model called RAMa that provides training samples for the MBRL agent taken from task-agnostic storage.
The model is trained to maximize the expected agent's performance by selecting promising trajectories solving prior tasks from the storage.
arXiv Detail & Related papers (2021-10-25T20:02:57Z)
- Hierarchical Few-Shot Imitation with Skill Transition Models [66.81252581083199]
Few-shot Imitation with Skill Transition Models (FIST) is an algorithm that extracts skills from offline data and utilizes them to generalize to unseen tasks.
We show that FIST is capable of generalizing to new tasks and substantially outperforms prior baselines in navigation experiments.
arXiv Detail & Related papers (2021-07-19T15:56:01Z)
- Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data Augmentation [77.60050239225086]
We propose an effective training data generation process by fitting a 3D car model with dynamic parts to vehicles in real images.
Our approach is fully automatic without any human interaction.
We present a multi-task network for VUS parsing and a multi-stream network for VHI parsing.
arXiv Detail & Related papers (2020-12-15T03:03:38Z)
- Deep Surrogate Q-Learning for Autonomous Driving [17.30342128504405]
We propose Surrogate Q-learning for learning lane-change behavior for autonomous driving.
We show that the architecture leads to a novel replay sampling technique we call Scene-centric Experience Replay.
We also show that our methods enhance real-world applicability of RL systems by learning policies on the real highD dataset.
arXiv Detail & Related papers (2020-10-21T19:49:06Z)
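For the ImDy entry above, a toy worked example of inverse dynamics: for a 1-DoF point-mass pendulum, the driving torque can be recovered from observed kinematics alone via tau = m*l^2*theta_ddot + m*g*l*sin(theta). This is a textbook illustration of the ID problem only; the mass, length, and finite-difference scheme are assumptions, and the cited paper instead learns ID from imitated observations.

```python
import numpy as np

def pendulum_inverse_dynamics(theta: np.ndarray, dt: float,
                              m: float = 1.0, l: float = 0.5,
                              g: float = 9.81) -> np.ndarray:
    """Recover joint torque from observed angles of a 1-DoF point-mass pendulum.

    tau = m * l^2 * theta_ddot + m * g * l * sin(theta)
    """
    theta_ddot = np.gradient(np.gradient(theta, dt), dt)  # finite-difference acceleration
    return m * l**2 * theta_ddot + m * g * l * np.sin(theta)
```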
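For the covariate-shift entry above, the one-sentence definition of a world model can be made concrete with a minimal deterministic sketch that only illustrates the (state, action) -> next-state contract; the cited paper uses a latent-space generative world model rather than this toy regressor.

```python
import torch
import torch.nn as nn

class WorldModel(nn.Module):
    """Minimal deterministic world model: (state, action) -> predicted next state."""

    def __init__(self, state_dim: int, action_dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, state_dim),
        )

    def forward(self, state: torch.Tensor, action: torch.Tensor) -> torch.Tensor:
        # Concatenate state and action, regress the next state directly.
        return self.net(torch.cat([state, action], dim=-1))
```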