Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless
Networks
- URL: http://arxiv.org/abs/2207.06131v1
- Date: Wed, 13 Jul 2022 11:28:02 GMT
- Title: Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless
Networks
- Authors: Riccardo Marini, Sangwoo Park, Osvaldo Simeone, Chiara Buratti
- Abstract summary: Unmanned aerial base stations (UABSs) can be deployed in vehicular wireless networks to support applications such as extended sensing via vehicle-to-everything (V2X) services.
A key problem in such systems is designing algorithms that can efficiently optimize the trajectory of the UABS in order to maximize coverage.
We propose the use of continual meta-RL as a means to transfer information from previously experienced traffic configurations to new conditions.
- Score: 29.89196067653312
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Unmanned aerial base stations (UABSs) can be deployed in vehicular wireless
networks to support applications such as extended sensing via
vehicle-to-everything (V2X) services. A key problem in such systems is
designing algorithms that can efficiently optimize the trajectory of the UABS
in order to maximize coverage. In existing solutions, such optimization is
carried out from scratch for any new traffic configuration, often by means of
conventional reinforcement learning (RL). In this paper, we propose the use of
continual meta-RL as a means to transfer information from previously
experienced traffic configurations to new conditions, with the goal of reducing
the time needed to optimize the UABS's policy. Adopting the Continual Meta
Policy Search (CoMPS) strategy, we demonstrate significant efficiency gains as
compared to conventional RL, as well as to naive transfer learning methods.
Related papers
- Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning [67.95280175998792]
A novel adversarial imitation learning (GAIL)-powered policy learning approach is proposed for optimizing beamforming, spectrum allocation, and remote user equipment (RUE) association ins.
We employ inverse RL (IRL) to automatically learn reward functions without manual tuning.
We show that the proposed MA-AL method outperforms traditional RL approaches, achieving a $14.6%$ improvement in convergence and reward value.
arXiv Detail & Related papers (2024-09-27T13:05:02Z) - UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning [79.16150966434299]
We formulate a UAV-enabled collaborative beamforming multi-objective optimization problem (UCBMOP) to maximize the transmission rate of the UVAA and minimize the energy consumption of all UAVs.
We use the heterogeneous-agent trust region policy optimization (HATRPO) as the basic framework, and then propose an improved HATRPO algorithm, namely HATRPO-UCB.
arXiv Detail & Related papers (2024-04-11T03:19:22Z) - Traffic Learning and Proactive UAV Trajectory Planning for Data Uplink
in Markovian IoT Models [6.49537221266081]
In IoT networks, the traditional resource management schemes rely on a message exchange between the devices and the base station.
We present a novel learning-based framework that estimates the traffic arrival of IoT devices based on Markovian events.
We propose a deep reinforcement learning approach to optimize the optimal policy of each UAV.
arXiv Detail & Related papers (2024-01-24T21:57:55Z) - A Hybrid Framework of Reinforcement Learning and Convex Optimization for
UAV-Based Autonomous Metaverse Data Collection [16.731929552692524]
This paper considers a UAV-assisted Metaverse network, in which UAVs extend the coverage of the base station (BS) to collect the Metaverse data generated at roadside units (RSUs)
To improve the data collection efficiency, resource allocation and trajectory control are integrated into the system model.
Based on the proposed UAV-assisted Metaverse network system model, we design a hybrid framework with reinforcement learning and convex optimization to cooperatively solve the time-sequential optimization problem.
arXiv Detail & Related papers (2023-05-29T11:49:20Z) - Muti-Agent Proximal Policy Optimization For Data Freshness in
UAV-assisted Networks [4.042622147977782]
We focus on the case where the collected data is time-sensitive, and it is critical to maintain its timeliness.
Our objective is to optimally design the UAVs' trajectories and the subsets of visited IoT devices such as the global Age-of-Updates (AoU) is minimized.
arXiv Detail & Related papers (2023-03-15T15:03:09Z) - Traffic Management of Autonomous Vehicles using Policy Based Deep
Reinforcement Learning and Intelligent Routing [0.26249027950824505]
We propose a DRL-based signal control system that adjusts traffic signals according to the current congestion situation on intersections.
To deal with the congestion on roads behind the intersection, we used re-routing technique to load balance the vehicles on road networks.
arXiv Detail & Related papers (2022-06-28T02:46:20Z) - Offline Contextual Bandits for Wireless Network Optimization [107.24086150482843]
In this paper, we investigate how to learn policies that can automatically adjust the configuration parameters of every cell in the network in response to the changes in the user demand.
Our solution combines existent methods for offline learning and adapts them in a principled way to overcome crucial challenges arising in this context.
arXiv Detail & Related papers (2021-11-11T11:31:20Z) - Transferable Deep Reinforcement Learning Framework for Autonomous
Vehicles with Joint Radar-Data Communications [69.24726496448713]
We propose an intelligent optimization framework based on the Markov Decision Process (MDP) to help the AV make optimal decisions.
We then develop an effective learning algorithm leveraging recent advances of deep reinforcement learning techniques to find the optimal policy for the AV.
We show that the proposed transferable deep reinforcement learning framework reduces the obstacle miss detection probability by the AV up to 67% compared to other conventional deep reinforcement learning approaches.
arXiv Detail & Related papers (2021-05-28T08:45:37Z) - Traffic Flow Estimation using LTE Radio Frequency Counters and Machine
Learning [0.0]
We present a novel method for traffic flow estimation using standardized LTE/4G radio frequency performance measurement counters.
We show that our approach benefits from applying transfer learning to generalize the solution not only in time but also in space.
arXiv Detail & Related papers (2021-01-22T15:05:10Z) - Meta-Reinforcement Learning for Trajectory Design in Wireless UAV
Networks [151.65541208130995]
A drone base station (DBS) is dispatched to provide uplink connectivity to ground users whose demand is dynamic and unpredictable.
In this case, the DBS's trajectory must be adaptively adjusted to satisfy the dynamic user access requests.
A meta-learning algorithm is proposed in order to adapt the DBS's trajectory when it encounters novel environments.
arXiv Detail & Related papers (2020-05-25T20:43:59Z) - Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep
Reinforcement Learning Approach [88.45509934702913]
We design a navigation policy for multiple unmanned aerial vehicles (UAVs) where mobile base stations (BSs) are deployed.
We incorporate different contextual information such as energy and age of information (AoI) constraints to ensure the data freshness at the ground BS.
By applying the proposed trained model, an effective real-time trajectory policy for the UAV-BSs captures the observable network states over time.
arXiv Detail & Related papers (2020-02-21T07:29:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.