Distributed Multi-agent Meta Learning for Trajectory Design in Wireless
Drone Networks
- URL: http://arxiv.org/abs/2012.03158v1
- Date: Sun, 6 Dec 2020 01:30:12 GMT
- Title: Distributed Multi-agent Meta Learning for Trajectory Design in Wireless
Drone Networks
- Authors: Ye Hu, Mingzhe Chen, Walid Saad, H. Vincent Poor, and Shuguang Cui
- Abstract summary: This paper studies the problem of the trajectory design for a group of energyconstrained drones operating in dynamic wireless network environments.
A value based reinforcement learning (VDRL) solution and a metatraining mechanism is proposed.
- Score: 151.27147513363502
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, the problem of the trajectory design for a group of
energy-constrained drones operating in dynamic wireless network environments is
studied. In the considered model, a team of drone base stations (DBSs) is
dispatched to cooperatively serve clusters of ground users that have dynamic
and unpredictable uplink access demands. In this scenario, the DBSs must
cooperatively navigate in the considered area to maximize coverage of the
dynamic requests of the ground users. This trajectory design problem is posed
as an optimization framework whose goal is to find optimal trajectories that
maximize the fraction of users served by all DBSs. To find an optimal solution
for this non-convex optimization problem under unpredictable environments, a
value decomposition based reinforcement learning (VDRL) solution coupled with a
meta-training mechanism is proposed. This algorithm allows the DBSs to
dynamically learn their trajectories while generalizing their learning to
unseen environments. Analytical results show that, the proposed VD-RL algorithm
is guaranteed to converge to a local optimal solution of the non-convex
optimization problem. Simulation results show that, even without meta-training,
the proposed VD-RL algorithm can achieve a 53.2% improvement of the service
coverage and a 30.6% improvement in terms of the convergence speed, compared to
baseline multi-agent algorithms. Meanwhile, the use of meta-learning improves
the convergence speed of the VD-RL algorithm by up to 53.8% when the DBSs must
deal with a previously unseen task.
Related papers
- Meta-Learning Based Optimization for Large Scale Wireless Systems [45.025621137165025]
It is known that the limitation of conventional optimization algorithms in the literature often increases with the number of transmit antennas and communication users in wireless system.
This paper proposes an unsupervised meta-learning based approach to perform non-diaconfigurable optimization at significantly reduced complexity.
arXiv Detail & Related papers (2024-07-01T21:45:27Z) - Personalized Federated Deep Reinforcement Learning-based Trajectory
Optimization for Multi-UAV Assisted Edge Computing [22.09756306579992]
UAVs can serve as intelligent servers in edge computing environments, optimizing their flight trajectories to maximize communication system throughput.
Deep reinforcement learning (DRL)-based trajectory optimization algorithms may suffer from poor training performance due to intricate terrain features and inadequate training data.
This work proposes a novel solution, namely personalized federated deep reinforcement learning (PF-DRL), for multi-UAV trajectory optimization.
arXiv Detail & Related papers (2023-09-05T12:54:40Z) - A Hybrid Framework of Reinforcement Learning and Convex Optimization for
UAV-Based Autonomous Metaverse Data Collection [16.731929552692524]
This paper considers a UAV-assisted Metaverse network, in which UAVs extend the coverage of the base station (BS) to collect the Metaverse data generated at roadside units (RSUs)
To improve the data collection efficiency, resource allocation and trajectory control are integrated into the system model.
Based on the proposed UAV-assisted Metaverse network system model, we design a hybrid framework with reinforcement learning and convex optimization to cooperatively solve the time-sequential optimization problem.
arXiv Detail & Related papers (2023-05-29T11:49:20Z) - Fast and computationally efficient generative adversarial network
algorithm for unmanned aerial vehicle-based network coverage optimization [1.2853186701496802]
The challenge of dynamic traffic demand in mobile networks is tackled by moving cells based on unmanned aerial vehicles.
Considering the tremendous potential of unmanned aerial vehicles in the future, we propose a new algorithm for coverage optimization.
The proposed algorithm is implemented based on a conditional generative adversarial neural network, with a unique multilayer sum-pooling loss function.
arXiv Detail & Related papers (2022-03-25T12:13:21Z) - Reinforcement Learning for Datacenter Congestion Control [50.225885814524304]
Successful congestion control algorithms can dramatically improve latency and overall network throughput.
Until today, no such learning-based algorithms have shown practical potential in this domain.
We devise an RL-based algorithm with the aim of generalizing to different configurations of real-world datacenter networks.
We show that this scheme outperforms alternative popular RL approaches, and generalizes to scenarios that were not seen during training.
arXiv Detail & Related papers (2021-02-18T13:49:28Z) - Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T20:22:05Z) - Meta-Reinforcement Learning for Trajectory Design in Wireless UAV
Networks [151.65541208130995]
A drone base station (DBS) is dispatched to provide uplink connectivity to ground users whose demand is dynamic and unpredictable.
In this case, the DBS's trajectory must be adaptively adjusted to satisfy the dynamic user access requests.
A meta-learning algorithm is proposed in order to adapt the DBS's trajectory when it encounters novel environments.
arXiv Detail & Related papers (2020-05-25T20:43:59Z) - Optimization-driven Deep Reinforcement Learning for Robust Beamforming
in IRS-assisted Wireless Communications [54.610318402371185]
Intelligent reflecting surface (IRS) is a promising technology to assist downlink information transmissions from a multi-antenna access point (AP) to a receiver.
We minimize the AP's transmit power by a joint optimization of the AP's active beamforming and the IRS's passive beamforming.
We propose a deep reinforcement learning (DRL) approach that can adapt the beamforming strategies from past experiences.
arXiv Detail & Related papers (2020-05-25T01:42:55Z) - Reinforcement Learning Based Vehicle-cell Association Algorithm for
Highly Mobile Millimeter Wave Communication [53.47785498477648]
This paper investigates the problem of vehicle-cell association in millimeter wave (mmWave) communication networks.
We first formulate the user state (VU) problem as a discrete non-vehicle association optimization problem.
The proposed solution achieves up to 15% gains in terms sum of user complexity and 20% reduction in VUE compared to several baseline designs.
arXiv Detail & Related papers (2020-01-22T08:51:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.