Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading
- URL: http://arxiv.org/abs/2010.09094v1
- Date: Sun, 18 Oct 2020 20:22:05 GMT
- Title: Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading
- Authors: Ruikang Zhong, Xiao Liu, Yuanwei Liu and Yue Chen
- Abstract summary: A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
- Score: 59.32570888309133
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A novel framework is proposed for cellular offloading with the aid of
multiple unmanned aerial vehicles (UAVs), while the non-orthogonal multiple
access (NOMA) technique is employed at each UAV to further improve the spectrum
efficiency of the wireless network. The optimization problem of joint
three-dimensional (3D) trajectory design and power allocation is formulated for
maximizing the throughput. Since ground mobile users are considered as roaming
continuously, the UAVs need to be re-deployed timely based on the movement of
users. In an effort to solve this pertinent dynamic problem, a K-means based
clustering algorithm is first adopted for periodically partitioning users.
Afterward, a mutual deep Q-network (MDQN) algorithm is proposed to jointly
determine the optimal 3D trajectory and power allocation of UAVs. In contrast
to the conventional DQN algorithm, the MDQN algorithm enables the experience of
multi-agent to be input into a shared neural network to shorten the training
time with the assistance of state abstraction. Numerical results demonstrate
that: 1) the proposed MDQN algorithm is capable of converging under minor
constraints and has a faster convergence rate than the conventional DQN
algorithm in the multi-agent case; 2) The achievable sum rate of the NOMA
enhanced UAV network is 23% superior to the case of orthogonal multiple access
(OMA); 3) By designing the optimal 3D trajectory of UAVs with the aid of the
MDON algorithm, the sum rate of the network enjoys 142% and 56% gains than that
of invoking the circular trajectory and the 2D trajectory, respectively.
Related papers
- Joint User Association, Interference Cancellation and Power Control for
Multi-IRS Assisted UAV Communications [80.35959154762381]
Intelligent reflecting surface (IRS)-assisted unmanned aerial vehicle (UAV) communications are expected to alleviate the load of ground base stations in a cost-effective way.
Existing studies mainly focus on the deployment and resource allocation of a single IRS instead of multiple IRSs.
We propose a new optimization algorithm for joint IRS-user association, trajectory optimization of UAVs, successive interference cancellation (SIC) decoding order scheduling and power allocation.
arXiv Detail & Related papers (2023-12-08T01:57:10Z) - Multi-Agent Reinforcement Learning with Action Masking for UAV-enabled
Mobile Communications [1.3053649021965603]
Unmanned Aerial Vehicles (UAVs) are increasingly used as aerial base stations to provide ad hoc communications infrastructure.
This paper focuses on the use of multiple UAVs for providing wireless communication to mobile users in the absence of terrestrial communications infrastructure.
We jointly optimize UAV 3D trajectory and NOMA power allocation to maximize system throughput.
arXiv Detail & Related papers (2023-03-29T14:41:03Z) - Optimization for Master-UAV-powered Auxiliary-Aerial-IRS-assisted IoT
Networks: An Option-based Multi-agent Hierarchical Deep Reinforcement
Learning Approach [56.84948632954274]
This paper investigates a master unmanned aerial vehicle (MUAV)-powered Internet of Things (IoT) network.
We propose using a rechargeable auxiliary UAV (AUAV) equipped with an intelligent reflecting surface (IRS) to enhance the communication signals from the MUAV.
Under the proposed model, we investigate the optimal collaboration strategy of these energy-limited UAVs to maximize the accumulated throughput of the IoT network.
arXiv Detail & Related papers (2021-12-20T15:45:28Z) - Trajectory Design for UAV-Based Internet-of-Things Data Collection: A
Deep Reinforcement Learning Approach [93.67588414950656]
In this paper, we investigate an unmanned aerial vehicle (UAV)-assisted Internet-of-Things (IoT) system in a 3D environment.
We present a TD3-based trajectory design for completion time minimization (TD3-TDCTM) algorithm.
Our simulation results show the superiority of the proposed TD3-TDCTM algorithm over three conventional non-learning based baseline methods.
arXiv Detail & Related papers (2021-07-23T03:33:29Z) - Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement
Learning [1.2330326247154968]
Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks.
In this paper, we aim to find collision-free paths for multiple cellular-connected UAVs.
We propose an offline temporal difference (TD) learning algorithm with online signal-to-interference-plus-noise ratio mapping to solve the problem.
arXiv Detail & Related papers (2021-04-09T16:52:33Z) - NOMA in UAV-aided cellular offloading: A machine learning approach [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T17:38:48Z) - Communication-Efficient Distributed Stochastic AUC Maximization with
Deep Neural Networks [50.42141893913188]
We study a distributed variable for large-scale AUC for a neural network as with a deep neural network.
Our model requires a much less number of communication rounds and still a number of communication rounds in theory.
Our experiments on several datasets show the effectiveness of our theory and also confirm our theory.
arXiv Detail & Related papers (2020-05-05T18:08:23Z) - Machine Learning for Predictive Deployment of UAVs with Multiple Access [37.49465317156625]
In this paper, a machine learning deployment framework of unmanned aerial vehicles (UAVs) is studied.
Due to time-varying traffic distribution, a long short-term memory (LSTM) based prediction is introduced to predict the future cellular traffic.
The proposed method can reduce up to 24% of the total power consumption compared to the conventional method.
arXiv Detail & Related papers (2020-03-02T00:15:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.