Collaborative Intelligent Reflecting Surface Networks with Multi-Agent
Reinforcement Learning
- URL: http://arxiv.org/abs/2203.14152v1
- Date: Sat, 26 Mar 2022 20:37:14 GMT
- Title: Collaborative Intelligent Reflecting Surface Networks with Multi-Agent
Reinforcement Learning
- Authors: Jie Zhang, Jun Li, Yijin Zhang, Qingqing Wu, Xiongwei Wu, Feng Shu,
Shi Jin, Wen Chen
- Abstract summary: Intelligent reflecting surface (IRS) is envisioned to be widely applied in future wireless networks.
In this paper, we investigate a multi-user communication system assisted by cooperative IRS devices with the capability of energy harvesting.
- Score: 63.83425382922157
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Intelligent reflecting surface (IRS) is envisioned to be widely applied in
future wireless networks. In this paper, we investigate a multi-user
communication system assisted by cooperative IRS devices with the capability of
energy harvesting. Aiming to maximize the long-term average achievable system
rate, an optimization problem is formulated by jointly designing the transmit
beamforming at the base station (BS) and discrete phase shift beamforming at
the IRSs, with the constraints on transmit power, user data rate requirement
and IRS energy buffer size. Considering time-varying channels and stochastic
arrivals of energy harvested by the IRSs, we first formulate the problem as a
Markov decision process (MDP) and then develop a novel multi-agent Q-mix (MAQ)
framework with two layers to decouple the optimization parameters. The higher
layer is for optimizing phase shift resolutions, and the lower one is for phase
shift beamforming and power allocation. Since the phase shift optimization is
an integer programming problem with a large-scale action space, we improve MAQ
by incorporating the Wolpertinger method, namely, MAQ-WP algorithm to achieve a
sub-optimality with reduced dimensions of action space. In addition, as MAQ-WP
is still of high complexity to achieve good performance, we propose a policy
gradient-based MAQ algorithm, namely, MAQ-PG, by mapping the discrete phase
shift actions into a continuous space at the cost of a slight performance loss.
Simulation results demonstrate that the proposed MAQ-WP and MAQ-PG algorithms
can converge faster and achieve data rate improvements of 10.7% and 8.8% over
the conventional multi-agent DDPG, respectively.
Related papers
- Smart energy management: process structure-based hybrid neural networks for optimal scheduling and economic predictive control in integrated systems [2.723192806018494]
Integrated energy systems (IESs) are complex systems consisting of diverse operating units spanning multiple domains.
We propose a physics-informed hybrid time-series neural network (NN) surrogate to predict the dynamic performance of IESs across multiple time scales.
arXiv Detail & Related papers (2024-10-07T04:24:39Z) - Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility
Cooperation [14.606459096293088]
We propose a quantum MARL (QMARL) algorithm based on the concept of actor-critic network.
Our QMARL is beneficial in terms of efficient parameter utilization and fast convergence due to quantum supremacy.
An additional technique for scalability is proposed, which is called projection value measure (PVM)
arXiv Detail & Related papers (2023-08-03T03:29:25Z) - Active RIS-aided EH-NOMA Networks: A Deep Reinforcement Learning
Approach [66.53364438507208]
An active reconfigurable intelligent surface (RIS)-aided multi-user downlink communication system is investigated.
Non-orthogonal multiple access (NOMA) is employed to improve spectral efficiency, and the active RIS is powered by energy harvesting (EH)
An advanced LSTM based algorithm is developed to predict users' dynamic communication state.
A DDPG based algorithm is proposed to joint control the amplification matrix and phase shift matrix RIS.
arXiv Detail & Related papers (2023-04-11T13:16:28Z) - Energy Efficiency Maximization in IRS-Aided Cell-Free Massive MIMO
System [2.9081408997650375]
In this paper, we consider an intelligent reflecting surface (IRS)-aided cell-free massive multiple-input multiple-output system, where the beamforming at access points and the phase shifts at IRSs are jointly optimized to maximize energy efficiency (EE)
To solve EE problem, we propose an iterative optimization algorithm by using quadratic transform and Lagrangian dual transform to find the optimum beamforming and phase shifts.
We further propose a deep learning based approach for joint beamforming and phase shifts design. Specifically, a two-stage deep neural network is trained offline using the unsupervised learning manner, which is then deployed online for
arXiv Detail & Related papers (2022-12-24T14:58:15Z) - Intelligent Trajectory Design for RIS-NOMA aided Multi-robot
Communications [59.34642007625687]
The goal is to maximize the sum-rate of whole trajectories for multi-robot system by jointly optimizing trajectories and NOMA decoding orders of robots.
An integrated machine learning (ML) scheme is proposed, which combines long short-term memory (LSTM)-autoregressive integrated moving average (ARIMA) model and dueling double deep Q-network (D$3$QN) algorithm.
arXiv Detail & Related papers (2022-05-03T17:14:47Z) - Federated Learning for Energy-limited Wireless Networks: A Partial Model
Aggregation Approach [79.59560136273917]
limited communication resources, bandwidth and energy, and data heterogeneity across devices are main bottlenecks for federated learning (FL)
We first devise a novel FL framework with partial model aggregation (PMA)
The proposed PMA-FL improves 2.72% and 11.6% accuracy on two typical heterogeneous datasets.
arXiv Detail & Related papers (2022-04-20T19:09:52Z) - A Robust Deep Learning-Based Beamforming Design for RIS-assisted
Multiuser MISO Communications with Practical Constraints [4.727307803726522]
We consider a RIS-aided multiuser multiple-input single-output downlink communication system.
We develop a deep quantization neural network (DQNN) to simultaneously design the active and passive beamforming.
We extend the two proposed DQNN-based algorithms to the case that the discrete phase shifts and imperfect CSI are considered simultaneously.
arXiv Detail & Related papers (2021-11-12T03:53:20Z) - Reconfigurable Intelligent Surface Assisted Mobile Edge Computing with
Heterogeneous Learning Tasks [53.1636151439562]
Mobile edge computing (MEC) provides a natural platform for AI applications.
We present an infrastructure to perform machine learning tasks at an MEC with the assistance of a reconfigurable intelligent surface (RIS)
Specifically, we minimize the learning error of all participating users by jointly optimizing transmit power of mobile users, beamforming vectors of the base station, and the phase-shift matrix of the RIS.
arXiv Detail & Related papers (2020-12-25T07:08:50Z) - Federated Learning via Intelligent Reflecting Surface [30.935389187215474]
Over-the-air computation algorithm (AirComp) based learning (FL) is capable of achieving fast model aggregation by exploiting the waveform superposition property of multiple access channels.
In this paper, we propose a two-step optimization framework to achieve fast yet reliable model aggregation for AirComp-based FL.
Simulation results will demonstrate that our proposed framework and the deployment of an IRS can achieve a lower training loss and higher FL prediction accuracy than the baseline algorithms.
arXiv Detail & Related papers (2020-11-10T11:29:57Z) - Iterative Algorithm Induced Deep-Unfolding Neural Networks: Precoding
Design for Multiuser MIMO Systems [59.804810122136345]
We propose a framework for deep-unfolding, where a general form of iterative algorithm induced deep-unfolding neural network (IAIDNN) is developed.
An efficient IAIDNN based on the structure of the classic weighted minimum mean-square error (WMMSE) iterative algorithm is developed.
We show that the proposed IAIDNN efficiently achieves the performance of the iterative WMMSE algorithm with reduced computational complexity.
arXiv Detail & Related papers (2020-06-15T02:57:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.