Adapting to Dynamic LEO-B5G Systems: Meta-Critic Learning Based
Efficient Resource Scheduling
- URL: http://arxiv.org/abs/2110.06787v1
- Date: Wed, 13 Oct 2021 15:21:38 GMT
- Title: Adapting to Dynamic LEO-B5G Systems: Meta-Critic Learning Based
Efficient Resource Scheduling
- Authors: Yaxiong Yuan, Lei lei, Thang X. Vu, Zheng Chang, Symeon Chatzinotas,
Sumei Sun
- Abstract summary: We address two practical issues for an over-loaded LEO-terrestrial system.
The first challenge is how to efficiently schedule resources to serve the massive number of connected users.
The second challenge is how to make the algorithmic solution more resilient in adapting to dynamic wireless environments.
- Score: 38.733584547351796
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Low earth orbit (LEO) satellite-assisted communications have been considered
as one of key elements in beyond 5G systems to provide wide coverage and
cost-efficient data services. Such dynamic space-terrestrial topologies impose
exponential increase in the degrees of freedom in network management. In this
paper, we address two practical issues for an over-loaded LEO-terrestrial
system. The first challenge is how to efficiently schedule resources to serve
the massive number of connected users, such that more data and users can be
delivered/served. The second challenge is how to make the algorithmic solution
more resilient in adapting to dynamic wireless environments.To address them, we
first propose an iterative suboptimal algorithm to provide an offline
benchmark. To adapt to unforeseen variations, we propose an enhanced
meta-critic learning algorithm (EMCL), where a hybrid neural network for
parameterization and the Wolpertinger policy for action mapping are designed in
EMCL. The results demonstrate EMCL's effectiveness and fast-response
capabilities in over-loaded systems and in adapting to dynamic environments
compare to previous actor-critic and meta-learning methods.
Related papers
- Doubly-Dynamic ISAC Precoding for Vehicular Networks: A Constrained Deep Reinforcement Learning (CDRL) Approach [11.770137653756697]
Integrated sensing and communication (ISAC) technology is essential for supporting vehicular networks.
The communication channel in this scenario exhibits time variations, and the potential targets may move rapidly, resulting in double dynamics.
We propose using constrained deep reinforcement learning to facilitate dynamic updates to the ISAC precoder.
arXiv Detail & Related papers (2024-05-23T09:19:14Z) - Context-Aware Orchestration of Energy-Efficient Gossip Learning Schemes [8.382766344930157]
We present a distributed training approach based on the combination of Gossip Learning with adaptive optimization of the learning process.
We propose a data-driven approach to OGL management that relies on optimizing in real-time for each node.
Results suggest that our approach is highly efficient and effective in a broad spectrum of network scenarios.
arXiv Detail & Related papers (2024-04-18T09:17:46Z) - Hybrid-Task Meta-Learning: A Graph Neural Network Approach for Scalable and Transferable Bandwidth Allocation [46.342827102556896]
We develop a deep learning-based bandwidth allocation policy that is scalable with the number of users and transferable to different communication scenarios.
To support scalability, the bandwidth allocation policy is represented by a graph neural network (GNN)
We develop a hybrid-task meta-learning algorithm that trains the initial parameters of the GNN with different communication scenarios.
arXiv Detail & Related papers (2023-12-23T04:25:12Z) - Adaptive Resource Allocation for Virtualized Base Stations in O-RAN with
Online Learning [60.17407932691429]
Open Radio Access Network systems, with their base stations (vBSs), offer operators the benefits of increased flexibility, reduced costs, vendor diversity, and interoperability.
We propose an online learning algorithm that balances the effective throughput and vBS energy consumption, even under unforeseeable and "challenging'' environments.
We prove the proposed solutions achieve sub-linear regret, providing zero average optimality gap even in challenging environments.
arXiv Detail & Related papers (2023-09-04T17:30:21Z) - Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna
Tuning [60.94661435297309]
The scale of mobile networks makes it challenging to optimize antenna parameters using manual intervention or hand-engineered strategies.
We propose a new multi-agent reinforcement learning algorithm to optimize mobile network configurations globally.
We empirically demonstrate the performance of the algorithm on an antenna tilt tuning problem and a joint tilt and power control problem in a simulated environment.
arXiv Detail & Related papers (2023-01-20T17:06:34Z) - Learning to Continuously Optimize Wireless Resource in a Dynamic
Environment: A Bilevel Optimization Perspective [52.497514255040514]
This work develops a new approach that enables data-driven methods to continuously learn and optimize resource allocation strategies in a dynamic environment.
We propose to build the notion of continual learning into wireless system design, so that the learning model can incrementally adapt to the new episodes.
Our design is based on a novel bilevel optimization formulation which ensures certain fairness" across different data samples.
arXiv Detail & Related papers (2021-05-03T07:23:39Z) - Deep Learning-based Resource Allocation For Device-to-Device
Communication [66.74874646973593]
We propose a framework for the optimization of the resource allocation in multi-channel cellular systems with device-to-device (D2D) communication.
A deep learning (DL) framework is proposed, where the optimal resource allocation strategy for arbitrary channel conditions is approximated by deep neural network (DNN) models.
Our simulation results confirm that near-optimal performance can be attained with low time, which underlines the real-time capability of the proposed scheme.
arXiv Detail & Related papers (2020-11-25T14:19:23Z) - Meta-Reinforcement Learning for Trajectory Design in Wireless UAV
Networks [151.65541208130995]
A drone base station (DBS) is dispatched to provide uplink connectivity to ground users whose demand is dynamic and unpredictable.
In this case, the DBS's trajectory must be adaptively adjusted to satisfy the dynamic user access requests.
A meta-learning algorithm is proposed in order to adapt the DBS's trajectory when it encounters novel environments.
arXiv Detail & Related papers (2020-05-25T20:43:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.