Dynamic Resource Allocation for Metaverse Applications with Deep
Reinforcement Learning
- URL: http://arxiv.org/abs/2302.13445v1
- Date: Mon, 27 Feb 2023 00:30:01 GMT
- Title: Dynamic Resource Allocation for Metaverse Applications with Deep
Reinforcement Learning
- Authors: Nam H. Chu, Diep N. Nguyen, Dinh Thai Hoang, Khoa T. Phan, Eryk
Dutkiewicz, Dusit Niyato, and Tao Shu
- Abstract summary: This work proposes a novel framework to dynamically manage and allocate different types of resources for Metaverse applications.
We first propose an effective solution to divide applications into groups, namely MetaInstances, where common functions can be shared among applications.
Then, to capture the real-time, dynamic, and uncertain characteristics of request arrival and application departure processes, we develop a semi-Markov decision process-based framework.
- Score: 64.75603723249837
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work proposes a novel framework to dynamically and effectively manage
and allocate different types of resources for Metaverse applications, which are
forecasted to demand massive resources of various types that have never been
seen before. Specifically, by studying functions of Metaverse applications, we
first propose an effective solution to divide applications into groups, namely
MetaInstances, where common functions can be shared among applications to
enhance resource usage efficiency. Then, to capture the real-time, dynamic, and
uncertain characteristics of request arrival and application departure
processes, we develop a semi-Markov decision process-based framework and
propose an intelligent algorithm that can gradually learn the optimal admission
policy to maximize the revenue and resource usage efficiency for the Metaverse
service provider and at the same time enhance the Quality-of-Service for
Metaverse users. Extensive simulation results show that our proposed approach
can achieve up to 120% greater revenue for the Metaverse service providers and
up to 178.9% higher acceptance probability for Metaverse application requests
than those of other baselines.
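The function-sharing idea behind MetaInstances can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the `MetaInstance` class, the one-resource-unit-per-function cost model, and the greedy "admit if it fits" rule. The paper instead learns its admission policy within a semi-Markov decision process framework, so a learned policy may decide differently from this greedy rule.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class MetaInstance:
    """Hypothetical model: each distinct active function costs one resource unit."""

    capacity: int
    apps: dict[str, frozenset[str]] = field(default_factory=dict)

    def active_functions(self) -> set[str]:
        # Shared functions are counted once, no matter how many
        # running applications use them.
        active: set[str] = set()
        for functions in self.apps.values():
            active |= functions
        return active

    def extra_cost(self, functions: set[str]) -> int:
        # Only functions that are not already running need new resources.
        return len(set(functions) - self.active_functions())

    def admit(self, app_id: str, functions: set[str]) -> bool:
        # Greedy rule (an assumption, not the learned policy):
        # accept the request whenever the extra functions still fit.
        used = len(self.active_functions())
        if used + self.extra_cost(functions) > self.capacity:
            return False
        self.apps[app_id] = frozenset(functions)
        return True

    def depart(self, app_id: str) -> None:
        # Resources held only by the departing application are freed implicitly,
        # because active_functions() is recomputed from the remaining apps.
        self.apps.pop(app_id, None)


if __name__ == "__main__":
    instance = MetaInstance(capacity=4)
    print(instance.admit("tour", {"map", "avatar", "physics"}))
    print(instance.admit("meeting", {"map", "avatar", "chat"}))
    print(instance.admit("concert", {"voice"}))
```

In this sketch the second request needs only one new function because `map` and `avatar` are already running, which is the kind of resource saving that sharing common functions is meant to provide.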
Related papers
- From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning [62.54484062185869]
We introduce StepAgent, which utilizes step-wise reward to optimize the agent's reinforcement learning process.
We propose implicit-reward and inverse reinforcement learning techniques to facilitate agent reflection and policy adjustment.
arXiv Detail & Related papers (2024-11-06T10:35:11Z)
- Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization [21.115495457454365]
This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents.
We introduce an iterative approach where the search engine generates retrieval results for these RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase.
We adapt this approach to an online setting, allowing the search engine to refine its behavior based on real-time feedback from individual agents.
arXiv Detail & Related papers (2024-10-13T17:53:50Z)
- Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator [9.900800253949512]
We develop a bilevel optimization framework for meta-RL (BO-MRL) to learn the meta-prior for task-specific policy adaptation.
We empirically validate the correctness of the derived upper bounds and demonstrate the superior effectiveness of the proposed algorithm over benchmarks.
arXiv Detail & Related papers (2024-10-13T05:17:58Z)
- Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning [61.8360232713375]
We propose a reinforcement-based multi-source meta-transfer learning framework (Meta-RTL) for low-resource commonsense reasoning.
We present a reinforcement-based approach to dynamically estimating source task weights that measure the contribution of the corresponding tasks to the target task in the meta-transfer learning.
Experimental results demonstrate that Meta-RTL substantially outperforms strong baselines and previous task selection strategies.
arXiv Detail & Related papers (2024-09-27T18:22:22Z)
- Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning [0.0]
We introduce a general mapping of non-cumulative Markov decision processes to standard MDPs.
This allows all techniques developed to find optimal policies for MDPs to be directly applied to the larger class of NCMDPs.
We show applications in a diverse set of tasks, including classical control, portfolio optimization in finance, and discrete optimization problems.
arXiv Detail & Related papers (2024-05-22T13:01:37Z)
- RLEMMO: Evolutionary Multimodal Optimization Assisted By Deep Reinforcement Learning [8.389454219309837]
Multimodal optimization problems (MMOPs) require finding all optimal solutions, which is challenging with a limited number of function evaluations.
We propose RLEMMO, a Meta-Black-Box Optimization framework, which maintains a population of solutions and incorporates a reinforcement learning agent.
With a novel reward mechanism that encourages both quality and diversity, RLEMMO can be effectively trained using a policy gradient algorithm.
arXiv Detail & Related papers (2024-04-12T05:02:49Z)
- Let's reward step by step: Step-Level reward model as the Navigators for Reasoning [64.27898739929734]
Process-Supervised Reward Model (PRM) furnishes LLMs with step-by-step feedback during the training phase.
We propose a greedy search algorithm that employs the step-level feedback from PRM to optimize the reasoning pathways explored by LLMs.
To explore the versatility of our approach, we develop a novel method to automatically generate a step-level reward dataset for coding tasks and observe similarly improved performance on code generation tasks.
arXiv Detail & Related papers (2023-10-16T05:21:50Z)
- Attention-aware Resource Allocation and QoE Analysis for Metaverse xURLLC Services [78.17423912423999]
We study the interaction between the Metaverse service provider (MSP) and the network infrastructure provider (InP).
We propose a novel metric named Meta-DuImmersion that incorporates both objective and subjective feelings of Metaverse users.
We develop an attention-aware rendering capacity allocation scheme to improve QoE in xURLLC.
arXiv Detail & Related papers (2022-08-10T16:51:27Z)
- Deep Reinforcement Learning for Resource Allocation in Business Processes [3.0938904602244355]
We propose a novel representation that allows modeling of a multi-process environment with different process-based rewards.
We then use double deep reinforcement learning to look for an optimal resource allocation policy.
Deep reinforcement learning based resource allocation achieved significantly better results than two commonly used techniques.
arXiv Detail & Related papers (2021-03-29T11:20:25Z)
- Information Directed Reward Learning for Reinforcement Learning [64.33774245655401]
We learn a model of the reward function that allows standard RL algorithms to achieve high expected return with as few expert queries as possible.
In contrast to prior active reward learning methods designed for specific types of queries, IDRL naturally accommodates different query types.
We support our findings with extensive evaluations in multiple environments and with different types of queries.
arXiv Detail & Related papers (2021-02-24T18:46:42Z)