A Bandit Approach to Online Pricing for Heterogeneous Edge Resource
Allocation
- URL: http://arxiv.org/abs/2302.06953v1
- Date: Tue, 14 Feb 2023 10:21:14 GMT
- Title: A Bandit Approach to Online Pricing for Heterogeneous Edge Resource
Allocation
- Authors: Jiaming Cheng, Duong Thuy Anh Nguyen, Lele Wang, Duong Tung Nguyen,
Vijay K. Bhargava
- Abstract summary: Two novel online pricing mechanisms are proposed for heterogeneous edge resource allocation.
The mechanisms operate in real-time and do not require prior knowledge of demand distribution.
The proposed posted pricing schemes allow users to select and pay for their preferred resources, with the platform dynamically adjusting resource prices based on observed historical data.
- Score: 8.089950414444115
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Edge Computing (EC) offers a superior user experience by positioning cloud
resources in close proximity to end users. The challenge of allocating edge
resources efficiently while maximizing profit for the EC platform remains a
sophisticated problem, especially with the added complexity of the online
arrival of resource requests. To address this challenge, we propose to cast the
problem as a multi-armed bandit problem and develop two novel online pricing
mechanisms, the Kullback-Leibler Upper Confidence Bound (KL-UCB) algorithm and
the Min-Max Optimal algorithm, for heterogeneous edge resource allocation.
These mechanisms operate in real-time and do not require prior knowledge of
demand distribution, which can be difficult to obtain in practice. The proposed
posted pricing schemes allow users to select and pay for their preferred
resources, with the platform dynamically adjusting resource prices based on
observed historical data. Numerical results show the advantages of the proposed
mechanisms compared to several benchmark schemes derived from traditional
bandit algorithms, including the Epsilon-Greedy, basic UCB, and Thompson
Sampling algorithms.
Related papers
- Improving Portfolio Optimization Results with Bandit Networks [0.0]
We introduce and evaluate novel Bandit algorithms designed for non-stationary environments.
First, we present the Adaptive Discounted Thompson Sampling (ADTS) algorithm.
We then extend this approach to the Portfolio Optimization problem by introducing the Combinatorial Adaptive Discounted Thompson Sampling (CADTS) algorithm.
arXiv Detail & Related papers (2024-10-05T16:17:31Z) - A Federated Online Restless Bandit Framework for Cooperative Resource Allocation [23.698976872351576]
We study the cooperative resource allocation problem with unknown system dynamics of MRPs.
We put forth a Federated Thompson-enabled Whittle Index (FedTSWI) algorithm to solve this multi-agent online RMAB problem.
Numerical results show that the proposed algorithm achieves a fast convergence rate of $mathcalO(sqrtTlog(T))$ and better performance compared with baselines.
arXiv Detail & Related papers (2024-06-12T08:34:53Z) - Online Learning under Budget and ROI Constraints via Weak Adaptivity [57.097119428915796]
Existing primal-dual algorithms for constrained online learning problems rely on two fundamental assumptions.
We show how such assumptions can be circumvented by endowing standard primal-dual templates with weakly adaptive regret minimizers.
We prove the first best-of-both-worlds no-regret guarantees which hold in absence of the two aforementioned assumptions.
arXiv Detail & Related papers (2023-02-02T16:30:33Z) - Incentive-Aware Recommender Systems in Two-Sided Markets [49.692453629365204]
We propose a novel recommender system that aligns with agents' incentives while achieving myopically optimal performance.
Our framework models this incentive-aware system as a multi-agent bandit problem in two-sided markets.
Both algorithms satisfy an ex-post fairness criterion, which protects agents from over-exploitation.
arXiv Detail & Related papers (2022-11-23T22:20:12Z) - Interactive Recommendations for Optimal Allocations in Markets with
Constraints [12.580391999838128]
We propose an interactive framework where the system provider can enhance the quality of recommendations to the users.
We employ an integrated approach using techniques from collaborative filtering, bandits, and optimal resource allocation.
Empirical studies on synthetic matrix and real-world data also demonstrate the effectiveness and performance of our approach.
arXiv Detail & Related papers (2022-07-08T22:16:51Z) - Online Allocation with Two-sided Resource Constraints [44.5635910908944]
We consider an online allocation problem subject to lower and upper resource constraints, where the requests arrive sequentially.
We propose a new algorithm that obtains $1-O(fracepsilonalpha-epsilon)$ -competitive ratio for the offline problems that know the entire requests ahead of time.
arXiv Detail & Related papers (2021-12-28T02:21:06Z) - The Best of Many Worlds: Dual Mirror Descent for Online Allocation
Problems [7.433931244705934]
We consider a data-driven setting in which the reward and resource consumption of each request are generated using an input model unknown to the decision maker.
We design general class of algorithms that attain good performance in various input models without knowing which type of input they are facing.
Our algorithms operate in the Lagrangian dual space: they maintain a dual multiplier for each resource that is updated using online mirror descent.
arXiv Detail & Related papers (2020-11-18T18:39:17Z) - Coordinated Online Learning for Multi-Agent Systems with Coupled
Constraints and Perturbed Utility Observations [91.02019381927236]
We introduce a novel method to steer the agents toward a stable population state, fulfilling the given resource constraints.
The proposed method is a decentralized resource pricing method based on the resource loads resulting from the augmentation of the game's Lagrangian.
arXiv Detail & Related papers (2020-10-21T10:11:17Z) - Resource Allocation via Model-Free Deep Learning in Free Space Optical
Communications [119.81868223344173]
The paper investigates the general problem of resource allocation for mitigating channel fading effects in Free Space Optical (FSO) communications.
Under this framework, we propose two algorithms that solve FSO resource allocation problems.
arXiv Detail & Related papers (2020-07-27T17:38:51Z) - Hierarchical Adaptive Contextual Bandits for Resource Constraint based
Recommendation [49.69139684065241]
Contextual multi-armed bandit (MAB) achieves cutting-edge performance on a variety of problems.
In this paper, we propose a hierarchical adaptive contextual bandit method (HATCH) to conduct the policy learning of contextual bandits with a budget constraint.
arXiv Detail & Related papers (2020-04-02T17:04:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.