A Cost-Aware Mechanism for Optimized Resource Provisioning in Cloud
Computing
- URL: http://arxiv.org/abs/2309.11299v1
- Date: Wed, 20 Sep 2023 13:27:30 GMT
- Title: A Cost-Aware Mechanism for Optimized Resource Provisioning in Cloud
Computing
- Authors: Safiye Ghasemi, Mohammad Reza Meybodi, Mehdi Dehghan Takht Fooladi,
and Amir Masoud Rahmani
- Abstract summary: We have proposed a novel learning based resource provisioning approach that achieves cost-reduction guarantees of demands.
Our method adapts most of the requirements efficiently, and furthermore the resulting performance meets our design goals.
- Score: 6.369406986434764
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Due to the recent wide use of computational resources in cloud computing, new
resource provisioning challenges have been emerged. Resource provisioning
techniques must keep total costs to a minimum while meeting the requirements of
the requests. According to widely usage of cloud services, it seems more
challenging to develop effective schemes for provisioning services
cost-effectively; we have proposed a novel learning based resource provisioning
approach that achieves cost-reduction guarantees of demands. The contributions
of our optimized resource provisioning (ORP) approach are as follows. Firstly,
it is designed to provide a cost-effective method to efficiently handle the
provisioning of requested applications; while most of the existing models allow
only workflows in general which cares about the dependencies of the tasks, ORP
performs based on services of which applications comprised and cares about
their efficient provisioning totally. Secondly, it is a learning automata-based
approach which selects the most proper resources for hosting each service of
the demanded application; our approach considers both cost and service
requirements together for deploying applications. Thirdly, a comprehensive
evaluation is performed for three typical workloads: data-intensive,
process-intensive and normal applications. The experimental results show that
our method adapts most of the requirements efficiently, and furthermore the
resulting performance meets our design goals.
Related papers
- Benchmarking Open-Source Language Models for Efficient Question Answering in Industrial Applications [0.0]
Large Language Models (LLMs) have demonstrated remarkable capabilities in tasks such as question answering (QA)
This paper presents a comprehensive benchmarking study comparing open-source LLMs with their non-open-source counterparts on the task of question answering.
arXiv Detail & Related papers (2024-06-19T17:11:51Z) - EASRec: Elastic Architecture Search for Efficient Long-term Sequential
Recommender Systems [82.76483989905961]
Current Sequential Recommender Systems (SRSs) suffer from computational and resource inefficiencies.
We develop the Elastic Architecture Search for Efficient Long-term Sequential Recommender Systems (EASRec)
EASRec introduces data-aware gates that leverage historical information from input data batch to improve the performance of the recommendation network.
arXiv Detail & Related papers (2024-02-01T07:22:52Z) - Machine Learning Insides OptVerse AI Solver: Design Principles and
Applications [74.67495900436728]
We present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI solver.
We showcase our methods for generating complex SAT and MILP instances utilizing generative models that mirror multifaceted structures of real-world problem.
We detail the incorporation of state-of-the-art parameter tuning algorithms which markedly elevate solver performance.
arXiv Detail & Related papers (2024-01-11T15:02:15Z) - Elastic Entangled Pair and Qubit Resource Management in Quantum Cloud
Computing [73.7522199491117]
Quantum cloud computing (QCC) offers a promising approach to efficiently provide quantum computing resources.
The fluctuations in user demand and quantum circuit requirements are challenging for efficient resource provisioning.
We propose a resource allocation model to provision quantum computing and networking resources.
arXiv Detail & Related papers (2023-07-25T00:38:46Z) - Dynamic Resource Allocation for Metaverse Applications with Deep
Reinforcement Learning [64.75603723249837]
This work proposes a novel framework to dynamically manage and allocate different types of resources for Metaverse applications.
We first propose an effective solution to divide applications into groups, namely MetaInstances, where common functions can be shared among applications.
Then, to capture the real-time, dynamic, and uncertain characteristics of request arrival and application departure processes, we develop a semi-Markov decision process-based framework.
arXiv Detail & Related papers (2023-02-27T00:30:01Z) - CILP: Co-simulation based Imitation Learner for Dynamic Resource
Provisioning in Cloud Computing Environments [13.864161788250856]
Key challenge for latency-critical tasks is to predict future workload demands to provision proactively.
Existing AI-based solutions tend to not holistically consider all crucial aspects such as provision overheads, heterogeneous VM costs and Quality of Service (QoS) of the cloud system.
We propose a novel method, called CILP, that formulates the VM provisioning problem as two sub-problems of prediction and optimization.
arXiv Detail & Related papers (2023-02-11T09:15:34Z) - Actively Learning Costly Reward Functions for Reinforcement Learning [56.34005280792013]
We show that it is possible to train agents in complex real-world environments orders of magnitudes faster.
By enabling the application of reinforcement learning methods to new domains, we show that we can find interesting and non-trivial solutions.
arXiv Detail & Related papers (2022-11-23T19:17:20Z) - Outsourcing Training without Uploading Data via Efficient Collaborative
Open-Source Sampling [49.87637449243698]
Traditional outsourcing requires uploading device data to the cloud server.
We propose to leverage widely available open-source data, which is a massive dataset collected from public and heterogeneous sources.
We develop a novel strategy called Efficient Collaborative Open-source Sampling (ECOS) to construct a proximal proxy dataset from open-source data for cloud training.
arXiv Detail & Related papers (2022-10-23T00:12:18Z) - ANDREAS: Artificial intelligence traiNing scheDuler foR accElerAted
resource clusterS [1.798617052102518]
We propose ANDREAS, an advanced scheduling solution to maximize performance and minimize Data Centers operational costs.
experiments show that we can achieve a cost reduction between 30 and 62% on average with respect to first-principle methods.
arXiv Detail & Related papers (2021-05-11T14:36:19Z) - Function Design for Improved Competitive Ratio in Online Resource
Allocation with Procurement Costs [16.68130648568593]
We study the problem of online resource allocation, where multiple customers arrive the seller must allocate resources to each incoming customer while also facing a procurement cost for the total allocation.
arXiv Detail & Related papers (2020-12-23T02:32:47Z) - AI-based Resource Allocation: Reinforcement Learning for Adaptive
Auto-scaling in Serverless Environments [0.0]
Serverless computing has emerged as a compelling new paradigm of cloud computing models in recent years.
A common approach among both commercial and open source serverless computing platforms is workload-based auto-scaling.
In this paper we investigate the applicability of a reinforcement learning approach to request-based auto-scaling in a serverless framework.
arXiv Detail & Related papers (2020-05-29T06:18:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.