MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud
Environments
- URL: http://arxiv.org/abs/2205.10642v1
- Date: Sat, 21 May 2022 16:51:51 GMT
- Title: MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud
Environments
- Authors: Shreshth Tuli and Giuliano Casale and Nicholas R. Jennings
- Abstract summary: This work aims to solve the non-trivial meta problem of online dynamic selection of a scheduling policy using a surrogate model called MetaNet.
Compared to state-of-the-art DNN schedulers, this allows for improvement in execution costs, energy consumption, response time and service level agreement violations by up to 11, 43, 8 and 13 percent, respectively.
- Score: 13.864161788250856
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Task scheduling is a well-studied problem in the context of optimizing the
Quality of Service (QoS) of cloud computing environments. In order to sustain
the rapid growth of computational demands, one of the most important QoS
metrics for cloud schedulers is the execution cost. In this regard, several
data-driven deep neural networks (DNNs) based schedulers have been proposed in
recent years to allow scalable and efficient resource management in dynamic
workload settings. However, optimal scheduling frequently relies on
sophisticated DNNs with high computational needs implying higher execution
costs. Further, even in non-stationary environments, sophisticated schedulers
might not always be required and we could briefly rely on low-cost schedulers
in the interest of cost-efficiency. Therefore, this work aims to solve the
non-trivial meta problem of online dynamic selection of a scheduling policy
using a surrogate model called MetaNet. Unlike traditional solutions with a
fixed scheduling policy, MetaNet on-the-fly chooses a scheduler from a large
set of DNN based methods to optimize task scheduling and execution costs in
tandem. Compared to state-of-the-art DNN schedulers, this allows for
improvement in execution costs, energy consumption, response time and service
level agreement violations by up to 11, 43, 8 and 13 percent, respectively.
Related papers
- Capacity Planning and Scheduling for Jobs with Uncertainty in Resource Usage and Duration [11.232441969983672]
This work is to perform capacity planning, estimate resource requirements, and job scheduling for on-prem grid computing environments.<n>A key contribution of our approach is handling uncertainty in both resource usage and duration of the jobs.<n>We propose approximate approaches using deterministic estimators and pair sampling-based constraint programming.
arXiv Detail & Related papers (2025-07-01T22:56:08Z) - Semantic Scheduling for LLM Inference [48.19648297172146]
We introduce the concept of semantic scheduling in scheduling of requests from large language models (LLM)<n>We present a novel scheduling algorithm with optimal time complexity, designed to minimize the overall waiting time in LLM-based prompt scheduling.
arXiv Detail & Related papers (2025-06-13T20:15:58Z) - SMDP-Based Dynamic Batching for Improving Responsiveness and Energy Efficiency of Batch Services [12.600853777230185]
Parallel computing resources exhibit heightened computational and energy efficiency when operating with larger batch sizes.
In the realm of online services, the adoption of a larger batch size may lead to longer response times.
This paper aims to provide a dynamic scheme that delicately balances latency and efficiency.
arXiv Detail & Related papers (2025-01-04T04:14:09Z) - Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes [0.0]
This paper proposes a reinforcement learning (RL) approach to address the challenges of dynamic load scheduling.
Our results show that the RL-based method provides a robust and scalable solution for real-time load scheduling.
arXiv Detail & Related papers (2024-10-23T09:16:22Z) - A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical
Computation Offloading [62.34538208323411]
We propose a multi-head ensemble multi-task learning (MEMTL) approach with a shared backbone and multiple prediction heads (PHs)
MEMTL outperforms benchmark methods in both the inference accuracy and mean square error without requiring additional training data.
arXiv Detail & Related papers (2023-09-02T11:01:16Z) - Dynamic Scheduling for Federated Edge Learning with Streaming Data [56.91063444859008]
We consider a Federated Edge Learning (FEEL) system where training data are randomly generated over time at a set of distributed edge devices with long-term energy constraints.
Due to limited communication resources and latency requirements, only a subset of devices is scheduled for participating in the local training process in every iteration.
arXiv Detail & Related papers (2023-05-02T07:41:16Z) - Graph Neural Networks for the Offline Nanosatellite Task Scheduling
Problem [7.20853603061523]
This study investigates how to schedule nanosatellite tasks more efficiently using Graph Neural Networks (GNNs)
The goal is to find the optimal schedule for tasks to be carried out in orbit while taking into account Quality-of-Service (QoS) considerations.
arXiv Detail & Related papers (2023-03-24T03:17:28Z) - Scheduling Inference Workloads on Distributed Edge Clusters with
Reinforcement Learning [11.007816552466952]
This paper focuses on the problem of scheduling inference queries on Deep Neural Networks in edge networks at short timescales.
By means of simulations, we analyze several policies in the realistic network settings and workloads of a large ISP.
We design ASET, a Reinforcement Learning based scheduling algorithm able to adapt its decisions according to the system conditions.
arXiv Detail & Related papers (2023-01-31T13:23:34Z) - Towards Optimal VPU Compiler Cost Modeling by using Neural Networks to
Infer Hardware Performances [58.720142291102135]
'VPUNN' is a neural network-based cost model trained on low-level task profiling.
It consistently outperforms the state-of-the-art cost modeling in Intel's line of VPU processors.
arXiv Detail & Related papers (2022-05-09T22:48:39Z) - Innovations in the field of on-board scheduling technologies [64.41511459132334]
This paper proposes an onboard scheduler, that integrates inside an onboard software framework for mission autonomy.
The scheduler is based on linear integer programming and relies on the use of a branch-and-cut solver.
The technology has been tested on an Earth Observation scenario, comparing its performance against the state-of-the-art scheduling technology.
arXiv Detail & Related papers (2022-05-04T12:00:49Z) - MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing
Systems [12.215537834860699]
Recently proposed scheduling methods leverage the low response times of edge computing platforms to optimize application Quality of Service (QoS)
We propose MCDS: Monte Carlo Learning using Deep Surrogate Models to efficiently schedule workflow applications in mobile edge-cloud computing systems.
arXiv Detail & Related papers (2021-12-14T10:00:01Z) - Meta-learning with an Adaptive Task Scheduler [93.63502984214918]
Existing meta-learning algorithms randomly sample meta-training tasks with a uniform probability.
It is likely that tasks are detrimental with noise or imbalanced given a limited number of meta-training tasks.
We propose an adaptive task scheduler (ATS) for the meta-training process.
arXiv Detail & Related papers (2021-10-26T22:16:35Z) - Better than the Best: Gradient-based Improper Reinforcement Learning for
Network Scheduling [60.48359567964899]
We consider the problem of scheduling in constrained queueing networks with a view to minimizing packet delay.
We use a policy gradient based reinforcement learning algorithm that produces a scheduler that performs better than the available atomic policies.
arXiv Detail & Related papers (2021-05-01T10:18:34Z) - An Online Algorithm for Computation Offloading in Non-Stationary
Environments [12.843328612860244]
We consider the latency problem in a task-offloading scenario, where multiple servers are available to the user equipment for outsourcing computational tasks.
To account for the temporally dynamic nature of the wireless links and the availability of the computing resources, we model the server selection as a multi-armed bandit (MAB) problem.
We propose a novel online learning algorithm based on the principle of optimism in the face of uncertainty, which outperforms the state-of-the-art algorithms by up to 1s.
arXiv Detail & Related papers (2020-06-22T07:00:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.