Predictive Auto-scaling with OpenStack Monasca
- URL: http://arxiv.org/abs/2111.02133v1
- Date: Wed, 3 Nov 2021 11:02:08 GMT
- Title: Predictive Auto-scaling with OpenStack Monasca
- Authors: Giacomo Lanciano, Filippo Galli, Tommaso Cucinotta, Davide Bacciu,
Andrea Passarella
- Abstract summary: We propose an architecture for auto-scaling cloud services based on the status in which the system is expected to evolve in the near future.
We prototyped our approach as an open-source component, which relies on, and extends, the monitoring capabilities offered by Monasca.
- Score: 8.631793985356286
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Cloud auto-scaling mechanisms are typically based on reactive automation
rules that scale a cluster whenever some metric, e.g., the average CPU usage
among instances, exceeds a predefined threshold. Tuning these rules becomes
particularly cumbersome when scaling-up a cluster involves non-negligible times
to bootstrap new instances, as it happens frequently in production cloud
services.
To deal with this problem, we propose an architecture for auto-scaling cloud
services based on the status in which the system is expected to evolve in the
near future. Our approach leverages on time-series forecasting techniques, like
those based on machine learning and artificial neural networks, to predict the
future dynamics of key metrics, e.g., resource consumption metrics, and apply a
threshold-based scaling policy on them. The result is a predictive automation
policy that is able, for instance, to automatically anticipate peaks in the
load of a cloud application and trigger ahead of time appropriate scaling
actions to accommodate the expected increase in traffic.
We prototyped our approach as an open-source OpenStack component, which
relies on, and extends, the monitoring capabilities offered by Monasca,
resulting in the addition of predictive metrics that can be leveraged by
orchestration components like Heat or Senlin. We show experimental results
using a recurrent neural network and a multi-layer perceptron as predictor,
which are compared with a simple linear regression and a traditional
non-predictive auto-scaling policy. However, the proposed framework allows for
the easy customization of the prediction policy as needed.
Related papers
- An Adaptive Framework for Generalizing Network Traffic Prediction
towards Uncertain Environments [51.99765487172328]
We have developed a new framework using time-series analysis for dynamically assigning mobile network traffic prediction models.
Our framework employs learned behaviors, outperforming any single model with over a 50% improvement relative to current studies.
arXiv Detail & Related papers (2023-11-30T18:58:38Z) - Cluster-and-Conquer: A Framework For Time-Series Forecasting [94.63501563413725]
We propose a three-stage framework for forecasting high-dimensional time-series data.
Our framework is highly general, allowing for any time-series forecasting and clustering method to be used in each step.
When instantiated with simple linear autoregressive models, we are able to achieve state-of-the-art results on several benchmark datasets.
arXiv Detail & Related papers (2021-10-26T20:41:19Z) - onlineforecast: An R package for adaptive and recursive forecasting [1.2647816797166165]
R package onlineforecast provides a generalized setup of data and models for online forecasting.
It has functionality for time-adaptive fitting of linear regression-based models.
The package comes with comprehensive vignettes and examples of online forecasting applications in energy systems.
arXiv Detail & Related papers (2021-09-27T10:01:35Z) - Click-through Rate Prediction with Auto-Quantized Contrastive Learning [46.585376453464114]
We consider whether the user behaviors are rich enough to capture the interests for prediction, and propose an Auto-Quantized Contrastive Learning (AQCL) loss to regularize the model.
The proposed framework is agnostic to different model architectures and can be trained in an end-to-end fashion.
arXiv Detail & Related papers (2021-09-27T04:39:43Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - A Predictive Autoscaler for Elastic Batch Jobs [8.354712625979776]
Large batch jobs such as Deep Learning, HPC and Spark require far more computational resources and higher cost than conventional online service.
We propose a predictive autoscaler to provide an elastic interface for the customers and overprovision instances.
arXiv Detail & Related papers (2020-10-10T17:35:55Z) - Superiority of Simplicity: A Lightweight Model for Network Device
Workload Prediction [58.98112070128482]
We propose a lightweight solution for series prediction based on historic observations.
It consists of a heterogeneous ensemble method composed of two models - a neural network and a mean predictor.
It achieves an overall $R2$ score of 0.10 on the available FedCSIS 2020 challenge dataset.
arXiv Detail & Related papers (2020-07-07T15:44:16Z) - AutoCP: Automated Pipelines for Accurate Prediction Intervals [84.16181066107984]
This paper proposes an AutoML framework called Automatic Machine Learning for Conformal Prediction (AutoCP)
Unlike the familiar AutoML frameworks that attempt to select the best prediction model, AutoCP constructs prediction intervals that achieve the user-specified target coverage rate.
We tested AutoCP on a variety of datasets and found that it significantly outperforms benchmark algorithms.
arXiv Detail & Related papers (2020-06-24T23:13:11Z) - AI-based Resource Allocation: Reinforcement Learning for Adaptive
Auto-scaling in Serverless Environments [0.0]
Serverless computing has emerged as a compelling new paradigm of cloud computing models in recent years.
A common approach among both commercial and open source serverless computing platforms is workload-based auto-scaling.
In this paper we investigate the applicability of a reinforcement learning approach to request-based auto-scaling in a serverless framework.
arXiv Detail & Related papers (2020-05-29T06:18:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.