Managing Cold-start in The Serverless Cloud with Temporal Convolutional
Networks
- URL: http://arxiv.org/abs/2304.00396v1
- Date: Sat, 1 Apr 2023 21:54:22 GMT
- Title: Managing Cold-start in The Serverless Cloud with Temporal Convolutional
Networks
- Authors: Tam N. Nguyen
- Abstract summary: Serverless cloud is an innovative cloud service model that frees customers from most cloud management duties.
A big threat to the serverless cloud's performance is cold-start, which is when the time of provisioning the needed cloud resource to serve customers' requests incurs unacceptable costs to the service providers and/or the customers.
This paper proposes a novel low-coupling, high-cohesion ensemble policy that addresses the cold-start problem at infrastructure- and function-levels of the serverless cloud stack.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Serverless cloud is an innovative cloud service model that frees customers
from most cloud management duties. It also offers the same advantages as other
cloud models but at much lower costs. As a result, the serverless cloud has
been increasingly employed in high-impact areas such as system security,
banking, and health care. A big threat to the serverless cloud's performance is
cold-start, which is when the time of provisioning the needed cloud resource to
serve customers' requests incurs unacceptable costs to the service providers
and/or the customers. This paper proposes a novel low-coupling, high-cohesion
ensemble policy that addresses the cold-start problem at infrastructure- and
function-levels of the serverless cloud stack, while the state of the art
policies have a more narrowed focus. This ensemble policy anchors on the
prediction of function instance arrivals, 10 to 15 minutes into the future. It
is achievable by using the temporal convolutional network (TCN) deep-learning
method. Bench-marking results on a real-world dataset from a large-scale
serverless cloud provider show that TCN out-performs other popular machine
learning algorithms for time series. Going beyond cold-start management, the
proposed policy and publicly available codes can be adopted in solving other
cloud problems such as optimizing the provisioning of virtual software-defined
network assets.
Related papers
- CloudCast -- Total Cloud Cover Nowcasting with Machine Learning [0.0]
Cloud cover plays a critical role in weather prediction and impacts several sectors, including agriculture, solar power generation, and aviation.
Despite advancements in numerical weather prediction (NWP) models, forecasting total cloud cover remains challenging due to the small-scale nature of cloud formation processes.
We introduce CloudCast, a convolutional neural network (CNN) based on the U-Net architecture, designed to predict total cloud cover (TCC) up to five hours ahead.
arXiv Detail & Related papers (2024-10-27T14:46:13Z) - Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving [73.55745551827229]
We argue that we can, and must, rely on the cloud for real-time control systems like self-driving cars.
We identify an opportunity to offload parts of time-sensitive and latency-critical compute to the cloud.
arXiv Detail & Related papers (2024-10-21T17:32:36Z) - Teaching Cloud Infrastructure and Scalable Application Deployment in an Undergraduate Computer Science Program [2.8912542516745168]
Students commonly interact with cloud abstractions early in their technical careers.
Building cloud-native applications without a firm understanding of the fundamentals of cloud engineering can leave students susceptible to cost and security pitfalls common to cloud platforms.
Our course featured a number of hands-on assignments that gave students experience with modern, best-practice concepts and tools such as IaC, containerization, observability, serverless computing, and continuous integration and deployment.
arXiv Detail & Related papers (2024-10-01T19:49:19Z) - IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images [55.40601468843028]
We present an iterative diffusion process for cloud removal (IDF-CR)
IDF-CR is divided into two-stage models that address pixel space and latent space.
In the latent space stage, the diffusion model transforms low-quality cloud removal into high-quality clean output.
arXiv Detail & Related papers (2024-03-18T15:23:48Z) - Application of Machine Learning Optimization in Cloud Computing Resource
Scheduling and Management [18.462300407761873]
The scale of cloud computing in China has reached 209.1 billion yuan.
This paper proposes an innovative approach to solve complex problems in cloud computing resource scheduling and management.
arXiv Detail & Related papers (2024-02-27T05:14:27Z) - CWD: A Machine Learning based Approach to Detect Unknown Cloud Workloads [3.523208537466129]
We develop a machine learning based technique to characterize, profile and predict workloads running in the cloud environment.
We also develop techniques to analyze the performance of the model in a standalone manner.
arXiv Detail & Related papers (2022-11-28T19:41:56Z) - Device-Cloud Collaborative Recommendation via Meta Controller [65.97416287295152]
We propose a meta controller to dynamically manage the collaboration between the on-device recommender and the cloud-based recommender.
On the basis of the counterfactual samples and the extended training, extensive experiments in the industrial recommendation scenarios show the promise of meta controller.
arXiv Detail & Related papers (2022-07-07T03:23:04Z) - Measuring the Carbon Intensity of AI in Cloud Instances [91.28501520271972]
We provide a framework for measuring software carbon intensity, and propose to measure operational carbon emissions.
We evaluate a suite of approaches for reducing emissions on the Microsoft Azure cloud compute platform.
arXiv Detail & Related papers (2022-06-10T17:04:04Z) - Unsupervised Point Cloud Representation Learning with Deep Neural
Networks: A Survey [104.71816962689296]
Unsupervised point cloud representation learning has attracted increasing attention due to the constraint in large-scale point cloud labelling.
This paper provides a comprehensive review of unsupervised point cloud representation learning using deep neural networks.
arXiv Detail & Related papers (2022-02-28T07:46:05Z) - Auto-Split: A General Framework of Collaborative Edge-Cloud AI [49.750972428032355]
This paper describes the techniques and engineering practice behind Auto-Split, an edge-cloud collaborative prototype of Huawei Cloud.
To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
arXiv Detail & Related papers (2021-08-30T08:03:29Z) - Cloud Collectives: Towards Cloud-aware Collectives forML Workloads with
Rank Reordering [8.81194405760133]
Cloud Collectives is a prototype that accelerates collectives by reorderingranks of participating frameworks.
Collectives is non-intrusive, requires no code changes nor rebuild of an existing application, and runs without support from cloud providers.
Preliminary application of Cloud Collectives on allreduce operations in public clouds results in a speedup of up to 3.7x in multiple microbenchmarks and 1.3x in real-world workloads.
arXiv Detail & Related papers (2021-05-28T20:14:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.