Exploring sustainable alternatives for the deployment of microservices
architectures in the cloud
- URL: http://arxiv.org/abs/2402.11238v1
- Date: Sat, 17 Feb 2024 10:06:26 GMT
- Title: Exploring sustainable alternatives for the deployment of microservices
architectures in the cloud
- Authors: Vittorio Cortellessa, Daniele Di Pompeo, Michele Tucci
- Abstract summary: This paper introduces a novel approach to support cloud deployment of architectures by targeting optimal combinations of application performance, deployment costs, and power consumption.
The results demonstrate the potential of our approach through a comprehensive assessment of the Train Ticket case study.
- Score: 1.3812010983144802
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: As organizations increasingly migrate their applications to the cloud, the
optimization of microservices architectures becomes imperative for achieving
sustainability goals. Nonetheless, sustainable deployments may increase costs
and deteriorate performance, thus the identification of optimal tradeoffs among
these conflicting requirements is a key objective not easy to achieve. This
paper introduces a novel approach to support cloud deployment of microservices
architectures by targeting optimal combinations of application performance,
deployment costs, and power consumption. By leveraging genetic algorithms,
specifically NSGA-II, we automate the generation of alternative architectural
deployments. The results demonstrate the potential of our approach through a
comprehensive assessment of the Train Ticket case study.
Related papers
- Microservice Deployment in Space Computing Power Networks via Robust Reinforcement Learning [43.96374556275842]
It is important to provide reliable real-time remote sensing inference services to meet the low-latency requirements.
This paper presents a remote sensing artificial intelligence applications deployment framework designed for Low Earth Orbit satellite constellations.
arXiv Detail & Related papers (2025-01-08T16:55:04Z) - Real-Time Performance Optimization of Travel Reservation Systems Using AI and Microservices [1.03590082373586]
This study proposes a hybrid framework that folds an Artificial Intelligence and a Microservices approach for the performance optimization of the system.
The AI algorithms forecast demand patterns, optimize the allocation of resources, and enhance decision-making driven by Microservices architecture.
arXiv Detail & Related papers (2024-12-09T16:08:22Z) - AI-Driven Resource Allocation Framework for Microservices in Hybrid Cloud Platforms [1.03590082373586]
This paper presents an AI-driven framework for resource allocation among in hybrid cloud platforms.
The framework employs reinforcement learning (RL)-based resource utilization optimization to reduce costs and improve performance.
arXiv Detail & Related papers (2024-12-03T17:41:08Z) - Transforming the Hybrid Cloud for Emerging AI Workloads [81.15269563290326]
This white paper envisions transforming hybrid cloud systems to meet the growing complexity of AI workloads.
The proposed framework addresses critical challenges in energy efficiency, performance, and cost-effectiveness.
This joint initiative aims to establish hybrid clouds as secure, efficient, and sustainable platforms.
arXiv Detail & Related papers (2024-11-20T11:57:43Z) - An Infrastructure Cost Optimised Algorithm for Partitioning of Microservices [20.638612359627952]
As migrating applications into the cloud is universally adopted by the software industry, have proven to be the most suitable and widely accepted architecture pattern for applications deployed on distributed cloud.
Their efficacy is enabled by both technical benefits like reliability, fault isolation, scalability and productivity benefits like ease of asset maintenance and clear ownership boundaries.
In some cases, the complexity of migrating an existing application into the architecture becomes overwhelmingly complex and expensive.
arXiv Detail & Related papers (2024-08-13T02:08:59Z) - A Learning-based Incentive Mechanism for Mobile AIGC Service in Decentralized Internet of Vehicles [49.86094523878003]
We propose a decentralized incentive mechanism for mobile AIGC service allocation.
We employ multi-agent deep reinforcement learning to find the balance between the supply of AIGC services on RSUs and user demand for services within the IoV context.
arXiv Detail & Related papers (2024-03-29T12:46:07Z) - Dynamic Resource Allocation for Metaverse Applications with Deep
Reinforcement Learning [64.75603723249837]
This work proposes a novel framework to dynamically manage and allocate different types of resources for Metaverse applications.
We first propose an effective solution to divide applications into groups, namely MetaInstances, where common functions can be shared among applications.
Then, to capture the real-time, dynamic, and uncertain characteristics of request arrival and application departure processes, we develop a semi-Markov decision process-based framework.
arXiv Detail & Related papers (2023-02-27T00:30:01Z) - KAIROS: Building Cost-Efficient Machine Learning Inference Systems with
Heterogeneous Cloud Resources [10.462798429064277]
KAIROS is a novel runtime framework that maximizes the query throughput while meeting target and a cost budget.
Our evaluation using industry-grade deep learning (DL) models shows that KAIROS yields up to 2X the throughput of an optimal homogeneous solution.
arXiv Detail & Related papers (2022-10-12T03:06:51Z) - Slimmable Domain Adaptation [112.19652651687402]
We introduce a simple framework, Slimmable Domain Adaptation, to improve cross-domain generalization with a weight-sharing model bank.
Our framework surpasses other competing approaches by a very large margin on multiple benchmarks.
arXiv Detail & Related papers (2022-06-14T06:28:04Z) - Reproducible Performance Optimization of Complex Applications on the
Edge-to-Cloud Continuum [55.6313942302582]
We propose a methodology to support the optimization of real-life applications on the Edge-to-Cloud Continuum.
Our approach relies on a rigorous analysis of possible configurations in a controlled testbed environment to understand their behaviour.
Our methodology can be generalized to other applications in the Edge-to-Cloud Continuum.
arXiv Detail & Related papers (2021-08-04T07:35:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.