MOSAIC: A Multi-Objective Optimization Framework for Sustainable
Datacenter Management
- URL: http://arxiv.org/abs/2311.08583v1
- Date: Tue, 14 Nov 2023 23:05:37 GMT
- Title: MOSAIC: A Multi-Objective Optimization Framework for Sustainable
Datacenter Management
- Authors: Sirui Qi, Dejan Milojicic, Cullen Bash, Sudeep Pasricha
- Abstract summary: We propose a novel framework for sustainable datacenter management.
We take into account multiple geography- and time-based factors including renewable energy sources, variable energy costs, power usage efficiency, carbon factors, and water intensity in energy.
Our framework achieves a cumulative improvement across all objectives (carbon, water, cost) of up to 4.61x compared to the state-of-the-arts.
- Score: 2.9699290794642366
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In recent years, cloud service providers have been building and hosting
datacenters across multiple geographical locations to provide robust services.
However, the geographical distribution of datacenters introduces growing
pressure to both local and global environments, particularly when it comes to
water usage and carbon emissions. Unfortunately, efforts to reduce the
environmental impact of such datacenters often lead to an increase in the cost
of datacenter operations. To co-optimize the energy cost, carbon emissions, and
water footprint of datacenter operation from a global perspective, we propose a
novel framework for multi-objective sustainable datacenter management (MOSAIC)
that integrates adaptive local search with a collaborative decomposition-based
evolutionary algorithm to intelligently manage geographical workload
distribution and datacenter operations. Our framework sustainably allocates
workloads to datacenters while taking into account multiple geography- and
time-based factors including renewable energy sources, variable energy costs,
power usage efficiency, carbon factors, and water intensity in energy. Our
experimental results show that, compared to the best-known prior work
frameworks, MOSAIC can achieve 27.45x speedup and 1.53x improvement in Pareto
Hypervolume while reducing the carbon footprint by up to 1.33x, water footprint
by up to 3.09x, and energy costs by up to 1.40x. In the simultaneous
three-objective co-optimization scenario, MOSAIC achieves a cumulative
improvement across all objectives (carbon, water, cost) of up to 4.61x compared
to the state-of-the-arts.
Related papers
- SustainDC: Benchmarking for Sustainable Data Center Control [4.159959816797259]
We introduce SustainDC, a set of Python environments for benchmarking multi-agent reinforcement learning (MARL) algorithms for data centers (DC)
SustainDC supports custom DC configurations and tasks such as workload scheduling, cooling optimization, and auxiliary battery management.
We evaluate various MARL algorithms on SustainDC, showing their performance across diverse DC designs, locations, weather conditions, grid carbon intensity, and workload requirements.
arXiv Detail & Related papers (2024-08-14T22:43:52Z) - Efficient Federated Learning Using Dynamic Update and Adaptive Pruning with Momentum on Shared Server Data [59.6985168241067]
Federated Learning (FL) encounters two important problems, i.e., low training efficiency and limited computational resources.
We propose a new FL framework, FedDUMAP, to leverage the shared insensitive data on the server and the distributed data in edge devices.
Our proposed FL model, FedDUMAP, combines the three original techniques and has a significantly better performance compared with baseline approaches.
arXiv Detail & Related papers (2024-08-11T02:59:11Z) - I've Got 99 Problems But FLOPS Ain't One [70.3084616806354]
We take an unconventional approach to find relevant research directions, starting from public plans to build a $100 billion datacenter for machine learning applications.
We discover what workloads such a datacenter might carry and explore the challenges one may encounter in doing so, with a focus on networking research.
We conclude that building the datacenter and training such models is technically possible, but this requires novel wide-area transports for inter-DC communication, a multipath transport and novel datacenter topologies.
arXiv Detail & Related papers (2024-07-01T10:33:46Z) - Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers [3.3379026542599934]
This work introduces a unique approach combining Game Theory (GT) and Deep Reinforcement Learning (DRL) for optimizing the distribution of AI inference workloads in geo-distributed data centers.
The proposed technique integrates the principles of non-cooperative Game Theory into a DRL framework, enabling data centers to make intelligent decisions regarding workload allocation.
arXiv Detail & Related papers (2024-04-01T20:13:28Z) - Spatio-temporal load shifting for truly clean computing [0.5857582826810999]
We study the impact of shifting computing jobs and associated power loads both in time and between locations.
We isolate three signals relevant for informed use of loadblity.
The costs of 24/7 CFE are reduced by 1.29$pm$0.07 EUR/MWh for every additional percentage of flexible load.
arXiv Detail & Related papers (2024-03-26T13:36:42Z) - Carbon Footprint Reduction for Sustainable Data Centers in Real-Time [2.794742330785396]
We propose a Data Center Carbon Footprint Reduction (DC-CFR) multi-agent Reinforcement Learning (MARL) framework to optimize data centers for the objectives of carbon footprint reduction, energy consumption, and energy cost.
The results show that the DC-CFR MARL agents effectively resolved the complex interdependencies in optimizing cooling, load shifting, and energy storage in real-time for various locations under real-world dynamic weather and grid carbon intensity conditions.
arXiv Detail & Related papers (2024-03-21T02:59:56Z) - SHIELD: Sustainable Hybrid Evolutionary Learning Framework for Carbon,
Wastewater, and Energy-Aware Data Center Management [2.9699290794642366]
Geo-distributed data centers (GDDCs) have a significant associated environmental impact.
This paper proposes a novel framework to co-optimize carbon emissions, water footprint, and energy costs of GDDCs.
arXiv Detail & Related papers (2023-08-24T21:11:55Z) - Sustainable AIGC Workload Scheduling of Geo-Distributed Data Centers: A
Multi-Agent Reinforcement Learning Approach [48.18355658448509]
Recent breakthroughs in generative artificial intelligence have triggered a surge in demand for machine learning training, which poses significant cost burdens and environmental challenges due to its substantial energy consumption.
Scheduling training jobs among geographically distributed cloud data centers unveils the opportunity to optimize the usage of computing capacity powered by inexpensive and low-carbon energy.
We propose an algorithm based on multi-agent reinforcement learning and actor-critic methods to learn the optimal collaborative scheduling strategy through interacting with a cloud system built with real-life workload patterns, energy prices, and carbon intensities.
arXiv Detail & Related papers (2023-04-17T02:12:30Z) - Measuring the Carbon Intensity of AI in Cloud Instances [91.28501520271972]
We provide a framework for measuring software carbon intensity, and propose to measure operational carbon emissions.
We evaluate a suite of approaches for reducing emissions on the Microsoft Azure cloud compute platform.
arXiv Detail & Related papers (2022-06-10T17:04:04Z) - Power Modeling for Effective Datacenter Planning and Compute Management [53.41102502425513]
We discuss two classes of statistical power models designed and validated to be accurate, simple, interpretable and applicable to all hardware configurations and workloads.
We demonstrate that the proposed statistical modeling techniques, while simple and scalable, predict power with less than 5% Mean Absolute Percent Error (MAPE) for more than 95% diverse Power Distribution Units (more than 2000) using only 4 features.
arXiv Detail & Related papers (2021-03-22T21:22:51Z) - A Framework for Energy and Carbon Footprint Analysis of Distributed and
Federated Edge Learning [48.63610479916003]
This article breaks down and analyzes the main factors that influence the environmental footprint of distributed learning policies.
It models both vanilla and decentralized FL policies driven by consensus.
Results show that FL allows remarkable end-to-end energy savings (30%-40%) for wireless systems characterized by low bit/Joule efficiency.
arXiv Detail & Related papers (2021-03-18T16:04:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.