PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability
- URL: http://arxiv.org/abs/2310.03906v8
- Date: Tue, 26 Mar 2024 21:48:44 GMT
- Title: PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability
- Authors: Avisek Naug, Antonio Guillen, Ricardo Luna GutiƩrrez, Vineet Gundecha, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Lorenz Krause, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar,
- Abstract summary: PyDCM is a customizable Data Center Model implemented in Python.
The use of vectorized thermal calculations makes PyDCM orders of magnitude faster (30 times) than current Energy Plus modeling implementations.
- Score: 2.6429542504022314
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The increasing global emphasis on sustainability and reducing carbon emissions is pushing governments and corporations to rethink their approach to data center design and operation. Given their high energy consumption and exponentially large computational workloads, data centers are prime candidates for optimizing power consumption, especially in areas such as cooling and IT energy usage. A significant challenge in this pursuit is the lack of a configurable and scalable thermal data center model that offers an end-to-end pipeline. Data centers consist of multiple IT components whose geometric configuration and heat dissipation make thermal modeling difficult. This paper presents PyDCM, a customizable Data Center Model implemented in Python, that allows users to create unique configurations of IT equipment with custom server specifications and geometric arrangements of IT cabinets. The use of vectorized thermal calculations makes PyDCM orders of magnitude faster (30 times) than current Energy Plus modeling implementations and scales sublinearly with the number of CPUs. Also, PyDCM enables the use of Deep Reinforcement Learning via the Gymnasium wrapper to optimize data center cooling and offers a user-friendly platform for testing various data center design prototypes.
Related papers
- DCP: Learning Accelerator Dataflow for Neural Network via Propagation [52.06154296196845]
This work proposes an efficient data-centric approach, named Dataflow Code Propagation (DCP), to automatically find the optimal dataflow for DNN layers in seconds without human effort.
DCP learns a neural predictor to efficiently update the dataflow codes towards the desired gradient directions to minimize various optimization objectives.
For example, without using additional training data, DCP surpasses the GAMMA method that performs a full search using thousands of samples.
arXiv Detail & Related papers (2024-10-09T05:16:44Z) - A Configurable Pythonic Data Center Model for Sustainable Cooling and ML Integration [4.0196072781228285]
We showcase PyDCM, a Python library that enables extremely fast prototyping of data center design.
We demonstrate capabilities of PyDCM and compare them to existing works in EnergyPlus for modeling data centers.
PyDCM can also be used as a standalone Gymnasium environment for demonstrating sustainability-focused data center control.
arXiv Detail & Related papers (2024-04-18T20:25:33Z) - Sustainability of Data Center Digital Twins with Reinforcement Learning [2.4971633082970377]
Machine learning (ML) has led to an increased demand for computational power, resulting in larger data centers (DCs) and higher energy consumption.
To address this issue and reduce carbon emissions, intelligent design and control of DC components such as IT servers, cabinets, HVAC cooling, flexible load shifting, and battery energy storage are essential.
DCRL-Green is a multi-agent RL environment that empowers the ML community to design data centers and research, develop, and refine RL controllers for carbon footprint reduction in DCs.
arXiv Detail & Related papers (2024-04-16T18:22:30Z) - Multi-Edge Server-Assisted Dynamic Federated Learning with an Optimized
Floating Aggregation Point [51.47520726446029]
cooperative edge learning (CE-FL) is a distributed machine learning architecture.
We model the processes taken during CE-FL, and conduct analytical training.
We show the effectiveness of our framework with the data collected from a real-world testbed.
arXiv Detail & Related papers (2022-03-26T00:41:57Z) - Treehouse: A Case For Carbon-Aware Datacenter Software [4.7521372297013365]
The end of Dennard scaling and the slowing of Moore's Law has put the energy use of datacenters on an unsustainable path.
We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach.
arXiv Detail & Related papers (2022-01-06T16:00:53Z) - Power Modeling for Effective Datacenter Planning and Compute Management [53.41102502425513]
We discuss two classes of statistical power models designed and validated to be accurate, simple, interpretable and applicable to all hardware configurations and workloads.
We demonstrate that the proposed statistical modeling techniques, while simple and scalable, predict power with less than 5% Mean Absolute Percent Error (MAPE) for more than 95% diverse Power Distribution Units (more than 2000) using only 4 features.
arXiv Detail & Related papers (2021-03-22T21:22:51Z) - A Framework for Energy and Carbon Footprint Analysis of Distributed and
Federated Edge Learning [48.63610479916003]
This article breaks down and analyzes the main factors that influence the environmental footprint of distributed learning policies.
It models both vanilla and decentralized FL policies driven by consensus.
Results show that FL allows remarkable end-to-end energy savings (30%-40%) for wireless systems characterized by low bit/Joule efficiency.
arXiv Detail & Related papers (2021-03-18T16:04:42Z) - AI Chiller: An Open IoT Cloud Based Machine Learning Framework for the
Energy Saving of Building HVAC System via Big Data Analytics on the Fusion of
BMS and Environmental Data [12.681421165031576]
Energy saving and carbon emission reduction in buildings is one of the key measures in combating climate change.
The optimization of chiller system power consumption had been extensively studied in the mechanical engineering and building service domains.
With the advance of big data and AI, the adoption of machine learning into the optimization problems becomes popular.
arXiv Detail & Related papers (2020-10-09T09:51:03Z) - Ensemble Distillation for Robust Model Fusion in Federated Learning [72.61259487233214]
Federated Learning (FL) is a machine learning setting where many devices collaboratively train a machine learning model.
In most of the current training schemes the central model is refined by averaging the parameters of the server model and the updated parameters from the client side.
We propose ensemble distillation for model fusion, i.e. training the central classifier through unlabeled data on the outputs of the models from the clients.
arXiv Detail & Related papers (2020-06-12T14:49:47Z) - NeurOpt: Neural network based optimization for building energy
management and climate control [58.06411999767069]
We propose a data-driven control algorithm based on neural networks to reduce this cost of model identification.
We validate our learning and control algorithms on a two-story building with ten independently controlled zones, located in Italy.
arXiv Detail & Related papers (2020-01-22T00:51:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.