WattScope: Non-intrusive Application-level Power Disaggregation in
Datacenters
- URL: http://arxiv.org/abs/2309.12612v1
- Date: Fri, 22 Sep 2023 04:13:46 GMT
- Title: WattScope: Non-intrusive Application-level Power Disaggregation in
Datacenters
- Authors: Xiaoding Guan, Noman Bashir, David Irwin, Prashant Shenoy
- Abstract summary: WattScope is a system for non-intrusive estimating the power consumption of individual applications.
WattScope adapts and extends a machine learning-based technique for disaggregating building power.
- Score: 0.6086160084025234
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Datacenter capacity is growing exponentially to satisfy the increasing demand
for emerging computationally-intensive applications, such as deep learning.
This trend has led to concerns over datacenters' increasing energy consumption
and carbon footprint. The basic prerequisite for optimizing a datacenter's
energy- and carbon-efficiency is accurately monitoring and attributing energy
consumption to specific users and applications. Since datacenter servers tend
to be multi-tenant, i.e., they host many applications, server- and rack-level
power monitoring alone does not provide insight into their resident
applications' energy usage and carbon emissions. At the same time, current
application-level energy monitoring and attribution techniques are intrusive:
they require privileged access to servers and require coordinated support in
hardware and software, which is not always possible in cloud. To address the
problem, we design WattScope, a system for non-intrusively estimating the power
consumption of individual applications using external measurements of a
server's aggregate power usage without requiring direct access to the server's
operating system or applications. Our key insight is that, based on an analysis
of production traces, the power characteristics of datacenter workloads, e.g.,
low variability, low magnitude, and high periodicity, are highly amenable to
disaggregation of a server's total power consumption into application-specific
values. WattScope adapts and extends a machine learning-based technique for
disaggregating building power and applies it to server- and rack-level power
meter measurements in data centers. We evaluate WattScope's accuracy on a
production workload and show that it yields high accuracy, e.g., often <10%
normalized mean absolute error, and is thus a potentially useful tool for
datacenters in externally monitoring application-level power usage.
Related papers
- A Robust Power Model Training Framework for Cloud Native Runtime Energy Metric Exporter [2.661555989363678]
Estimating power consumption in modern Cloud environments is essential for carbon quantification toward green computing.
This paper examines multiple challenges associated with this goal.
The first challenge is that multiple customers are sharing the same hardware platform.
The second challenge is the overhead in power consumption that the Cloud platform control plane induces.
arXiv Detail & Related papers (2024-04-10T02:20:29Z) - A Reinforcement Learning Approach for Performance-aware Reduction in
Power Consumption of Data Center Compute Nodes [0.46040036610482665]
We use Reinforcement Learning to design a power capping policy on cloud compute nodes.
We show how a trained agent running on actual hardware can take actions by balancing power consumption and application performance.
arXiv Detail & Related papers (2023-08-15T23:25:52Z) - Non-Intrusive Electric Load Monitoring Approach Based on Current Feature
Visualization for Smart Energy Management [51.89904044860731]
We employ computer vision techniques of AI to design a non-invasive load monitoring method for smart electric energy management.
We propose to recognize all electric loads from color feature images using a U-shape deep neural network with multi-scale feature extraction and attention mechanism.
arXiv Detail & Related papers (2023-08-08T04:52:19Z) - Outsourcing Training without Uploading Data via Efficient Collaborative
Open-Source Sampling [49.87637449243698]
Traditional outsourcing requires uploading device data to the cloud server.
We propose to leverage widely available open-source data, which is a massive dataset collected from public and heterogeneous sources.
We develop a novel strategy called Efficient Collaborative Open-source Sampling (ECOS) to construct a proximal proxy dataset from open-source data for cloud training.
arXiv Detail & Related papers (2022-10-23T00:12:18Z) - How Much More Data Do I Need? Estimating Requirements for Downstream
Tasks [99.44608160188905]
Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance?
Overestimating or underestimating data requirements incurs substantial costs that could be avoided with an adequate budget.
Using our guidelines, practitioners can accurately estimate data requirements of machine learning systems to gain savings in both development time and data acquisition costs.
arXiv Detail & Related papers (2022-07-04T21:16:05Z) - Measuring the Carbon Intensity of AI in Cloud Instances [91.28501520271972]
We provide a framework for measuring software carbon intensity, and propose to measure operational carbon emissions.
We evaluate a suite of approaches for reducing emissions on the Microsoft Azure cloud compute platform.
arXiv Detail & Related papers (2022-06-10T17:04:04Z) - Treehouse: A Case For Carbon-Aware Datacenter Software [4.7521372297013365]
The end of Dennard scaling and the slowing of Moore's Law has put the energy use of datacenters on an unsustainable path.
We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach.
arXiv Detail & Related papers (2022-01-06T16:00:53Z) - Power Modeling for Effective Datacenter Planning and Compute Management [53.41102502425513]
We discuss two classes of statistical power models designed and validated to be accurate, simple, interpretable and applicable to all hardware configurations and workloads.
We demonstrate that the proposed statistical modeling techniques, while simple and scalable, predict power with less than 5% Mean Absolute Percent Error (MAPE) for more than 95% diverse Power Distribution Units (more than 2000) using only 4 features.
arXiv Detail & Related papers (2021-03-22T21:22:51Z) - Towards AIOps in Edge Computing Environments [60.27785717687999]
This paper describes the system design of an AIOps platform which is applicable in heterogeneous, distributed environments.
It is feasible to collect metrics with a high frequency and simultaneously run specific anomaly detection algorithms directly on edge devices.
arXiv Detail & Related papers (2021-02-12T09:33:00Z) - Catch Me If You Can: Using Power Analysis to Identify HPC Activity [0.35534933448684125]
We show how electrical power consumption data from an HPC platform can be used to identify what programs are executed.
We test our approach on an HPC rack at Lawrence Berkeley National Laboratory using a variety of scientific benchmarks.
arXiv Detail & Related papers (2020-05-06T20:57:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.