EPAM: A Predictive Energy Model for Mobile AI
- URL: http://arxiv.org/abs/2303.01509v1
- Date: Thu, 2 Mar 2023 09:11:23 GMT
- Title: EPAM: A Predictive Energy Model for Mobile AI
- Authors: Anik Mallik, Haoxin Wang, Jiang Xie, Dawei Chen, and Kyungtae Han
- Abstract summary: We introduce a comprehensive study of mobile AI applications considering different deep neural network (DNN) models and processing sources.
We measure the latency, energy consumption, and memory usage of all the models using four processing sources.
Our study highlights important insights, such as how mobile AI behaves in different applications (vision and non-vision) using CPU, GPU, and NNAPI.
- Score: 6.451060076703027
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Artificial intelligence (AI) has enabled a new paradigm of smart applications
-- changing our way of living entirely. Many of these AI-enabled applications
have very stringent latency requirements, especially for applications on mobile
devices (e.g., smartphones, wearable devices, and vehicles). Hence, smaller and
quantized deep neural network (DNN) models are developed for mobile devices,
which provide faster and more energy-efficient computation for mobile AI
applications. However, how AI models consume energy in a mobile device is still
unexplored. Predicting the energy consumption of these models, along with their
different applications, such as vision and non-vision, requires a thorough
investigation of their behavior using various processing sources. In this
paper, we introduce a comprehensive study of mobile AI applications considering
different DNN models and processing sources, focusing on computational resource
utilization, delay, and energy consumption. We measure the latency, energy
consumption, and memory usage of all the models using four processing sources
through extensive experiments. We explain the challenges in such investigations
and how we propose to overcome them. Our study highlights important insights,
such as how mobile AI behaves in different applications (vision and non-vision)
using CPU, GPU, and NNAPI. Finally, we propose a novel Gaussian process
regression-based general predictive energy model based on DNN structures,
computation resources, and processors, which can predict the energy for each
complete application cycle irrespective of device configuration and
application. This study provides crucial facts and an energy prediction
mechanism to the AI research community to help bring energy efficiency to
mobile AI applications.
Related papers
- Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices [0.0]
We evaluate state-of-the-art object detection models, including YOLOv8 (Nano, Small, Medium), EfficientDet Lite (Lite0, Lite1, Lite2), and SSD (SSD MobileNet V1, SSDLite MobileDet)
We deployed these models on popular edge devices like the Raspberry Pi 3, 4, and 5 with/without TPU accelerators, and Jetson Orin Nano, collecting key performance metrics such as energy consumption, inference time, and Mean Average Precision (mAP)
Our findings highlight that lower mAP models such as SSD MobileNet V1 are more energy-efficient and faster in
arXiv Detail & Related papers (2024-09-25T10:56:49Z) - Power Hungry Processing: Watts Driving the Cost of AI Deployment? [74.19749699665216]
generative, multi-purpose AI systems promise a unified approach to building machine learning (ML) models into technology.
This ambition of generality'' comes at a steep cost to the environment, given the amount of energy these systems require and the amount of carbon that they emit.
We measure deployment cost as the amount of energy and carbon required to perform 1,000 inferences on representative benchmark dataset using these models.
We conclude with a discussion around the current trend of deploying multi-purpose generative ML systems, and caution that their utility should be more intentionally weighed against increased costs in terms of energy and emissions
arXiv Detail & Related papers (2023-11-28T15:09:36Z) - On the Opportunities of Green Computing: A Survey [80.21955522431168]
Artificial Intelligence (AI) has achieved significant advancements in technology and research with the development over several decades.
The needs for high computing power brings higher carbon emission and undermines research fairness.
To tackle the challenges of computing resources and environmental impact of AI, Green Computing has become a hot research topic.
arXiv Detail & Related papers (2023-11-01T11:16:41Z) - The Two Faces of AI in Green Mobile Computing: A Literature Review [2.6763498831034034]
We present a review of the literature of the past decade on the usage of artificial intelligence within the realm of green mobile computing.
From the analysis of 34 papers, we highlight the emerging patterns and map the field into 13 main topics that are summarized in details.
arXiv Detail & Related papers (2023-07-21T15:18:10Z) - Towards Implementing Energy-aware Data-driven Intelligence for Smart
Health Applications on Mobile Platforms [4.648824029505978]
On-device deep learning frameworks are proficient in utilizing computing resources in mobile platforms seamlessly.
However, energy resources in a mobile device are typically limited.
We introduce a new framework through an energy-aware, adaptive model comprehension and realization.
arXiv Detail & Related papers (2023-02-01T15:34:24Z) - Precise Energy Consumption Measurements of Heterogeneous Artificial
Intelligence Workloads [0.534434568021034]
We present measurements of the energy consumption of two typical applications of deep learning models on different types of compute nodes.
One advantage of our approach is that the information on energy consumption is available to all users of the supercomputer.
arXiv Detail & Related papers (2022-12-03T21:40:55Z) - Fluid Batching: Exit-Aware Preemptive Serving of Early-Exit Neural
Networks on Edge NPUs [74.83613252825754]
"smart ecosystems" are being formed where sensing happens concurrently rather than standalone.
This is shifting the on-device inference paradigm towards deploying neural processing units (NPUs) at the edge.
We propose a novel early-exit scheduling that allows preemption at run time to account for the dynamicity introduced by the arrival and exiting processes.
arXiv Detail & Related papers (2022-09-27T15:04:01Z) - Enable Deep Learning on Mobile Devices: Methods, Systems, and
Applications [46.97774949613859]
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial intelligence (AI)
However, their superior performance comes at the considerable cost of computational complexity.
This paper provides an overview of efficient deep learning methods, systems and applications.
arXiv Detail & Related papers (2022-04-25T16:52:48Z) - Learning, Computing, and Trustworthiness in Intelligent IoT
Environments: Performance-Energy Tradeoffs [62.91362897985057]
An Intelligent IoT Environment (iIoTe) is comprised of heterogeneous devices that can collaboratively execute semi-autonomous IoT applications.
This paper provides a state-of-the-art overview of these technologies and illustrates their functionality and performance, with special attention to the tradeoff among resources, latency, privacy and energy consumption.
arXiv Detail & Related papers (2021-10-04T19:41:42Z) - Smart at what cost? Characterising Mobile Deep Neural Networks in the
wild [16.684419342012674]
This paper is the first holistic study of Deep Neural Network (DNN) usage in the wild.
We analyse over 16k of the most popular apps in the Google Play Store.
We measure the models' energy footprint, as a core cost dimension of any mobile deployment.
arXiv Detail & Related papers (2021-09-28T18:09:29Z) - Energy Drain of the Object Detection Processing Pipeline for Mobile
Devices: Analysis and Implications [77.00418462388525]
This paper presents the first detailed experimental study of a mobile augmented reality (AR) client's energy consumption and the detection latency of executing Convolutional Neural Networks (CNN) based object detection.
Our detailed measurements refine the energy analysis of mobile AR clients and reveal several interesting perspectives regarding the energy consumption of executing CNN-based object detection.
arXiv Detail & Related papers (2020-11-26T00:32:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.