An Energy-Aware Approach to Design Self-Adaptive AI-based Applications
on the Edge
- URL: http://arxiv.org/abs/2309.00022v1
- Date: Thu, 31 Aug 2023 09:33:44 GMT
- Title: An Energy-Aware Approach to Design Self-Adaptive AI-based Applications
on the Edge
- Authors: Alessandro Tundo, Marco Mobilio, Shashikant Ilager, Ivona Brandi\'c,
Ezio Bartocci, Leonardo Mariani
- Abstract summary: We present an energy-aware approach for the design and deployment of self-adaptive AI-based applications.
We address the problem of determining the set of configurations that can be used to self-adapt the system with a meta-heuristic search procedure.
Results show that our self-adaptive application can outperform non-adaptive baseline configurations by saving up to 81% of energy while loosing only between 2% and 6% in accuracy.
- Score: 42.462246527457594
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: The advent of edge devices dedicated to machine learning tasks enabled the
execution of AI-based applications that efficiently process and classify the
data acquired by the resource-constrained devices populating the Internet of
Things. The proliferation of such applications (e.g., critical monitoring in
smart cities) demands new strategies to make these systems also sustainable
from an energetic point of view.
In this paper, we present an energy-aware approach for the design and
deployment of self-adaptive AI-based applications that can balance application
objectives (e.g., accuracy in object detection and frames processing rate) with
energy consumption. We address the problem of determining the set of
configurations that can be used to self-adapt the system with a meta-heuristic
search procedure that only needs a small number of empirical samples. The final
set of configurations are selected using weighted gray relational analysis, and
mapped to the operation modes of the self-adaptive application.
We validate our approach on an AI-based application for pedestrian detection.
Results show that our self-adaptive application can outperform non-adaptive
baseline configurations by saving up to 81\% of energy while loosing only
between 2% and 6% in accuracy.
Related papers
- Neural Horizon Model Predictive Control -- Increasing Computational Efficiency with Neural Networks [0.0]
We propose a proposed machine-learning supported approach to model predictive control.
We propose approximating part of the problem horizon, while maintaining safety guarantees.
The proposed MPC scheme can be applied to a wide range of applications, including those requiring a rapid control response.
arXiv Detail & Related papers (2024-08-19T08:13:37Z) - Automatic AI Model Selection for Wireless Systems: Online Learning via Digital Twinning [50.332027356848094]
AI-based applications are deployed at intelligent controllers to carry out functionalities like scheduling or power control.
The mapping between context and AI model parameters is ideally done in a zero-shot fashion.
This paper introduces a general methodology for the online optimization of AMS mappings.
arXiv Detail & Related papers (2024-06-22T11:17:50Z) - Machine Learning Insides OptVerse AI Solver: Design Principles and
Applications [74.67495900436728]
We present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI solver.
We showcase our methods for generating complex SAT and MILP instances utilizing generative models that mirror multifaceted structures of real-world problem.
We detail the incorporation of state-of-the-art parameter tuning algorithms which markedly elevate solver performance.
arXiv Detail & Related papers (2024-01-11T15:02:15Z) - Energy-efficient Task Adaptation for NLP Edge Inference Leveraging
Heterogeneous Memory Architectures [68.91874045918112]
adapter-ALBERT is an efficient model optimization for maximal data reuse across different tasks.
We demonstrate the advantage of mapping the model to a heterogeneous on-chip memory architecture by performing simulations on a validated NLP edge accelerator.
arXiv Detail & Related papers (2023-03-25T14:40:59Z) - FELARE: Fair Scheduling of Machine Learning Applications on
Heterogeneous Edge Systems [5.165692107696155]
Edge computing enables smart IoT-based systems via concurrent and continuous execution of latency-sensitive machine learning (ML) applications.
We study and analyze resource allocation solutions that can increase the on-time task completion rate while considering the energy constraint.
We observed 8.9% improvement in on-time task completion rate and 12.6% in energy-saving without imposing any significant overhead on the edge system.
arXiv Detail & Related papers (2022-05-31T19:19:40Z) - Which Design Decisions in AI-enabled Mobile Applications Contribute to
Greener AI? [7.194465440864905]
This report consists of a plan to conduct an empirical study to quantify the implications of the design decisions on AI-enabled applications performance.
We will implement both image-based and language-based neural networks in mobile applications to solve multiple image classification and text classification problems.
arXiv Detail & Related papers (2021-09-28T07:30:28Z) - Towards AIOps in Edge Computing Environments [60.27785717687999]
This paper describes the system design of an AIOps platform which is applicable in heterogeneous, distributed environments.
It is feasible to collect metrics with a high frequency and simultaneously run specific anomaly detection algorithms directly on edge devices.
arXiv Detail & Related papers (2021-02-12T09:33:00Z) - Modular approach to data preprocessing in ALOHA and application to a
smart industry use case [0.0]
The paper addresses a modular approach, integrated into the ALOHA tool flow, to support the data preprocessing and transformation pipeline.
To demonstrate the effectiveness of the approach, we present some experimental results related to a keyword spotting use case.
arXiv Detail & Related papers (2021-02-02T06:48:51Z) - Reconfigurable Intelligent Surface Assisted Mobile Edge Computing with
Heterogeneous Learning Tasks [53.1636151439562]
Mobile edge computing (MEC) provides a natural platform for AI applications.
We present an infrastructure to perform machine learning tasks at an MEC with the assistance of a reconfigurable intelligent surface (RIS)
Specifically, we minimize the learning error of all participating users by jointly optimizing transmit power of mobile users, beamforming vectors of the base station, and the phase-shift matrix of the RIS.
arXiv Detail & Related papers (2020-12-25T07:08:50Z) - Context-Aware Wireless Connectivity and Processing Unit Optimization for
IoT Networks [10.248295944860963]
The proposed approach simultaneously selects the best connectivity and processing unit along with the percentage of data to be offloaded by jointly optimizing energy consumption, response-time, security, and monetary cost.
The requirements of IoT devices in terms of response-time and security are taken as inputs along with the remaining battery level of the devices, and the developed algorithm returns an optimized policy.
arXiv Detail & Related papers (2020-04-30T02:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.