SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML
Acceleration
- URL: http://arxiv.org/abs/2304.13726v1
- Date: Tue, 11 Apr 2023 08:52:48 GMT
- Title: SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML
Acceleration
- Authors: Ivan Miro-Panades (LSTA), Benoit Tain (LECA), Jean-Frederic Christmann
(LFIM), David Coriat (LIIM), Romain Lemaire (LIIM), Clement Jany, Baudouin
Martineau (DSYS), Fabrice Chaix (DSYS), Guillaume Waltener (DSYS), Emmanuel
Pluchart (LSTA), Jean-Philippe Noel (LFIM), Adam Makosiej, Maxime Montoya,
Simone Bacles-Min (LIIM), David Briand (LIAE), Jean-Marc Philippe, Yvain
Thonnart (LFIM), Alexandre Valentian (LSTA), Frederic Heitzmann (DSYS),
Fabien Clermidy (DSCIN)
- Abstract summary: This paper presents SamurAI, a versatile IoT node bridging this gap in processing and in energy by leveraging two on-chip sub-systems.
AR contains a 1.7MOPS event-driven, asynchronous Wake-up Controller (WuC) with a 207ns wake-up time optimized for sporadic computing.
OD combines a deep-sleep RISC-V CPU and 1.3TOPS/W Machine Learning (ML) for more complex tasks up to 36GOPS.
- Score: 37.89976990030855
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Increased capabilities such as recognition and self-adaptability are now
required from IoT applications. While IoT node power consumption is a major
concern for these applications, cloud-based processing is becoming
unsustainable due to continuous sensor or image data transmission over the
wireless network. Thus optimized ML capabilities and data transfers should be
integrated in the IoT node. Moreover, IoT applications are torn between
sporadic data-logging and energy-hungry data processing (e.g. image
classification). Thus, the versatility of the node is key in addressing this
wide diversity of energy and processing needs. This paper presents SamurAI, a
versatile IoT node bridging this gap in processing and in energy by leveraging
two on-chip sub-systems: a low power, clock-less, event-driven
Always-Responsive (AR) part and an energy-efficient On-Demand (OD) part. AR
contains a 1.7MOPS event-driven, asynchronous Wake-up Controller (WuC) with a
207ns wake-up time optimized for sporadic computing, while OD combines a
deep-sleep RISC-V CPU and 1.3TOPS/W Machine Learning (ML) for more complex
tasks up to 36GOPS. This architecture partitioning achieves best in class
versatility metrics such as peak performance to idle power ratio. On an
applicative classification scenario, it demonstrates system power gains, up to
3.5x compared to cloud-based processing, and thus extended battery lifetime.
Related papers
- Resource Efficient Asynchronous Federated Learning for Digital Twin Empowered IoT Network [29.895766751146155]
Digital twin (DT) can provide real-time status and dynamic topology mapping for Internet of Things (IoT) devices.
We develop a dynamic resource scheduling algorithm tailored for the asynchronous federated learning (FL)-based lightweight DT empowered IoT network.
Specifically, our approach aims to minimize a multi-objective function that encompasses both energy consumption and latency.
arXiv Detail & Related papers (2024-08-26T14:28:51Z) - Energy-Efficient Edge Learning via Joint Data Deepening-and-Prefetching [9.468399367975984]
We propose a novel offloading architecture called joint data deepening-and-prefetching (JD2P)
JD2P is feature-by-feature offloading comprising two key techniques.
We evaluate the effectiveness of JD2P through experiments using the MNIST dataset.
arXiv Detail & Related papers (2024-02-19T08:12:47Z) - Application and Energy-Aware Data Aggregation using Vector
Synchronization in Distributed Battery-less IoT Networks [2.94944680995069]
The battery-less Internet of Things (IoT) devices are a key element in the sustainable green initiative for the next-generation wireless networks.
These battery-free devices use the ambient energy, harvested from the environment.
The main goal is to provide a mechanism to aggregate the sensor data and provide a sustainable application support in the distributed battery-less IoT network.
arXiv Detail & Related papers (2023-11-02T07:51:23Z) - Multi-Objective Optimization for UAV Swarm-Assisted IoT with Virtual
Antenna Arrays [55.736718475856726]
Unmanned aerial vehicle (UAV) network is a promising technology for assisting Internet-of-Things (IoT)
Existing UAV-assisted data harvesting and dissemination schemes require UAVs to frequently fly between the IoTs and access points.
We introduce collaborative beamforming into IoTs and UAVs simultaneously to achieve energy and time-efficient data harvesting and dissemination.
arXiv Detail & Related papers (2023-08-03T02:49:50Z) - Computational Intelligence and Deep Learning for Next-Generation
Edge-Enabled Industrial IoT [51.68933585002123]
We investigate how to deploy computational intelligence and deep learning (DL) in edge-enabled industrial IoT networks.
In this paper, we propose a novel multi-exit-based federated edge learning (ME-FEEL) framework.
In particular, the proposed ME-FEEL can achieve an accuracy gain up to 32.7% in the industrial IoT networks with the severely limited resources.
arXiv Detail & Related papers (2021-10-28T08:14:57Z) - Dynamic Distribution of Edge Intelligence at the Node Level for Internet
of Things [1.4026258162876617]
Dynamic deployment of Convolutional Neural Network (CNN) architecture is proposed utilizing only IoT-level devices.
By partitioning and pipelining the CNN, it horizontally distributes the computation load among resource-constrained devices.
Results show that throughput can be increased by 1.55x to 1.75x for sharing the CNN into two and three resource-constrained devices.
arXiv Detail & Related papers (2021-07-13T03:26:36Z) - Energy-Efficient Model Compression and Splitting for Collaborative
Inference Over Time-Varying Channels [52.60092598312894]
We propose a technique to reduce the total energy bill at the edge device by utilizing model compression and time-varying model split between the edge and remote nodes.
Our proposed solution results in minimal energy consumption and $CO$ emission compared to the considered baselines.
arXiv Detail & Related papers (2021-06-02T07:36:27Z) - EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware
Multi-Task NLP Inference [82.1584439276834]
Transformer-based language models such as BERT provide significant accuracy improvement for a multitude of natural language processing (NLP) tasks.
We present EdgeBERT, an in-depth algorithm- hardware co-design for latency-aware energy optimization for multi-task NLP.
arXiv Detail & Related papers (2020-11-28T19:21:47Z) - Optimizing Resource-Efficiency for Federated Edge Intelligence in IoT
Networks [96.24723959137218]
We study an edge intelligence-based IoT network in which a set of edge servers learn a shared model using federated learning (FL)
We propose a novel framework, called federated edge intelligence (FEI), that allows edge servers to evaluate the required number of data samples according to the energy cost of the IoT network.
We prove that our proposed algorithm does not cause any data leakage nor disclose any topological information of the IoT network.
arXiv Detail & Related papers (2020-11-25T12:51:59Z) - Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments
using A3C learning and Residual Recurrent Neural Networks [30.61220416710614]
A-Advantage-Actor-Critic (A3C) learning is known to quickly adapt to dynamic scenarios with less data and Residual Recurrent Neural Network (R2N2) to quickly update model parameters.
We use the R2N2 architecture to capture a large number of host and task parameters together with temporal patterns to provide efficient scheduling decisions.
Experiments conducted on real-world data set show a significant improvement in terms of energy consumption, response time, ServiceLevelAgreement and running cost by 14.4%, 7.74%, 31.9%, and 4.64%, respectively.
arXiv Detail & Related papers (2020-09-01T13:36:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.