Automated Customization of On-Thing Inference for Quality-of-Experience
Enhancement
- URL: http://arxiv.org/abs/2112.06918v1
- Date: Sat, 11 Dec 2021 07:37:13 GMT
- Title: Automated Customization of On-Thing Inference for Quality-of-Experience
Enhancement
- Authors: Yang Bai, Lixing Chen, Shaolei Ren, Jie Xu
- Abstract summary: This paper studies automated customization for DL inference on IoT devices (termed as on-thing inference)
We use a novel online learning algorithm, NeuralUCB, that has excellent generalization ability for handling various user QoE patterns.
We design feedback solicitation schemes to reduce the number of QoE solicitations while maintaining the learning efficiency of NeuralUCB.
- Score: 27.16877467047541
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The rapid uptake of intelligent applications is pushing deep learning (DL)
capabilities to Internet-of-Things (IoT). Despite the emergence of new tools
for embedding deep neural networks (DNNs) into IoT devices, providing
satisfactory Quality of Experience (QoE) to users is still challenging due to
the heterogeneity in DNN architectures, IoT devices, and user preferences. This
paper studies automated customization for DL inference on IoT devices (termed
as on-thing inference), and our goal is to enhance user QoE by configuring the
on-thing inference with an appropriate DNN for users under different usage
scenarios. The core of our method is a DNN selection module that learns user
QoE patterns on-the-fly and identifies the best-fit DNN for on-thing inference
with the learned knowledge. It leverages a novel online learning algorithm,
NeuralUCB, that has excellent generalization ability for handling various user
QoE patterns. We also embed the knowledge transfer technique in NeuralUCB to
expedite the learning process. However, NeuralUCB frequently solicits QoE
ratings from users, which incurs non-negligible inconvenience. To address this
problem, we design feedback solicitation schemes to reduce the number of QoE
solicitations while maintaining the learning efficiency of NeuralUCB. A
pragmatic problem, aggregated QoE, is further investigated to improve the
practicality of our framework. We conduct experiments on both synthetic and
real-world data. The results indicate that our method efficiently learns the
user QoE pattern with few solicitations and provides drastic QoE enhancement
for IoT devices.
Related papers
- A Cutting-Edge Deep Learning Method For Enhancing IoT Security [0.0]
This paper proposes an innovative design of the Internet of Things (IoT) Environment Intrusion Detection System (or IDS) using Deep Learning-integrated Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks.
Our model, based on the CICIDS 2017 dataset, achieved an accuracy of 99.52% in classifying network traffic as either benign or malicious.
arXiv Detail & Related papers (2024-06-18T08:42:51Z) - The Internet of Senses: Building on Semantic Communications and Edge
Intelligence [67.75406096878321]
The Internet of Senses (IoS) holds the promise of flawless telepresence-style communication for all human receptors'
We elaborate on how the emerging semantic communications and Artificial Intelligence (AI)/Machine Learning (ML) paradigms may satisfy the requirements of IoS use cases.
arXiv Detail & Related papers (2022-12-21T03:37:38Z) - Fluid Batching: Exit-Aware Preemptive Serving of Early-Exit Neural
Networks on Edge NPUs [74.83613252825754]
"smart ecosystems" are being formed where sensing happens concurrently rather than standalone.
This is shifting the on-device inference paradigm towards deploying neural processing units (NPUs) at the edge.
We propose a novel early-exit scheduling that allows preemption at run time to account for the dynamicity introduced by the arrival and exiting processes.
arXiv Detail & Related papers (2022-09-27T15:04:01Z) - Adaptive Anomaly Detection for Internet of Things in Hierarchical Edge
Computing: A Contextual-Bandit Approach [81.5261621619557]
We propose an adaptive anomaly detection scheme with hierarchical edge computing (HEC)
We first construct multiple anomaly detection DNN models with increasing complexity, and associate each of them to a corresponding HEC layer.
Then, we design an adaptive model selection scheme that is formulated as a contextual-bandit problem and solved by using a reinforcement learning policy network.
arXiv Detail & Related papers (2021-08-09T08:45:47Z) - Self-Supervised Learning of Event-Based Optical Flow with Spiking Neural
Networks [3.7384509727711923]
A major challenge for neuromorphic computing is that learning algorithms for traditional artificial neural networks (ANNs) do not transfer directly to spiking neural networks (SNNs)
In this article, we focus on the self-supervised learning problem of optical flow estimation from event-based camera inputs.
We show that the performance of the proposed ANNs and SNNs are on par with that of the current state-of-the-art ANNs trained in a self-supervised manner.
arXiv Detail & Related papers (2021-06-03T14:03:41Z) - Fast On-Device Adaptation for Spiking Neural Networks via
Online-Within-Online Meta-Learning [31.78005607111787]
Spiking Neural Networks (SNNs) have recently gained popularity as machine learning models for on-device edge intelligence.
In this paper, we propose an online-within-online meta-learning rule for SNNs termed OWOML-SNN, that enables lifelong learning on a stream of tasks.
arXiv Detail & Related papers (2021-02-21T04:28:49Z) - Adversarially Robust and Explainable Model Compression with On-Device
Personalization for Text Classification [4.805959718658541]
On-device Deep Neural Networks (DNNs) have recently gained more attention due to the increasing computing power of mobile devices and the number of applications in Computer Vision (CV) and Natural Language Processing (NLP)
In NLP applications, although model compression has seen initial success, there are at least three major challenges yet to be addressed: adversarial robustness, explainability, and personalization.
Here we attempt to tackle these challenges by designing a new training scheme for model compression and adversarial robustness, including the optimization of an explainable feature mapping objective.
The resulting compressed model is personalized using on-device private training data via fine-
arXiv Detail & Related papers (2021-01-10T15:06:55Z) - Adaptive Contention Window Design using Deep Q-learning [46.49295424938727]
We study the problem of adaptive contention window (CW) design for random-access wireless networks.
Our goal is to design an intelligent node that can dynamically adapt its minimum CW (MCW) parameter to maximize a network-level utility.
arXiv Detail & Related papers (2020-11-18T17:31:18Z) - Deep Multi-Task Learning for Cooperative NOMA: System Design and
Principles [52.79089414630366]
We develop a novel deep cooperative NOMA scheme, drawing upon the recent advances in deep learning (DL)
We develop a novel hybrid-cascaded deep neural network (DNN) architecture such that the entire system can be optimized in a holistic manner.
arXiv Detail & Related papers (2020-07-27T12:38:37Z) - Progressive Tandem Learning for Pattern Recognition with Deep Spiking
Neural Networks [80.15411508088522]
Spiking neural networks (SNNs) have shown advantages over traditional artificial neural networks (ANNs) for low latency and high computational efficiency.
We propose a novel ANN-to-SNN conversion and layer-wise learning framework for rapid and efficient pattern recognition.
arXiv Detail & Related papers (2020-07-02T15:38:44Z) - Boosting Deep Neural Networks with Geometrical Prior Knowledge: A Survey [77.99182201815763]
Deep Neural Networks (DNNs) achieve state-of-the-art results in many different problem settings.
DNNs are often treated as black box systems, which complicates their evaluation and validation.
One promising field, inspired by the success of convolutional neural networks (CNNs) in computer vision tasks, is to incorporate knowledge about symmetric geometrical transformations.
arXiv Detail & Related papers (2020-06-30T14:56:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.