Graph Reinforcement Learning-based CNN Inference Offloading in Dynamic
  Edge Computing
- URL: http://arxiv.org/abs/2210.13464v1
- Date: Mon, 24 Oct 2022 07:17:20 GMT
- Title: Graph Reinforcement Learning-based CNN Inference Offloading in Dynamic
  Edge Computing
- Authors: Nan Li, Alexandros Iosifidis, Qi Zhang
- Abstract summary: This paper addresses the computational offloading of CNN inference in dynamic multi-access edge computing (MEC) networks.
We propose a graph reinforcement learning-based early-exit mechanism (GRLE) which outperforms the state-of-the-art work.
The experimental results show that GRLE achieves an average accuracy up to 3.41x that of graph reinforcement learning (GRL) and 1.45x that of DROOE.
- Score: 93.67044879636093
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   This paper studies the computational offloading of CNN inference in dynamic
multi-access edge computing (MEC) networks. To address the uncertainties in
communication time and edge servers' available capacity, we use an early-exit
mechanism to terminate the computation earlier and meet the deadline of
inference tasks. We design a reward function to trade off communication,
computation, and inference accuracy, and formulate the offloading problem of CNN
inference as a maximization problem with the goal of maximizing the average
inference accuracy and throughput in the long term. To solve the maximization
problem, we propose a graph reinforcement learning-based early-exit mechanism
(GRLE), which outperforms the state-of-the-art work, deep reinforcement
learning-based online offloading (DROO), and its enhanced method, DROO with
early-exit mechanism (DROOE), under different dynamic scenarios. The
experimental results show that GRLE achieves an average accuracy up to 3.41x
that of graph reinforcement learning (GRL) and 1.45x that of DROOE, demonstrating
the advantages of GRLE for offloading decision-making in dynamic MEC.
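
The early-exit idea in the abstract can be illustrated with a small sketch. This is not the GRLE algorithm, which learns offloading decisions with graph reinforcement learning; it is only a greedy rule, under assumed numbers, that picks the most accurate exit whose communication plus computation time still meets the deadline. The `ExitPoint` type, the `select_exit` helper, and every workload, accuracy, and timing value below are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class ExitPoint:
    """One early-exit branch of a CNN (all values are hypothetical)."""
    name: str
    workload: float  # cumulative computation up to this exit, in GFLOPs
    accuracy: float  # expected inference accuracy when exiting here


def select_exit(
    exits: Sequence[ExitPoint],
    comm_time: float,  # seconds spent uploading the task to the edge server
    capacity: float,   # edge server capacity currently available, in GFLOP/s
    deadline: float,   # task deadline, in seconds
) -> Optional[ExitPoint]:
    """Return the most accurate exit whose total latency meets the deadline.

    Total latency is modeled as communication time plus workload / capacity.
    Returns None when even the earliest exit would miss the deadline.
    """
    feasible = [
        e for e in exits
        if comm_time + e.workload / capacity <= deadline
    ]
    return max(feasible, key=lambda e: e.accuracy, default=None)


# Hypothetical exits of a three-branch CNN.
EXITS = [
    ExitPoint("exit_1", workload=0.5, accuracy=0.60),
    ExitPoint("exit_2", workload=1.0, accuracy=0.75),
    ExitPoint("exit_3", workload=2.0, accuracy=0.90),
]

if __name__ == "__main__":
    # With less available capacity, the task has to leave the network earlier.
    for capacity in (40.0, 20.0):
        chosen = select_exit(EXITS, comm_time=0.02, capacity=capacity, deadline=0.1)
        print(capacity, chosen.name if chosen else "deadline missed")
```

In this toy model, a drop in available server capacity or a longer upload pushes the choice toward an earlier, less accurate exit, which is the trade-off between communication, computation, and accuracy that the abstract's reward function is described as balancing.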
 
      
Related papers
- Statistical physics analysis of graph neural networks: Approaching optimality in the contextual stochastic block model [0.0]
 Graph neural networks (GNNs) are designed to process data associated with graphs.
GNNs can encounter difficulties in gathering information from nodes far apart by iterated aggregation steps.
We show how the architecture of the GCN has to scale with the depth to avoid oversmoothing.
 arXiv  Detail & Related papers  (2025-03-03T09:55:10Z)
- Gradient Transformation: Towards Efficient and Model-Agnostic Unlearning for Dynamic Graph Neural Networks [66.70786325911124]
 Graph unlearning has emerged as an essential tool for safeguarding user privacy and mitigating the negative impacts of undesirable data.
With the increasing prevalence of DGNNs, it becomes imperative to investigate the implementation of dynamic graph unlearning.
We propose an effective, efficient, model-agnostic, and post-processing method to implement DGNN unlearning.
 arXiv  Detail & Related papers  (2024-05-23T10:26:18Z)
- Dynamic Semantic Compression for CNN Inference in Multi-access Edge
  Computing: A Graph Reinforcement Learning-based Autoencoder [82.8833476520429]
 We propose a novel semantic compression method, autoencoder-based CNN architecture (AECNN) for effective semantic extraction and compression in partial offloading.
In the semantic encoder, we introduce a feature compression module based on the channel attention mechanism in CNNs, to compress intermediate data by selecting the most informative features.
In the semantic decoder, we design a lightweight decoder to reconstruct the intermediate data through learning from the received compressed data to improve accuracy.
 arXiv  Detail & Related papers  (2024-01-19T15:19:47Z)
- A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical
  Computation Offloading [62.34538208323411]
 We propose a multi-head ensemble multi-task learning (MEMTL) approach with a shared backbone and multiple prediction heads (PHs).
MEMTL outperforms benchmark methods in both the inference accuracy and mean square error without requiring additional training data.
 arXiv  Detail & Related papers  (2023-09-02T11:01:16Z)
- GIF: A General Graph Unlearning Strategy via Influence Function [63.52038638220563]
 Graph Influence Function (GIF) is a model-agnostic unlearning method that can efficiently and accurately estimate parameter changes in response to an $\epsilon$-mass perturbation in deleted data.
We conduct extensive experiments on four representative GNN models and three benchmark datasets to justify GIF's superiority in terms of unlearning efficacy, model utility, and unlearning efficiency.
 arXiv  Detail & Related papers  (2023-04-06T03:02:54Z)
- Unlearning Graph Classifiers with Limited Data Resources [39.29148804411811]
 Controlled data removal is becoming an important feature of machine learning models for data-sensitive Web applications.
It is still largely unknown how to perform efficient machine unlearning of graph neural networks (GNNs).
Our main contribution is the first known nonlinear approximate graph unlearning method based on graph scattering transforms (GSTs).
Our second contribution is a theoretical analysis of the computational complexity of the proposed unlearning mechanism.
Our third contribution is a set of extensive simulation results which show that, compared to complete retraining of GNNs after each removal request, the new GST-based approach offers, on average, a 10.38x speed-up.
 arXiv  Detail & Related papers  (2022-11-06T20:46:50Z)
- Efficient Graph Neural Network Inference at Large Scale [54.89457550773165]
 Graph neural networks (GNNs) have demonstrated excellent performance in a wide range of applications.
Existing scalable GNNs leverage linear propagation to preprocess the features and accelerate the training and inference procedure.
We propose a novel adaptive propagation order approach that generates the personalized propagation order for each node based on its topological information.
 arXiv  Detail & Related papers  (2022-11-01T14:38:18Z)
- GNN at the Edge: Cost-Efficient Graph Neural Network Processing over
  Distributed Edge Servers [24.109721494781592]
 Graph Neural Networks (GNNs) are still under exploration, presenting a stark disparity to their broad adoption at the edge.
This paper studies the cost optimization for distributed GNN processing over a multi-tier heterogeneous edge network.
We show that our approach achieves superior performance over de facto baselines, with more than 95.8% cost reduction and fast convergence.
 arXiv  Detail & Related papers  (2022-10-31T13:03:16Z)
- Deep Learning for Ultra-Reliable and Low-Latency Communications in 6G
  Networks [84.2155885234293]
 We first summarize how to apply data-driven supervised deep learning and deep reinforcement learning in URLLC.
To address these open problems, we develop a multi-level architecture that enables device intelligence, edge intelligence, and cloud intelligence for URLLC.
 arXiv  Detail & Related papers  (2020-02-22T14:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     