High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile
Edge Computing
- URL: http://arxiv.org/abs/2312.15850v1
- Date: Tue, 26 Dec 2023 02:05:52 GMT
- Title: High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile
Edge Computing
- Authors: Xin Yuan, Ning Li, Tuo Zhang, Muqing Li, Yuwen Chen, Jose Fernan
Martinez Ortega, Song Guo
- Abstract summary: Splitting the inference model between device, edge server, and cloud can improve the performance of EI greatly.
NOMA, which is the key supporting technologies of B5G/6G, can achieve massive connections and high spectrum efficiency.
We propose the effective communication and computing resource allocation algorithm to accelerate the model inference at edge.
- Score: 23.88527790721402
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Splitting the inference model between device, edge server, and cloud can
improve the performance of EI greatly. Additionally, the non-orthogonal
multiple access (NOMA), which is the key supporting technologies of B5G/6G, can
achieve massive connections and high spectrum efficiency. Motivated by the
benefits of NOMA, integrating NOMA with model split in MEC to reduce the
inference latency further becomes attractive. However, the NOMA based
communication during split inference has not been properly considered in
previous works. Therefore, in this paper, we integrate the NOMA into split
inference in MEC, and propose the effective communication and computing
resource allocation algorithm to accelerate the model inference at edge.
Specifically, when the mobile user has a large model inference task needed to
be calculated in the NOMA-based MEC, it will take the energy consumption of
both device and edge server and the inference latency into account to find the
optimal model split strategy, subchannel allocation strategy (uplink and
downlink), and transmission power allocation strategy (uplink and downlink).
Since the minimum inference delay and energy consumption cannot be satisfied
simultaneously, and the variables of subchannel allocation and model split are
discrete, the gradient descent (GD) algorithm is adopted to find the optimal
tradeoff between them. Moreover, the loop iteration GD approach (Li-GD) is
proposed to reduce the complexity of GD algorithm that caused by the parameter
discrete. Additionally, the properties of the proposed algorithm are also
investigated, which demonstrate the effectiveness of the proposed algorithms.
Related papers
- A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence [20.67035066213381]
An effective resource allocation algorithm is proposed in this paper, for accelerating split inference in edge intelligence.
The ERA takes the resource consumption, QoE, and inference latency into account to find the optimal model split strategy and resource allocation strategy.
The experimental results demonstrate that the performance of ERA is much better than that of the previous studies.
arXiv Detail & Related papers (2024-09-25T01:09:45Z) - Mobility and Cost Aware Inference Accelerating Algorithm for Edge
Intelligence [24.512525338942158]
The edge intelligence (EI) has been widely applied recently. Spliting the model between device, edge server, and cloud can improve the performance of EI greatly.
The model segmentation without user mobility has been investigated deeply by previous works.
We propose mobility and cost aware model segmentation and resource allocation algorithm for accelerating the inference at edge.
arXiv Detail & Related papers (2023-12-27T10:04:02Z) - A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical
Computation Offloading [62.34538208323411]
We propose a multi-head ensemble multi-task learning (MEMTL) approach with a shared backbone and multiple prediction heads (PHs)
MEMTL outperforms benchmark methods in both the inference accuracy and mean square error without requiring additional training data.
arXiv Detail & Related papers (2023-09-02T11:01:16Z) - Active RIS-aided EH-NOMA Networks: A Deep Reinforcement Learning
Approach [66.53364438507208]
An active reconfigurable intelligent surface (RIS)-aided multi-user downlink communication system is investigated.
Non-orthogonal multiple access (NOMA) is employed to improve spectral efficiency, and the active RIS is powered by energy harvesting (EH)
An advanced LSTM based algorithm is developed to predict users' dynamic communication state.
A DDPG based algorithm is proposed to joint control the amplification matrix and phase shift matrix RIS.
arXiv Detail & Related papers (2023-04-11T13:16:28Z) - High-Speed Resource Allocation Algorithm Using a Coherent Ising Machine
for NOMA Systems [3.6406488220483326]
A key challenge to fully utilizing the effectiveness of the NOMA technique is the optimization of the resource allocation.
We propose the coherent Ising machine (CIM) based optimization method for channel allocation in NOMA systems.
We show that our proposed method is superior in terms of speed and the attained optimal solutions.
arXiv Detail & Related papers (2022-12-03T09:22:54Z) - Federated Learning for Energy-limited Wireless Networks: A Partial Model
Aggregation Approach [79.59560136273917]
limited communication resources, bandwidth and energy, and data heterogeneity across devices are main bottlenecks for federated learning (FL)
We first devise a novel FL framework with partial model aggregation (PMA)
The proposed PMA-FL improves 2.72% and 11.6% accuracy on two typical heterogeneous datasets.
arXiv Detail & Related papers (2022-04-20T19:09:52Z) - Data-Driven Random Access Optimization in Multi-Cell IoT Networks with
NOMA [78.60275748518589]
Non-orthogonal multiple access (NOMA) is a key technology to enable massive machine type communications (mMTC) in 5G networks and beyond.
In this paper, NOMA is applied to improve the random access efficiency in high-density spatially-distributed multi-cell wireless IoT networks.
A novel formulation of random channel access management is proposed, in which the transmission probability of each IoT device is tuned to maximize the geometric mean of users' expected capacity.
arXiv Detail & Related papers (2021-01-02T15:21:08Z) - Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T20:22:05Z) - Adaptive Subcarrier, Parameter, and Power Allocation for Partitioned
Edge Learning Over Broadband Channels [69.18343801164741]
partitioned edge learning (PARTEL) implements parameter-server training, a well known distributed learning method, in wireless network.
We consider the case of deep neural network (DNN) models which can be trained using PARTEL by introducing some auxiliary variables.
arXiv Detail & Related papers (2020-10-08T15:27:50Z) - Distributed Optimization, Averaging via ADMM, and Network Topology [0.0]
We study the connection between network topology and convergence rates for different algorithms on a real world problem of sensor localization.
We also show interesting connections between ADMM and lifted Markov chains besides providing an explicitly characterization of its convergence.
arXiv Detail & Related papers (2020-09-05T21:44:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.