Enabling AI Quality Control via Feature Hierarchical Edge Inference
- URL: http://arxiv.org/abs/2211.07860v1
- Date: Tue, 15 Nov 2022 02:54:23 GMT
- Title: Enabling AI Quality Control via Feature Hierarchical Edge Inference
- Authors: Jinhyuk Choi, Seong-Lyun Kim, Seung-Woo Ko
- Abstract summary: This work proposes a feature hierarchical EI (FHEI) comprising feature network and inference network deployed at an edge server and corresponding mobile.
A higher scale feature requires more computation and communication loads while it provides a better AI quality.
It is verified by extensive simulations that the proposed joint communication-and-computation control on FHEI architecture always outperforms several benchmarks.
- Score: 6.490724361345847
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the rise of edge computing, various AI services are expected to be
available at a mobile side through the inference based on deep neural network
(DNN) operated at the network edge, called edge inference (EI). On the other
hand, the resulting AI quality (e.g., mean average precision in objective
detection) has been regarded as a given factor, and AI quality control has yet
to be explored despite its importance in addressing the diverse demands of
different users. This work aims at tackling the issue by proposing a feature
hierarchical EI (FHEI), comprising feature network and inference network
deployed at an edge server and corresponding mobile, respectively.
Specifically, feature network is designed based on feature hierarchy, a
one-directional feature dependency with a different scale. A higher scale
feature requires more computation and communication loads while it provides a
better AI quality. The tradeoff enables FHEI to control AI quality gradually
w.r.t. communication and computation loads, leading to deriving a
near-to-optimal solution to maximize multi-user AI quality under the
constraints of uplink \& downlink transmissions and edge server and mobile
computation capabilities. It is verified by extensive simulations that the
proposed joint communication-and-computation control on FHEI architecture
always outperforms several benchmarks by differentiating each user's AI quality
depending on the communication and computation conditions.
Related papers
- AI Flow at the Network Edge [58.31090055138711]
AI Flow is a framework that streamlines the inference process by jointly leveraging the heterogeneous resources available across devices, edge nodes, and cloud servers.
This article serves as a position paper for identifying the motivation, challenges, and principles of AI Flow.
arXiv Detail & Related papers (2024-11-19T12:51:17Z) - Resource-Efficient Sensor Fusion via System-Wide Dynamic Gated Neural Networks [16.0018681576301]
We propose a novel algorithmic strategy called Quantile-constrained Inference (QIC)
QIC makes joint, high-quality, swift decisions on all the above aspects of the system.
Our results confirm that QIC matches the optimum and outperforms its alternatives by over 80%.
arXiv Detail & Related papers (2024-10-22T06:12:04Z) - Offloading and Quality Control for AI Generated Content Services in 6G Mobile Edge Computing Networks [18.723955271182007]
This paper proposes a joint optimization algorithm for offloading decisions, computation time, and diffusion steps of the diffusion models in the reverse diffusion stage.
Experimental results conclusively demonstrate that the proposed algorithm achieves superior joint optimization performance compared to the baselines.
arXiv Detail & Related papers (2023-12-11T08:36:27Z) - Federated Learning-Empowered AI-Generated Content in Wireless Networks [58.48381827268331]
Federated learning (FL) can be leveraged to improve learning efficiency and achieve privacy protection for AIGC.
We present FL-based techniques for empowering AIGC, and aim to enable users to generate diverse, personalized, and high-quality content.
arXiv Detail & Related papers (2023-07-14T04:13:11Z) - Large Language Models Empowered Autonomous Edge AI for Connected
Intelligence [51.269276328087855]
Edge artificial intelligence (Edge AI) is a promising solution to achieve connected intelligence.
This article presents a vision of autonomous edge AI systems that automatically organize, adapt, and optimize themselves to meet users' diverse requirements.
arXiv Detail & Related papers (2023-07-06T05:16:55Z) - Integrated Sensing-Communication-Computation for Edge Artificial Intelligence [41.611639821262415]
Integrated sensing-communication-computation (I SCC) is of paramount significance for improving resource utilization.
This article presents various kinds of I SCC schemes for federated edge learning tasks and edge AI inference tasks in both application and physical layers.
arXiv Detail & Related papers (2023-06-01T21:35:20Z) - Enabling AI-Generated Content (AIGC) Services in Wireless Edge Networks [68.00382171900975]
In wireless edge networks, the transmission of incorrectly generated content may unnecessarily consume network resources.
We present the AIGC-as-a-service concept and discuss the challenges in deploying A at the edge networks.
We propose a deep reinforcement learning-enabled algorithm for optimal ASP selection.
arXiv Detail & Related papers (2023-01-09T09:30:23Z) - Task-Oriented Sensing, Computation, and Communication Integration for
Multi-Device Edge AI [108.08079323459822]
This paper studies a new multi-intelligent edge artificial-latency (AI) system, which jointly exploits the AI model split inference and integrated sensing and communication (ISAC)
We measure the inference accuracy by adopting an approximate but tractable metric, namely discriminant gain.
arXiv Detail & Related papers (2022-07-03T06:57:07Z) - Reconfigurable Intelligent Surface Assisted Mobile Edge Computing with
Heterogeneous Learning Tasks [53.1636151439562]
Mobile edge computing (MEC) provides a natural platform for AI applications.
We present an infrastructure to perform machine learning tasks at an MEC with the assistance of a reconfigurable intelligent surface (RIS)
Specifically, we minimize the learning error of all participating users by jointly optimizing transmit power of mobile users, beamforming vectors of the base station, and the phase-shift matrix of the RIS.
arXiv Detail & Related papers (2020-12-25T07:08:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.