A Survey on Collaborative DNN Inference for Edge Intelligence
- URL: http://arxiv.org/abs/2207.07812v1
- Date: Sat, 16 Jul 2022 02:32:35 GMT
- Title: A Survey on Collaborative DNN Inference for Edge Intelligence
- Authors: Weiqing Ren, Yuben Qu, Chao Dong, Yuqian Jing, Hao Sun, Qihui Wu, Song
Guo
- Abstract summary: Edge intelligence (EI) becomes a cutting-edge direction in the field of AI.
In this paper, we classify four typical collaborative DNN inference paradigms for EI, and analyze the characteristics and key technologies of them.
- Score: 22.691247982285432
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the vigorous development of artificial intelligence (AI), the
intelligent applications based on deep neural network (DNN) change people's
lifestyles and the production efficiency. However, the huge amount of
computation and data generated from the network edge becomes the major
bottleneck, and traditional cloud-based computing mode has been unable to meet
the requirements of real-time processing tasks. To solve the above problems, by
embedding AI model training and inference capabilities into the network edge,
edge intelligence (EI) becomes a cutting-edge direction in the field of AI.
Furthermore, collaborative DNN inference among the cloud, edge, and end device
provides a promising way to boost the EI. Nevertheless, at present, EI oriented
collaborative DNN inference is still in its early stage, lacking a systematic
classification and discussion of existing research efforts. Thus motivated, we
have made a comprehensive investigation on the recent studies about EI oriented
collaborative DNN inference. In this paper, we firstly review the background
and motivation of EI. Then, we classify four typical collaborative DNN
inference paradigms for EI, and analyze the characteristics and key
technologies of them. Finally, we summarize the current challenges of
collaborative DNN inference, discuss the future development trend and provide
the future research direction.
Related papers
- DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach [49.56404236394601]
We formulate the problem of joint DNN partitioning, task offloading, and resource allocation in Vehicular Edge Computing.
Our objective is to minimize the DNN-based task completion time while guaranteeing the system stability over time.
We propose a Multi-Agent Diffusion-based Deep Reinforcement Learning (MAD2RL) algorithm, incorporating the innovative use of diffusion models.
arXiv Detail & Related papers (2024-06-11T06:31:03Z) - Networking Systems for Video Anomaly Detection: A Tutorial and Survey [55.28514053969056]
Video Anomaly Detection (VAD) is a fundamental research task within the Artificial Intelligence (AI) community.
In this article, we delineate the foundational assumptions, learning frameworks, and applicable scenarios of various deep learning-driven VAD routes.
We showcase our latest NSVAD research in industrial IoT and smart cities, along with an end-cloud collaborative architecture for deployable NSVAD.
arXiv Detail & Related papers (2024-05-16T02:00:44Z) - Towards Integrated Fine-tuning and Inference when Generative AI meets
Edge Intelligence [5.078859563367533]
High-performance generative artificial intelligence (GAI) represents latest evolution of computational intelligence.
The inevitable encounter between GAI and edge intelligence (EI) can unleash new opportunities.
We propose the GAI-oriented synthetical network (GaisNet) that buffers contradiction leveraging data-free knowledge relay.
arXiv Detail & Related papers (2024-01-05T06:52:55Z) - Green Edge AI: A Contemporary Survey [46.11332733210337]
The transformative power of AI is derived from the utilization of deep neural networks (DNNs)
Deep learning (DL) is increasingly being transitioned to wireless edge networks in proximity to end-user devices (EUDs)
Despite its potential, edge AI faces substantial challenges, mostly due to the dichotomy between the resource limitations of wireless edge networks and the resource-intensive nature of DL.
arXiv Detail & Related papers (2023-12-01T04:04:37Z) - Brain-Inspired Spiking Neural Networks for Industrial Fault Diagnosis: A Survey, Challenges, and Opportunities [10.371337760495521]
Spiking Neural Network (SNN) is founded on principles of Brain-inspired computing.
This paper systematically reviews the theoretical progress of SNN-based models to answer the question of what SNN is.
arXiv Detail & Related papers (2023-11-13T11:25:34Z) - Hardware Approximate Techniques for Deep Neural Network Accelerators: A
Survey [4.856755747052137]
Deep Neural Networks (DNNs) are very popular because of their high performance in various cognitive tasks in Machine Learning (ML)
Recent advancements in DNNs have brought beyond human accuracy in many tasks, but at the cost of high computational complexity.
This article provides a comprehensive survey and analysis of hardware approximation techniques for DNN accelerators.
arXiv Detail & Related papers (2022-03-16T16:33:13Z) - Edge-Cloud Polarization and Collaboration: A Comprehensive Survey [61.05059817550049]
We conduct a systematic review for both cloud and edge AI.
We are the first to set up the collaborative learning mechanism for cloud and edge modeling.
We discuss potentials and practical experiences of some on-going advanced edge AI topics.
arXiv Detail & Related papers (2021-11-11T05:58:23Z) - Artificial Intelligence for IT Operations (AIOPS) Workshop White Paper [50.25428141435537]
Artificial Intelligence for IT Operations (AIOps) is an emerging interdisciplinary field arising in the intersection between machine learning, big data, streaming analytics, and the management of IT operations.
Main aim of the AIOPS workshop is to bring together researchers from both academia and industry to present their experiences, results, and work in progress in this field.
arXiv Detail & Related papers (2021-01-15T10:43:10Z) - Boosting Deep Neural Networks with Geometrical Prior Knowledge: A Survey [77.99182201815763]
Deep Neural Networks (DNNs) achieve state-of-the-art results in many different problem settings.
DNNs are often treated as black box systems, which complicates their evaluation and validation.
One promising field, inspired by the success of convolutional neural networks (CNNs) in computer vision tasks, is to incorporate knowledge about symmetric geometrical transformations.
arXiv Detail & Related papers (2020-06-30T14:56:05Z) - Neuroevolution in Deep Neural Networks: Current Trends and Future
Challenges [0.0]
A variety of methods have been applied to the architectural configuration and learning or training of artificial deep neural networks (DNN)
Evolutionary Algorithms (EAs) are gaining momentum as a computationally feasible method for the automated optimisation and training of DNNs.
This paper presents a comprehensive survey, discussion and evaluation of the state-of-the-art works on using EAs for architectural configuration and training of DNNs.
arXiv Detail & Related papers (2020-06-09T17:28:25Z) - Deep Learning for Ultra-Reliable and Low-Latency Communications in 6G
Networks [84.2155885234293]
We first summarize how to apply data-driven supervised deep learning and deep reinforcement learning in URLLC.
To address these open problems, we develop a multi-level architecture that enables device intelligence, edge intelligence, and cloud intelligence for URLLC.
arXiv Detail & Related papers (2020-02-22T14:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.