An Overview on Generative AI at Scale with Edge-Cloud Computing
- URL: http://arxiv.org/abs/2306.17170v2
- Date: Sun, 9 Jul 2023 08:56:08 GMT
- Title: An Overview on Generative AI at Scale with Edge-Cloud Computing
- Authors: Yun-Cheng Wang, Jintang Xue, Chengwei Wei, C.-C. Jay Kuo
- Abstract summary: generative artificial intelligence (GenAI) generates new content that resembles what is created by humans.
The rapid development of GenAI systems has created a huge amount of new data on the Internet.
It is attractive to build GenAI systems at scale by leveraging the edge-cloud computing paradigm.
- Score: 28.98486923400986
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: As a specific category of artificial intelligence (AI), generative artificial
intelligence (GenAI) generates new content that resembles what is created by
humans. The rapid development of GenAI systems has created a huge amount of new
data on the Internet, posing new challenges to current computing and
communication frameworks. Currently, GenAI services rely on the traditional
cloud computing framework due to the need for large computation resources.
However, such services will encounter high latency because of data transmission
and a high volume of requests. On the other hand, edge-cloud computing can
provide adequate computation power and low latency at the same time through the
collaboration between edges and the cloud. Thus, it is attractive to build
GenAI systems at scale by leveraging the edge-cloud computing paradigm. In this
overview paper, we review recent developments in GenAI and edge-cloud
computing, respectively. Then, we use two exemplary GenAI applications to
discuss technical challenges in scaling up their solutions using edge-cloud
collaborative systems. Finally, we list design considerations for training and
deploying GenAI systems at scale and point out future research directions.
Related papers
- Two-Timescale Model Caching and Resource Allocation for Edge-Enabled AI-Generated Content Services [55.0337199834612]
Generative AI (GenAI) has emerged as a transformative technology, enabling customized and personalized AI-generated content (AIGC) services.
These services require executing GenAI models with billions of parameters, posing significant obstacles to resource-limited wireless edge.
We introduce the formulation of joint model caching and resource allocation for AIGC services to balance a trade-off between AIGC quality and latency metrics.
arXiv Detail & Related papers (2024-11-03T07:01:13Z) - Strategic Demand-Planning in Wireless Networks: Can Generative-AI Save Spectrum and Energy? [17.59973153669422]
This article introduces the concept of strategic demand-planning through demand-labeling, demand-shaping, and demand-rescheduling.
GenAI is proposed as a powerful tool to facilitate demand-shaping in wireless networks.
GenAI can serve a function in saving energy and spectrum in wireless networks.
arXiv Detail & Related papers (2024-07-02T14:27:06Z) - Explainable Generative AI (GenXAI): A Survey, Conceptualization, and Research Agenda [1.8592384822257952]
We elaborate on why XAI has gained importance with the rise of GenAI and its challenges for explainability research.
We also unveil novel and emerging desiderata that explanations should fulfill, covering aspects such as verifiability, interactivity, security, and cost.
arXiv Detail & Related papers (2024-04-15T08:18:16Z) - Computing in the Era of Large Generative Models: From Cloud-Native to
AI-Native [46.7766555589807]
We describe an AI-native computing paradigm that harnesses the power of both cloudnative technologies and advanced machine learning inference.
These joint efforts aim to optimize costs-of-goods-sold (COGS) and improve resource accessibility.
arXiv Detail & Related papers (2024-01-17T20:34:11Z) - AI-Generated Images as Data Source: The Dawn of Synthetic Era [61.879821573066216]
generative AI has unlocked the potential to create synthetic images that closely resemble real-world photographs.
This paper explores the innovative concept of harnessing these AI-generated images as new data sources.
In contrast to real data, AI-generated data exhibit remarkable advantages, including unmatched abundance and scalability.
arXiv Detail & Related papers (2023-10-03T06:55:19Z) - A Comprehensive Survey of AI-Generated Content (AIGC): A History of
Generative AI from GAN to ChatGPT [63.58711128819828]
ChatGPT and other Generative AI (GAI) techniques belong to the category of Artificial Intelligence Generated Content (AIGC)
The goal of AIGC is to make the content creation process more efficient and accessible, allowing for the production of high-quality content at a faster pace.
arXiv Detail & Related papers (2023-03-07T20:36:13Z) - Edge-Cloud Polarization and Collaboration: A Comprehensive Survey [61.05059817550049]
We conduct a systematic review for both cloud and edge AI.
We are the first to set up the collaborative learning mechanism for cloud and edge modeling.
We discuss potentials and practical experiences of some on-going advanced edge AI topics.
arXiv Detail & Related papers (2021-11-11T05:58:23Z) - Auto-Split: A General Framework of Collaborative Edge-Cloud AI [49.750972428032355]
This paper describes the techniques and engineering practice behind Auto-Split, an edge-cloud collaborative prototype of Huawei Cloud.
To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
arXiv Detail & Related papers (2021-08-30T08:03:29Z) - KubeEdge.AI: AI Platform for Edge Devices [4.337396433660794]
KubeEdge.AI is an edge AI framework on top of KubeEdge.
It provides a set of key modules and interfaces: a data handling and processing engine, a concise AI runtime, a decision engine, and a distributed data query interface.
arXiv Detail & Related papers (2020-07-07T23:36:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.