Building AI Service Repositories for On-Demand Service Orchestration in 6G AI-RAN
- URL: http://arxiv.org/abs/2504.09647v1
- Date: Sun, 13 Apr 2025 16:40:58 GMT
- Title: Building AI Service Repositories for On-Demand Service Orchestration in 6G AI-RAN
- Authors: Yun Tang, Mengbang Zou, Udhaya Chandhar Srinivasan, Obumneme Umealor, Dennis Kevogo, Benjamin James Scott, Weisi Guo,
- Abstract summary: This paper systematically identifies and categorizes critical attributes influencing AI service orchestration in 6G networks.<n>We introduce an open-source, LLM-assisted toolchain that automates service packaging, deployment, and runtime profiling.
- Score: 7.375775031391254
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Efficient orchestration of AI services in 6G AI-RAN requires well-structured, ready-to-deploy AI service repositories combined with orchestration methods adaptive to diverse runtime contexts across radio access, edge, and cloud layers. Current literature lacks comprehensive frameworks for constructing such repositories and generally overlooks key practical orchestration factors. This paper systematically identifies and categorizes critical attributes influencing AI service orchestration in 6G networks and introduces an open-source, LLM-assisted toolchain that automates service packaging, deployment, and runtime profiling. We validate the proposed toolchain through the Cranfield AI Service repository case study, demonstrating significant automation benefits, reduced manual coding efforts, and the necessity of infrastructure-specific profiling, paving the way for more practical orchestration frameworks.
Related papers
- End-to-End Edge AI Service Provisioning Framework in 6G ORAN [7.6934511825411045]
This paper proposes a novel Edge AI and Network Service Orchestration framework that leverages Large Language Model (LLM) agents deployed as O-RAN rApps.<n>The proposed LLM-agent-powered system enables interactive and intuitive orchestration by translating the user's use case description into deployable AI services and corresponding network configurations.
arXiv Detail & Related papers (2025-03-15T00:48:50Z) - Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning [55.641299901038316]
AI-generated content can organize collaborative Mobile AIGC Service Providers (MASPs) at network edges to provide ubiquitous and customized content for resource-constrained users.<n>Such a paradigm faces two significant challenges: 1) raw prompts often lead to poor generation quality due to users' lack of experience with specific AIGC models, and 2) static service provisioning fails to efficiently utilize computational and communication resources.<n>We develop an interactive prompt engineering mechanism that leverages a Large Language Model (LLM) to generate customized prompt corpora and employs Inverse Reinforcement Learning (IRL) for policy imitation.
arXiv Detail & Related papers (2025-02-17T03:05:20Z) - Towards Cognitive Service Delivery on B5G through AIaaS Architecture [0.16070833439280313]
The transition from 4G to 5G has substantial implications for AI in consolidating a network geared towards business verticals.<n>This paper proposes a framework for evolving NWDAF that presents the interfaces necessary to further empower the core network with AI capabilities B5G and 6G.
arXiv Detail & Related papers (2024-12-23T20:30:29Z) - Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks [60.085771314013044]
Low-altitude economy holds significant potential for development in areas such as communication and sensing.
We propose a Clustering-based Multi-agent Deep Deterministic Policy Gradient (CMADDPG) algorithm to address the multi-UAV cooperative task scheduling challenges in SAGIN.
arXiv Detail & Related papers (2024-12-14T06:17:33Z) - A Learning-based Incentive Mechanism for Mobile AIGC Service in Decentralized Internet of Vehicles [49.86094523878003]
We propose a decentralized incentive mechanism for mobile AIGC service allocation.
We employ multi-agent deep reinforcement learning to find the balance between the supply of AIGC services on RSUs and user demand for services within the IoV context.
arXiv Detail & Related papers (2024-03-29T12:46:07Z) - Intelligent Data-Driven Architectural Features Orchestration for Network
Slicing [0.0]
Orchestration and machine learning are key elements with a crucial role in the network-slicing processes.
This paper discusses machine learning-based orchestration of features and capabilities in network slicing architectures.
An attack prevention security mechanism is developed for the SFI2 architecture using distributed embedded and cooperating ML agents.
arXiv Detail & Related papers (2024-01-12T12:32:36Z) - OTOv3: Automatic Architecture-Agnostic Neural Network Training and
Compression from Structured Pruning to Erasing Operators [57.145175475579315]
This topic spans various techniques, from structured pruning to neural architecture search, encompassing both pruning and erasing operators perspectives.
We introduce the third-generation Only-Train-Once (OTOv3), which first automatically trains and compresses a general DNN through pruning and erasing operations.
Our empirical results demonstrate the efficacy of OTOv3 across various benchmarks in structured pruning and neural architecture search.
arXiv Detail & Related papers (2023-12-15T00:22:55Z) - Toward 6G Native-AI Network: Foundation Model based Cloud-Edge-End Collaboration Framework [55.73948386625618]
We analyze the challenges of achieving 6G native AI from perspectives of data, AI models, and operational paradigm.<n>We propose a 6G native AI framework based on foundation models, provide an integration method for the expert knowledge, present the customization for two kinds of PFM, and outline a novel operational paradigm for the native AI framework.
arXiv Detail & Related papers (2023-10-26T15:19:40Z) - Zero-touch realization of Pervasive Artificial Intelligence-as-a-service
in 6G networks [8.500820283596774]
We introduce a novel platform architecture to deploy a zero-touch PAI-as-a-Service (PAI) in 6G networks supported by a blockchain-based smart system.
We present a proof of concept where we evaluate the ability of our proposed system to self-optimize and self-adapt to the dynamics of 6G networks.
arXiv Detail & Related papers (2023-07-21T10:02:24Z) - In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile
Networks [61.416494781759326]
In-situ model downloading aims to achieve transparent and real-time replacement of on-device AI models by downloading from an AI library in the network.
A key component of the presented framework is a set of techniques that dynamically compress a downloaded model at the depth-level, parameter-level, or bit-level.
We propose a 6G network architecture customized for deploying in-situ model downloading with the key feature of a three-tier (edge, local, and central) AI library.
arXiv Detail & Related papers (2022-10-07T13:41:15Z) - Predictive Closed-Loop Service Automation in O-RAN based Network Slicing [35.23159175375706]
Open radio access network (O-RAN) is perhaps the most promising RAN architecture that inherits all the aforementioned features.
This article provides a closed-loop and intelligent resource provisioning scheme for O-RAN slicing to prevent SLA violations.
arXiv Detail & Related papers (2022-02-04T04:34:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.