AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service
- URL: http://arxiv.org/abs/2602.15286v1
- Date: Tue, 17 Feb 2026 01:11:26 GMT
- Title: AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service
- Authors: Merve Saimler, Mohaned Chraiti,
- Abstract summary: 6G service providers are envisioned to play a crucial role in exposing AI in a setting where users submit only an intent.<n>We prototype AI-Paging using existing control- and user-plane mechanisms.
- Score: 0.13750624267664155
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With AI-as-a-Service (AIaaS) now deployed across multiple providers and model tiers, selecting the appropriate model instance at run time is increasingly outside the end user's knowledge and operational control. Accordingly, the 6G service providers are envisioned to play a crucial role in exposing AIaaS in a setting where users submit only an intent while the network helps in the intent-to-model matching (resolution) and execution placement under policy, trust, and Quality of Service (QoS) constraints. The network role becomes to discover candidate execution endpoints and selects a suitable model/anchor under policy and QoS constraints in a process referred here to as AI-paging (by analogy to cellular call paging). In the proposed architecture, AI-paging is a control-plane transaction that resolves an intent into an AI service identity (AISI), a scoped session token (AIST), and an expiring admission lease (COMMIT) that authorizes user-plane steering to a selected AI execution anchor (AEXF) under a QoS binding. AI-Paging enforces two invariants: (i) lease-gated steering (without COMMIT, no steering state is installed) and (ii) make-before-break anchoring to support continuity and reliability of AIaaS services under dynamic network conditions. We prototype AI-Paging using existing control- and user-plane mechanisms (service-based control, QoS flows, and policy-based steering) with no new packet headers, ensuring compatibility with existing 3GPP-based exposure and management architectures, and evaluate transaction latency, relocation interruption, enforcement correctness under lease expiry, and audit-evidence overhead under mobility and failures.
Related papers
- High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration [0.12234742322758417]
This paper introduces an assurance-oriented AI management plane based on Tail-Risk Envelopes (TREs)<n>TREs are signed, composable per-domain descriptors that combine deterministic guardrails with rate-latency-impairment models.<n>We show that tenant-level reservations prevent bursty traffic from inflating tail latency under TRE contracts.
arXiv Detail & Related papers (2026-02-17T00:40:04Z) - An Agentic AI Control Plane for 6G Network Slice Orchestration, Monitoring, and Trading [4.494648341572566]
6G networks are expected to be AI-native, intent-driven, and economically programmable.<n>Existing slicing frameworks, largely designed for 5G, rely on static policies and manual controls.<n>We propose an agentic AI control plane architecture for 6G network slice orchestration.
arXiv Detail & Related papers (2026-01-27T09:50:57Z) - Meta-Learning-Based Handover Management in NextG O-RAN [47.23997854845925]
We propose CONTRA, a framework that jointly optimize THOs and CHOs within the O-RAN architecture.<n>ConTRA is specifically designed for near-real-time deployment as an O-RAN xApp runtime.<n>We show that CONTRA improves user throughput and reduces both THO and CHO switching costs.
arXiv Detail & Related papers (2025-12-26T13:01:46Z) - QoS-Aware Hierarchical Reinforcement Learning for Joint Link Selection and Trajectory Optimization in SAGIN-Supported UAV Mobility Management [52.15690855486153]
A space-air-ground integrated network (SAGIN) has emerged as an essential architecture for enabling ubiquitous UAV connectivity.<n>This paper formulates UAV mobility management in SAGIN as a constrained multiobjective joint optimization problem.
arXiv Detail & Related papers (2025-12-17T06:22:46Z) - AAGATE: A NIST AI RMF-Aligned Governance Platform for Agentic AI [2.430812125419517]
AAGATE addresses the unique security and governance challenges posed by autonomous, language-model-driven agents in production.<n>It incorporates a zero-trust service mesh, an explainable policy engine, behavioral analytics, and decentralized accountability hooks.
arXiv Detail & Related papers (2025-10-29T18:06:28Z) - A Novel Zero-Trust Identity Framework for Agentic AI: Decentralized Authentication and Fine-Grained Access Control [7.228060525494563]
This paper posits the imperative for a novel Agentic AI IAM framework.<n>We propose a comprehensive framework built upon rich, verifiable Agent Identities (IDs)<n>We also explore how Zero-Knowledge Proofs (ZKPs) enable privacy-preserving attribute disclosure and verifiable policy compliance.
arXiv Detail & Related papers (2025-05-25T20:21:55Z) - Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning [55.641299901038316]
AI-generated content can organize collaborative Mobile AIGC Service Providers (MASPs) at network edges to provide ubiquitous and customized content for resource-constrained users.<n>Such a paradigm faces two significant challenges: 1) raw prompts often lead to poor generation quality due to users' lack of experience with specific AIGC models, and 2) static service provisioning fails to efficiently utilize computational and communication resources.<n>We develop an interactive prompt engineering mechanism that leverages a Large Language Model (LLM) to generate customized prompt corpora and employs Inverse Reinforcement Learning (IRL) for policy imitation.
arXiv Detail & Related papers (2025-02-17T03:05:20Z) - A Learning-based Incentive Mechanism for Mobile AIGC Service in Decentralized Internet of Vehicles [49.86094523878003]
We propose a decentralized incentive mechanism for mobile AIGC service allocation.
We employ multi-agent deep reinforcement learning to find the balance between the supply of AIGC services on RSUs and user demand for services within the IoV context.
arXiv Detail & Related papers (2024-03-29T12:46:07Z) - Enabling AI-Generated Content (AIGC) Services in Wireless Edge Networks [68.00382171900975]
In wireless edge networks, the transmission of incorrectly generated content may unnecessarily consume network resources.
We present the AIGC-as-a-service concept and discuss the challenges in deploying A at the edge networks.
We propose a deep reinforcement learning-enabled algorithm for optimal ASP selection.
arXiv Detail & Related papers (2023-01-09T09:30:23Z) - Artificial Intelligence Empowered Multiple Access for Ultra Reliable and
Low Latency THz Wireless Networks [76.89730672544216]
Terahertz (THz) wireless networks are expected to catalyze the beyond fifth generation (B5G) era.
To satisfy the ultra-reliability and low-latency demands of several B5G applications, novel mobility management approaches are required.
This article presents a holistic MAC layer approach that enables intelligent user association and resource allocation, as well as flexible and adaptive mobility management.
arXiv Detail & Related papers (2022-08-17T03:00:24Z) - Predictive Closed-Loop Service Automation in O-RAN based Network Slicing [35.23159175375706]
Open radio access network (O-RAN) is perhaps the most promising RAN architecture that inherits all the aforementioned features.
This article provides a closed-loop and intelligent resource provisioning scheme for O-RAN slicing to prevent SLA violations.
arXiv Detail & Related papers (2022-02-04T04:34:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.