Fugu-MT 論文翻訳(概要): Hybrid Orchestration of Edge AI and Microservices via Graph-based Self-Imitation Learning

論文の概要: Hybrid Orchestration of Edge AI and Microservices via Graph-based Self-Imitation Learning

arxiv url: http://arxiv.org/abs/2603.06669v1
Date: Tue, 03 Mar 2026 03:54:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:12.698102
Title: Hybrid Orchestration of Edge AI and Microservices via Graph-based Self-Imitation Learning
Title（参考訳）: グラフベースのセルフイミテーション学習によるエッジAIとマイクロサービスのハイブリッドオーケストレーション
Authors: Chen Yang, Jin Zheng, Yang Zhuolin, Lai Pan, Zhang Xiao, Hu Menglan, Yin Haiyan,
Abstract要約: SIL-GPOは、エッジAIマイクロサービスシステムのハイブリッドオーケストレーションを最適化する強化学習フレームワークである。我々は、SIL-GPOが、最先端、メタヒューリスティック、深層RLベースラインと比較して、エンドツーエンドのサービスレイテンシを著しく低減し、リソース利用率を高めることを実証した。
参考スコア（独自算出の注目度）: 10.889659056437546
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Modern edge AI applications increasingly rely on microservice architectures that integrate both AI services and conventional microservices into complex request chains with stringent latency requirements. Effectively orchestrating these heterogeneous services is crucial for ensuring low-latency performance, yet remains challenging due to their diverse resource demands and strong operational interdependencies under resource-constrained edge environments. In particular, frequent interactions between services tightly couple deployment and routing decisions, yet existing approaches optimize them in isolation, leading to fundamentally inadequate system performance.In this paper, we propose SIL-GPO, a reinforcement learning framework that optimizes hybrid orchestration for edge AI microservice systems. SIL-GPO formulates the orchestration problem as a sequential decision-making task and leverages graph attention networks to encode service topologies and routing dependencies within the agent state representation. Moreover, SIL-GPO integrates a self-imitation learning strategy into proximal policy optimization, enabling the agent to prioritize and reuse high-reward trajectories. This guides policy updates towards globally promising solutions that standard RL often fails to discover under sparse rewards and large combinatorial action spaces. We conduct extensive experiments on trace-driven edge AI workloads, demonstrating that SIL-GPO significantly reduces end-to-end service latency and enhances resource utilization compared to state-of-the-art heuristic, metaheuristic, and deep RL baselines. Our framework offers a unified and scalable solution for efficient orchestration of AI services and microservices in the edge, paving the way for low-latency, high-performance edge AI deployments.
Abstract（参考訳）: 現代のエッジAIアプリケーションは、AIサービスと従来のマイクロサービスの両方を、厳格なレイテンシ要件を持った複雑な要求チェーンに統合するマイクロサービスアーキテクチャにますます依存しています。これらの異種サービスを効果的にオーケストレーションすることは、低レイテンシのパフォーマンスを確保する上で非常に重要ですが、リソース要求の多様さとリソース制約されたエッジ環境下での運用上の相互依存が強いため、依然として困難です。本稿では,エッジAIマイクロサービスシステムのハイブリッドオーケストレーションを最適化する強化学習フレームワークであるSIL-GPOを提案する。 SIL-GPOは、オーケストレーション問題をシーケンシャルな意思決定タスクとして定式化し、グラフアテンションネットワークを利用して、エージェント状態表現内のサービストポロジとルーティング依存性をエンコードする。さらに、SIL-GPOは自己シミュレーション学習戦略を近似ポリシー最適化に統合し、エージェントが高次軌道を優先順位付けし再利用できるようにする。これは、標準RLがスパース報酬や大きな組合せ行動空間の下でしばしば発見できない、グローバルに有望なソリューションへのポリシー更新を導く。我々は、トレース駆動のエッジAIワークロードに関する広範な実験を行い、SIL-GPOは、最先端のヒューリスティック、メタヒューリスティック、ディープRLベースラインと比較して、エンドツーエンドのサービスレイテンシを著しく削減し、リソース使用率を向上させることを実証した。私たちのフレームワークは、エッジにおけるAIサービスとマイクロサービスの効率的なオーケストレーションのための、統一的でスケーラブルなソリューションを提供しています。

論文の概要: Hybrid Orchestration of Edge AI and Microservices via Graph-based Self-Imitation Learning

関連論文リスト