Fugu-MT 論文翻訳(概要): AgileOS: A GPU Operating System Layer for Protected CUDA Services

論文の概要: AgileOS: A GPU Operating System Layer for Protected CUDA Services

arxiv url: http://arxiv.org/abs/2606.06697v1
Date: Thu, 04 Jun 2026 20:34:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-08 14:33:29.436535
Title: AgileOS: A GPU Operating System Layer for Protected CUDA Services
Title（参考訳）: AgileOS: 保護されたCUDAサービスのためのGPUオペレーティングシステムレイヤ
Authors: Zhuoping Yang, Yiyu Shi, Alex Jones, Peipei Zhou,
Abstract要約: 本稿では,保護サービスのためのGPUオペレーティングシステム層であるAgileOSの初期設計とプロトタイプのスコープについて述べる。サービス状態とモジュールインターフェースを保護するため、AgileOSは、ユーザー割り当てを保護されたモジュール/MMIO範囲から分離するGPUメモリ管理モデルを定義している。 AgileOSはモジュール化され、柔軟性があり、様々な保護されたサービスとcuFFTやPyTorchといった既存のライブラリをサポートする。
参考スコア（独自算出の注目度）: 5.074019267683835
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Modern GPU applications increasingly interact with storage systems, network devices, vendor libraries, and GPU-resident services rather than executing only isolated compute kernels. This shift creates a need for operating-system-like protection around GPU services, where service metadata, device queues, memory-mapped I/O regions, and library-internal state should not be directly exposed to untrusted application kernels. However, today's CUDA programming model, by default, still gives each application direct ownership of its CUDA context, device pointers, runtime handles, module loading path, and kernel launches, leaving protected GPU services to build their own ad hoc interfaces and isolation mechanisms. This paper presents the initial design and prototype scope of AgileOS, a GPU operating-system layer for protected CUDA services. AgileOS virtualizes CUDA at the library boundary: applications link against client-side CUDA Runtime, Driver, and selected library shims, while a trusted runtime worker owns the real CUDA context and mediates supported operations. To protect service state and module interfaces, AgileOS also defines a GPU memory-management model that separates user allocations from protected module/MMIO ranges, using pointer validation and memory access guards via PTX injection. AgileOS is modularized and flexible, supporting a range of protected services and existing libraries such as cuFFT and PyTorch. The prototype includes client-side interceptors, worker-side CUDA handlers, virtualized CUDA object tables, protected AgileOS modules, a GPU memory manager that separates user allocations from protected module/MMIO ranges, selected trusted library adapters, and the PTX-level kernel memory guard.
Abstract（参考訳）: 現代のGPUアプリケーションは、独立した計算カーネルのみを実行するのではなく、ストレージシステム、ネットワークデバイス、ベンダーライブラリ、GPU常駐サービスと対話する傾向にある。このシフトは、サービスメタデータ、デバイスキュー、メモリマップされたI/Oリージョン、ライブラリ内部の状態を信頼できないアプリケーションカーネルに直接公開するべきではない、GPUサービスを中心としたOSライクな保護の必要性を生み出している。しかし、今日のCUDAプログラミングモデルは、デフォルトでは、各アプリケーションがCUDAコンテキスト、デバイスポインタ、ランタイムハンドル、モジュールローディングパス、カーネルローンチを直接所有し、独自のアドホックインターフェースと分離メカニズムを構築するために保護されたGPUサービスを残している。本稿では,保護されたCUDAサービスのためのGPUオペレーティングシステム層であるAgileOSの初期設計とプロトタイプのスコープについて述べる。 AgileOSはCUDAをライブラリ境界で仮想化する:アプリケーションはクライアントサイドのCUDAランタイム、ドライバ、選択されたライブラリシムとリンクし、信頼できるランタイムワーカーは実際のCUDAコンテキストを所有し、サポート対象の操作を仲介する。サービス状態とモジュールインターフェースを保護するため、AgileOSは、PTXインジェクションを介してポインタバリデーションとメモリアクセスガードを使用して、保護されたモジュール/MMIO範囲からユーザアロケーションを分離するGPUメモリ管理モデルも定義している。 AgileOSはモジュール化され、柔軟性があり、様々な保護されたサービスとcuFFTやPyTorchといった既存のライブラリをサポートする。プロトタイプには、クライアント側のインターセプタ、ワーカー側のCUDAハンドラ、仮想化されたCUDAオブジェクトテーブル、保護されたAgileOSモジュール、保護されたモジュール/MMIO範囲からユーザ割り当てを分離するGPUメモリマネージャ、選択された信頼できるライブラリアダプタ、PTXレベルのカーネルメモリガードが含まれている。

論文の概要: AgileOS: A GPU Operating System Layer for Protected CUDA Services

関連論文リスト