Fugu-MT 論文翻訳(概要): Almost-Free Queue Jumping for Prior Inputs in Private Neural Inference

論文の概要: Almost-Free Queue Jumping for Prior Inputs in Private Neural Inference

arxiv url: http://arxiv.org/abs/2603.12946v1
Date: Fri, 13 Mar 2026 12:41:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-16 17:38:12.085389
Title: Almost-Free Queue Jumping for Prior Inputs in Private Neural Inference
Title（参考訳）: プライベート・ニューラル推論における事前入力のためのほぼ自由なキュージャンプ
Authors: Qiao Zhang, Minghui Xu, Tingchuang Zhang, Xiuzhen Cheng,
Abstract要約: PP-MLは暗号プリミティブを統合することで、セキュアなニューラルネットワーク推論を可能にする。 PrivQJは、インプロセッシングスロットのリサイクルを通じて、入力間の共有計算を利用する。その結果,最先端のPP-MLシステムと比較して,オーバヘッドのオーダー・オブ・マグニチュード低減効果が示された。
参考スコア（独自算出の注目度）: 14.074510817815538
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Privacy-Preserving Machine Learning as a Service (PP-MLaaS) enables secure neural network inference by integrating cryptographic primitives such as homomorphic encryption (HE) and multi-party computation (MPC), protecting both client data and server models. Recent mixed-primitive frameworks have significantly improved inference efficiency, yet they process batched inputs sequentially, offering little flexibility for prioritizing urgent requests. Naïve queue jumping introduces considerable computational and communication overhead, increasing non-negligible latency for in-queue inputs. We initiate the study of privacy-preserving queue jumping in batched inference and propose PrivQJ, a novel framework that enables efficient priority handling without degrading overall system performance. PrivQJ exploits shared computation across inputs via in-processing slot recycling, allowing prior inputs to be piggybacked onto ongoing batch computation with almost no additional cryptographic cost. Both theoretical analysis and experimental results demonstrate over an order-of-magnitude reduction in overhead compared to state-of-the-art PP-MLaaS systems.
Abstract（参考訳）: プライバシ保存機械学習・アズ・ア・サービス(PP-MLaaS)は、ホモモルフィック暗号化(HE)やマルチパーティ計算(MPC)といった暗号プリミティブを統合し、クライアントデータとサーバモデルの両方を保護することによって、セキュアなニューラルネットワーク推論を可能にする。最近の混合プリミティブフレームワークは推論効率を大幅に改善しているが、バッチ入力を順次処理し、緊急要求を優先順位付けするための柔軟性をほとんど提供していない。ネイブキュージャンプは計算と通信のオーバーヘッドを大幅に増加させ、キュー内入力の非無視レイテンシを増大させる。バッチ推論におけるプライバシ保護キュージャンプの研究を開始し,システム全体の性能を低下させることなく,効率的な優先度処理を可能にする新しいフレームワークであるPrivQJを提案する。 PrivQJは、インプロセッシングスロットのリサイクルを通じて、入力間の共有計算を利用する。理論解析と実験結果の両方が、最先端のPP-MLaaSシステムと比較して、オーバーヘッドのオーダー・オブ・マグニチュード低減を実証している。

論文の概要: Almost-Free Queue Jumping for Prior Inputs in Private Neural Inference

関連論文リスト