Fugu-MT 論文翻訳(概要): CANS: Accelerating Multiuser Collaborative Edge Inference via Cooperative Autodidactic NeuroSurgeon

論文の概要: CANS: Accelerating Multiuser Collaborative Edge Inference via Cooperative Autodidactic NeuroSurgeon

arxiv url: http://arxiv.org/abs/2606.09175v1
Date: Mon, 08 Jun 2026 08:14:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:06.824036
Title: CANS: Accelerating Multiuser Collaborative Edge Inference via Cooperative Autodidactic NeuroSurgeon
Title（参考訳）: CANS:Cooperative Autodidactic NeuroSurgeonによる複数ユーザ協調エッジ推論の高速化
Authors: Zheshun Wu, Ziyang Zhang, Changyao Lin, Zenglin Xu, Jie Liu,
Abstract要約: モバイルエッジコンピューティング(MEC)による協調型ディープニューラルネットワーク(DNN)推論は、リソース制約のあるモバイルデバイスにインテリジェントなサービスを提供するための有望なアプローチとして登場した。我々は、協調的エッジ推論フレームワークであるCooperative Autodidactic NeuroSurgeon (CANS)を提案し、オンライン推論中に情報的フィードバックを共有することで、デバイスが最適なDNNパーティションを適応的に学習できるようにする。特に2つのエッジデバイスにおける試作実験において,提案したCANSは,非協調的ベースラインと比較して,平均推論遅延を最大50%削減した。
参考スコア（独自算出の注目度）: 46.896701137872604
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recently, mobile edge computing (MEC)-enabled collaborative deep neural network (DNN) inference has emerged as a promising approach for delivering intelligent services to resource-constrained mobile devices. A representative scenario is multi-user collaborative edge inference, where distinct devices independently partition their DNN models and offload backend computation to a common edge server over wireless networks. However, determining the optimal DNN partition for each device is challenging due to unknown and time-varying system conditions, including fluctuating wireless links and diverse device capabilities. To address this problem, we propose Cooperative Autodidactic NeuroSurgeon (CANS), a collaborative edge inference framework that enables devices to adaptively learn optimal DNN partitions by sharing informative feedback during online inference. To handle the challenge of device heterogeneity and better leverage offline inference experience, we integrate a novel FedLinUCB-DW algorithm that groups devices of the same type and warm-starts online exploration using local offline early-exit inference experience. Furthermore, we provide theoretical guarantees for FedLinUCB-DW by deriving the regret upper bound. We also validate our method on both a simulated environment and a hardware prototype system. Empirical evaluations demonstrate that CANS achieves lower inference latency compared to state-of-the-art baselines. Especially, in prototype experiments on two edge devices, the proposed CANS reduced average inference latency by up to 50% compared to the non-cooperative baseline.
Abstract（参考訳）: 近年,リソース制約のあるモバイルデバイスにインテリジェントなサービスを提供するための,モバイルエッジコンピューティング(MEC)による協調型ディープニューラルネットワーク(DNN)推論が,有望なアプローチとして浮上している。代表的なシナリオは、DNNモデルを独立して分割し、バックエンドの計算を無線ネットワーク上の共通のエッジサーバにオフロードする、マルチユーザ共同エッジ推論である。しかし、無線リンクのゆらぎや多様なデバイス機能など、未知かつ時間的に変化するシステム条件のため、各デバイスに対して最適なDNNパーティションを決定することは困難である。この問題を解決するために,協調型自律神経外科医(CANS)を提案する。これは協調型エッジ推論フレームワークで,オンライン推論中に情報的フィードバックを共有することで,デバイスが最適なDNNパーティションを適応的に学習することを可能にする。デバイスの不均一性の課題に対処し、オフラインの推論エクスペリエンスをより活用するために、同じタイプのデバイスをグループ化し、ローカルのオフライン早期実行推論エクスペリエンスを使用して、オンライン探索をウォームスタートする新しいFedLinUCB-DWアルゴリズムを統合する。さらに,FedLinUCB-DWに対する理論的保証として,後悔の上限を導出する。また,本手法をシミュレーション環境とハードウェアプロトタイプシステムの両方で検証する。実証的な評価では、CANSは最先端のベースラインに比べて推論レイテンシが低いことが示されている。特に,2つのエッジデバイスにおける試作実験において,提案したCANSは,非協調的ベースラインと比較して平均推論遅延を最大50%削減した。

論文の概要: CANS: Accelerating Multiuser Collaborative Edge Inference via Cooperative Autodidactic NeuroSurgeon

関連論文リスト