Fugu-MT 論文翻訳(概要): Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference

論文の概要: Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference

arxiv url: http://arxiv.org/abs/2105.13618v1
Date: Fri, 28 May 2021 06:55:04 GMT
ステータス: 翻訳完了
システム内更新日: 2021-05-31 13:27:04.261154
Title: Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference
Title（参考訳）: デバイスエッジ会議における最適モデル配置とオンラインモデル分割
Authors: Jia Yan, Suzhi Bi, Ying-Jun Angela Zhang
Abstract要約: デバイスのエッジコ推論は、リソース制約のある無線デバイスがディープニューラルネットワーク(DNN)ベースのアプリケーションを実行する新たな可能性を開く。デバイスエッジ共振器のエネルギー・時間コストを最小限に抑えるため,モデル配置とオンラインモデル分割決定の協調最適化について検討した。
参考スコア（独自算出の注目度）: 22.785214118527872
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Device-edge co-inference opens up new possibilities for resource-constrained wireless devices (WDs) to execute deep neural network (DNN)-based applications with heavy computation workloads. In particular, the WD executes the first few layers of the DNN and sends the intermediate features to the edge server that processes the remaining layers of the DNN. By adapting the model splitting decision, there exists a tradeoff between local computation cost and communication overhead. In practice, the DNN model is re-trained and updated periodically at the edge server. Once the DNN parameters are regenerated, part of the updated model must be placed at the WD to facilitate on-device inference. In this paper, we study the joint optimization of the model placement and online model splitting decisions to minimize the energy-and-time cost of device-edge co-inference in presence of wireless channel fading. The problem is challenging because the model placement and model splitting decisions are strongly coupled, while involving two different time scales. We first tackle online model splitting by formulating an optimal stopping problem, where the finite horizon of the problem is determined by the model placement decision. In addition to deriving the optimal model splitting rule based on backward induction, we further investigate a simple one-stage look-ahead rule, for which we are able to obtain analytical expressions of the model splitting decision. The analysis is useful for us to efficiently optimize the model placement decision in a larger time scale. In particular, we obtain a closed-form model placement solution for the fully-connected multilayer perceptron with equal neurons. Simulation results validate the superior performance of the joint optimal model placement and splitting with various DNN structures.
Abstract（参考訳）: device-edge co-inferenceは、大量の計算負荷を伴うディープニューラルネットワーク(dnn)ベースのアプリケーションを実行するための、リソース制約付きワイヤレスデバイス(wds)の新たな可能性を開く。特に、WDはDNNの最初の数層を実行し、DNNの残りの層を処理するエッジサーバに中間機能を送信する。モデル分割決定を適用することで、局所計算コストと通信オーバーヘッドの間にトレードオフが存在する。実際には、DNNモデルはエッジサーバで定期的にトレーニングされ、更新される。 dnnパラメータが再生成されると、更新されたモデルの一部はオンデバイス推論を容易にするためにwdに置かれなければならない。本稿では,無線チャネルの消滅に伴うデバイスエッジ共振のエネルギー・時間コストを最小化するために,モデル配置とオンラインモデル分割決定の協調最適化について検討する。モデル配置とモデル分割の決定は2つの異なるタイムスケールを伴いながら強く結合しているため、問題は難しい。まず,モデル配置決定によって問題の有限方向が決定される最適停止問題を定式化し,オンラインモデル分割に取り組む。逆方向誘導に基づく最適モデル分割規則の導出に加えて、モデル分割決定の解析式を得ることができる単純な1段階のルックアヘッド規則についても検討する。この分析は,モデル配置決定をより大きな時間スケールで効率的に最適化する上で有用である。特に、ニューロンが等しい完全連結多層パーセプトロンに対する閉形式モデル配置解を得る。シミュレーションの結果, 各種DNN構造を用いた最適モデル配置と分割の優れた性能が検証された。

論文の概要: Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference

関連論文リスト