Fugu-MT 論文翻訳(概要): Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States

論文の概要: Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States

arxiv url: http://arxiv.org/abs/2603.03146v1
Date: Tue, 03 Mar 2026 16:33:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-04 21:38:10.880102
Title: Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States
Title（参考訳）: Channel-Adaptive Edge AI: 計算複雑性をチャネル状態に適応させることによる推論出力の最大化
Authors: Jierui Zhang, Jianhao Huang, Kaibin Huang,
Abstract要約: emph通信と計算(IC$2$)は、6Gネットワークにおける効率的なエッジ推論を実現するための新しいパラダイムとして登場した。この計量は、チャネル歪みと人工知能(AI)モデルアーキテクチャと計算複雑性の両方を考慮する必要があるため、非常に複雑である。我々は、E2E推論精度の抽出可能な解析モデルを開発し、それを利用して、推論スループットを最大化するEmph Channel-Adaptive AIアルゴリズムを設計する。
参考スコア（独自算出の注目度）: 31.472509140661796
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: \emph{Integrated communication and computation} (IC$^2$) has emerged as a new paradigm for enabling efficient edge inference in sixth-generation (6G) networks. However, the design of IC$^2$ technologies is hindered by the lack of a tractable theoretical framework for characterizing \emph{end-to-end} (E2E) inference performance. The metric is highly complicated as it needs to account for both channel distortion and artificial intelligence (AI) model architecture and computational complexity. In this work, we address this challenge by developing a tractable analytical model for E2E inference accuracy and leveraging it to design a \emph{channel-adaptive AI} algorithm that maximizes inference throughput, referred to as the edge processing rate (EPR), under latency and accuracy constraints. Specifically, we consider an edge inference system in which a server deploys a backbone model with early exit, which enables flexible computational complexity, to perform inference on data features transmitted by a mobile device. The proposed accuracy model characterizes high-dimensional feature distributions in the angular domain using a Mixture of von Mises (MvM) distribution. This leads to a desired closed-form expression for inference accuracy as a function of quantization bit-width and model traversal depth, which represents channel distortion and computational complexity, respectively. Building upon this accuracy model, we formulate and solve the EPR maximization problem under joint latency and accuracy constraints, leading to a channel-adaptive AI algorithm that achieves full IC$^2$ integration. The proposed algorithm jointly adapts transmit-side feature compression and receive-side model complexity according to channel conditions to maximize overall efficiency and inference throughput. Experimental results demonstrate its superior performance as compared with fixed-complexity counterparts.
Abstract（参考訳）: 第6世代(6G)ネットワークにおいて、効率的なエッジ推論を実現するための新しいパラダイムとして、 \emph{Integrated communication and computation} (IC$^2$)が登場した。しかし、IC$^2$ 技術の設計は、emph{end-to-end} (E2E) 推論性能を特徴付けるための難解な理論フレームワークが欠如していることから妨げられている。この計量は、チャネル歪みと人工知能(AI)モデルアーキテクチャと計算複雑性の両方を考慮する必要があるため、非常に複雑である。本研究では,エッジ処理率(EPR)と呼ばれる推論スループットを,レイテンシと精度の制約の下で最大化する,E2E推論精度の抽出可能な解析モデルを開発し,それを活用して,推論スループットを最大化する「emph{ channel-adaptive AI}」アルゴリズムを設計する。具体的には、サーバが早期終了時にバックボーンモデルをデプロイし、柔軟な計算複雑性を実現するエッジ推論システムを検討し、モバイルデバイスから送信されるデータの特徴を推論する。提案した精度モデルは,MvM分布を用いた角領域の高次元特徴分布を特徴付ける。このことは、それぞれチャネル歪みと計算複雑性を表す量子化ビット幅とモデルトラバース深さの関数として、推論精度のための所望のクローズドフォーム表現をもたらす。この精度モデルに基づいて、結合レイテンシと精度制約の下でEPRの最大化問題を定式化し、解決し、完全なIC$^2$積分を実現するチャネル適応型AIアルゴリズムを実現する。提案アルゴリズムは,送信側特徴圧縮と受信側モデル複雑性をチャネル条件に応じて併用することにより,全体の効率と推論スループットを最大化する。実験結果から, 固定複雑度に比べて優れた性能を示した。

論文の概要: Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States

関連論文リスト