Fugu-MT 論文翻訳(概要): Explainable Task-Oriented Token Communication for AI-Native 6G Networks

論文の概要: Explainable Task-Oriented Token Communication for AI-Native 6G Networks

arxiv url: http://arxiv.org/abs/2606.14808v1
Date: Fri, 12 Jun 2026 05:52:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-16 16:21:32.246758
Title: Explainable Task-Oriented Token Communication for AI-Native 6G Networks
Title（参考訳）: AIネイティブ6Gネットワークのための説明可能なタスク指向トークン通信
Authors: Feibo Jiang, Lei Mao, Li Dong, Kezhi Wang, Cunhua Pan, Jiangzhou Wang,
Abstract要約: ファンデーションモデル(FM)と無線通信の統合は、ビット精度の伝送からタスク指向の伝送への画像通信の進化を促している。既存のタスク指向の画像通信手法は、タスク指向のトークン表現が不十分なこと、ビジュアルトークンとタスクトークンのコラボレーションが不十分であること、タスク決定の解釈可能性の制限という3つの大きな課題に直面している。本稿では,これらの課題に対処するための説明可能なタスク指向トークンコミュニケーション(ET-TokenCom)フレームワークを提案する。
参考スコア（独自算出の注目度）: 66.32351608034986
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The integration of Foundation Models (FMs) and wireless communications is driving the evolution of image communication from bit-accurate transmission toward task-oriented transmission. However, existing task-oriented image communication methods still face three major challenges: insufficient task-oriented Token representation, inadequate collaboration between Visual Tokens and Task Tokens, and limited interpretability of task decisions. To address these challenges, we propose an Explainable Task-Oriented Token Communication (ET-TokenCom) framework. By treating Tokens as unified units for information representation and transmission, the proposed framework constructs an end-to-end communication link that spans visual perception, wireless transmission, and task reasoning. At the transmitter, the ET-TokenCom framework extracts Visual Tokens from images to preserve low-level visual information. Meanwhile, Task Tokens generated by the FM are introduced to represent the target information and decision intent required by the current task. A Cross-Modal Attention (CMA) fusion mechanism is further designed, enabling Task Tokens to explicitly guide the selection, weighting, and transmission of Visual Tokens. At the receiver, the framework integrates Token decoding with an explainable output mechanism, where attention heatmaps are generated to highlight critical perceptual regions under different task objectives and reveal the influence of Task Tokens on the outputs. Finally, simulation results validate the effectiveness and robustness of the proposed ET-TokenCom framework.
Abstract（参考訳）: ファンデーションモデル(FM)と無線通信の統合は、ビット精度の伝送からタスク指向の伝送への画像通信の進化を促している。しかし、既存のタスク指向の画像通信手法は、タスク指向のトークン表現の不足、ビジュアルトークンとタスクトークンのコラボレーションの不十分、タスク決定の限定的な解釈可能性の3つの大きな課題に直面している。これらの課題に対処するために、説明可能なタスク指向トークンコミュニケーション(ET-TokenCom)フレームワークを提案する。 Tokensを情報表現と送信の統一単位として扱うことにより、視覚知覚、無線送信、タスク推論にまたがるエンドツーエンドの通信リンクを構築する。送信機では、ET-TokenComフレームワークが画像からビジュアルトークンを抽出し、低レベルな視覚情報を保存する。一方、FMで生成されたタスクトークンは、現在のタスクに必要な目標情報および決定意図を表すために導入される。クロスモーダル・アテンション(CMA)融合機構はさらに設計されており、タスクトークンは視覚トークンの選択、重み付け、送信を明示的にガイドすることができる。受信機では、Tokenデコーディングと説明可能な出力機構を統合し、注目ヒートマップを生成して、異なるタスク目標の下で重要な知覚領域をハイライトし、タスクトークンが出力に与える影響を明らかにする。最後に,ET-TokenComフレームワークの有効性とロバスト性をシミュレーションにより検証した。

論文の概要: Explainable Task-Oriented Token Communication for AI-Native 6G Networks

関連論文リスト