Fugu-MT 論文翻訳(概要): Towards Large Model Feature Coding

論文の概要: Towards Large Model Feature Coding

arxiv url: http://arxiv.org/abs/2605.24025v1
Date: Wed, 20 May 2026 13:39:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:17.543162
Title: Towards Large Model Feature Coding
Title（参考訳）: 大規模モデル特徴符号化に向けて
Authors: Youwei Pang, Changsheng Gao, Dong Liu, Huchuan Lu, Weisi Lin,
Abstract要約: 大規模モデル特徴符号化(LaMoFC)のためのベンチマークおよび評価フレームワークを提案する。まず、機能データセットLaMoFCBenchを構築し、4つのカテゴリと16のシナリオで多様なタスク要件をカバーしました。次に、実際の応用シナリオに従って代表分割点を指定し、中間的特徴を抽出する。
参考スコア（独自算出の注目度）: 90.89390892560094
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large models have delivered remarkable performance across a wide range of perception and generation tasks, yet practical deployment is increasingly constrained by computational and memory budgets, as well as privacy requirements. Split execution alleviates these constraints by partitioning computation across devices, but it inevitably introduces intensive transmission and storage of intermediate features. Unlike conventional feature coding for CNNs that typically targets homogeneous spatial activation maps, modern large models generate heterogeneous features with varying statistical distributions and compression tolerances, e.g., multi-level/multi-modal representations and autoregressive context caches. These characteristics necessitate treating large model feature coding (LaMoFC) as a fundamental system component and call for a systematic evaluation framework. In this paper, we present a comprehensive benchmark and evaluation framework for LaMoFC. We first build the feature dataset LaMoFCBench, covering diverse task requirements across 4 categories and 16 scenarios while integrating widelyadopted architectures and various split-computing settings. We then specify representative split points according to practical application scenarios to extract intermediate features, establishing a unified pipeline for fair and reproducible comparisons. Finally, we benchmark mainstream universal feature codecs, exposing the profound misalignment between existing coding paradigms and the heterogeneous nature of large model features. These findings reveal that LaMoFC demands a fundamental departure from existing paradigms, and LaMoFCBench provides the shared empirical foundation to drive this transition. The data and code will be available at https://github.com/lartpang/LaMoFCBench.
Abstract（参考訳）: 大規模なモデルでは、幅広い知覚と生成タスクで顕著なパフォーマンスを実現していますが、現実的なデプロイメントは、計算やメモリの予算やプライバシ要件によってますます制限されています。分割実行は、デバイス間で計算を分割することでこれらの制約を緩和するが、必然的に中間機能の集中的な伝達と保存を導入する。通常、同種空間活性化マップを対象とするCNNの従来の特徴符号化とは異なり、現代の大規模モデルは様々な統計分布と圧縮耐性を持つ異種特徴を生成する。これらの特徴は,大規模モデル特徴符号化(LaMoFC)を基本システムコンポーネントとして扱う必要があり,体系的な評価フレームワークが求められている。本稿では,LaMoFCの総合ベンチマークと評価フレームワークを提案する。まず、機能データセットLaMoFCBenchを構築し、4つのカテゴリと16のシナリオにまたがる多様なタスク要件をカバーし、広く登録されたアーキテクチャとさまざまな分割計算設定を統合します。次に、実際のアプリケーションシナリオに従って代表分割点を指定し、中間的特徴を抽出し、公正かつ再現可能な比較のための統一パイプラインを確立する。最後に、主要な普遍的特徴コーデックをベンチマークし、既存のコーディングパラダイムと大きなモデル特徴の不均一性との間の深い相違を明らかにする。これらの結果は、LaMoFCが既存のパラダイムから根本的な離脱を要求することを示し、LaMoFCBenchは、この移行を促進するための共有経験的基盤を提供する。データとコードはhttps://github.com/lartpang/LaMoFCBench.comから入手できる。

論文の概要: Towards Large Model Feature Coding

関連論文リスト