Fugu-MT 論文翻訳(概要): AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning

論文の概要: AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning

arxiv url: http://arxiv.org/abs/2510.05468v1
Date: Tue, 07 Oct 2025 00:05:16 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-08 17:57:08.034122
Title: AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning
Title（参考訳）: AMAQ: 協調パラメータ効率的なファインチューニングのための適応混合ビット活性化量子化
Authors: Yurun Song, Zhuoyi Yang, Ian G. Harris, Sangeetha Abdu Jyothi,
Abstract要約: 協調学習における通信オーバーヘッドを低減するために,適応混合ビット活性化量子化(AMAQ)を導入する。 AMAQは、高い精度(6ビットから8ビット)から低い精度(3ビットから4ビット)までのアクティベーションと勾配を圧縮する同じビット予算の下で、AMAQは固定精度のアプローチより優れ、約2.5%高い生成精度と約1.3%より良い分類精度を提供する。
参考スコア（独自算出の注目度）: 7.46409091526834
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) are scaling rapidly, creating significant challenges for collaborative server client distributed training, particularly in terms of communication efficiency and computational overheads. To address these challenges, we implement Parameter-efficient Split Learning, which effectively balances efficiency and performance for collaborative training on low-resource devices. To reduce communication overhead in collaborative training, we introduce Adaptive Mixed bit Activation Quantization (AMAQ), a strategy that progressively compresses activations and gradients from high precision (6 to 8 bits) to low precision (3 to 4 bits). AMAQ achieves this by effectively allocating bit budgets across channels based on feature wise and layer wise importance using bit regularization. Under the same bit budgets, AMAQ outperforms fixed-precision approaches, delivering about 2.5% higher generation accuracy and about 1.3% better classification accuracy for models like LLaMA3 8B and Qwen2.5 7B. In addition, it significantly enhances training stability and reducing ultra-low bit representation collapse during the training. Experiments demonstrate that AMAQ integrates effectively into practical multi-machine collaborative training setups, offering superior inference accuracy with only a modest communication overhead for bits adaptation during training. This trade off makes AMAQ a practical and effective solution for collaborative training with minimal communication cost.
Abstract（参考訳）: 大規模言語モデル(LLM)は急速にスケールしており、特に通信効率と計算オーバーヘッドの観点から、協調的なサーバクライアント分散トレーニングにおいて大きな課題を生み出しています。これらの課題に対処するため,低リソースデバイス上での協調学習において,効率と性能を効果的にバランスさせるパラメータ効率のよいスプリットラーニングを実装した。 AMAQ(Adaptive Mixed Bit Activation Quantization)は,高精度(6ビットから8ビット)から低精度(3ビットから4ビット)までのアクティベーションと勾配を段階的に圧縮する手法である。 AMAQは、ビット正規化を用いて、機能ワイドと層ワイドの重要度に基づいて、チャンネル間のビット予算を効果的に割り当てることにより、これを実現する。同じビット予算の下で、AMAQは固定精度のアプローチより優れており、LLaMA3 8BやQwen2.5 7Bのようなモデルの約2.5%高い生成精度と約1.3%の分類精度を提供する。さらに、トレーニングの安定性を著しく向上し、トレーニング中の超低ビット表現の崩壊を低減する。実験により、AMAQは実運用のマルチマシン協調訓練装置に効果的に統合され、トレーニング中のビット適応のためのモデム通信オーバーヘッドのみに優れた推論精度を提供することが示された。このトレードオフにより、AMAQは最小の通信コストで協調トレーニングを行うための実用的で効果的なソリューションとなる。

論文の概要: AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning

関連論文リスト