Fugu-MT 論文翻訳(概要): Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression

論文の概要: Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression

arxiv url: http://arxiv.org/abs/2510.20984v1
Date: Thu, 23 Oct 2025 20:19:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 09:00:15.313865
Title: Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression
Title（参考訳）: 低ビットLDM圧縮のためのグループ格子ベクトル量子化器の学習
Authors: Xi Zhang, Xiaolin Wu, Jiamang Wang, Weisi Lin,
Abstract要約: 本稿では,各重みの群に独自の格子コードブックを割り当てるGLVQ(Grouped Lattice Vector Quantization)フレームワークを紹介する。提案手法は,既存のトレーニング後の量子化ベースラインと比較して,モデルサイズと精度のトレードオフが良好である。
参考スコア（独自算出の注目度）: 57.54335545892155
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities but typically require extensive computational resources and memory for inference. Post-training quantization (PTQ) can effectively reduce these demands by storing weights in lower bit-width formats. However, standard uniform quantization often leads to notable performance degradation, particularly in low-bit scenarios. In this work, we introduce a Grouped Lattice Vector Quantization (GLVQ) framework that assigns each group of weights a customized lattice codebook, defined by a learnable generation matrix. To address the non-differentiability of the quantization process, we adopt Babai rounding to approximate nearest-lattice-point search during training, which enables stable optimization of the generation matrices. Once trained, decoding reduces to a simple matrix-vector multiplication, yielding an efficient and practical quantization pipeline. Experiments on multiple benchmarks show that our approach achieves a better trade-off between model size and accuracy compared to existing post-training quantization baselines, highlighting its effectiveness in deploying large models under stringent resource constraints. Our source code is available on GitHub repository: https://github.com/xzhang9308/GLVQ.
Abstract（参考訳）: 大規模言語モデル(LLM)は目覚ましい能力を示してきたが、一般的に推論には広範な計算資源とメモリを必要とする。後トレーニング量子化(PTQ)は、低ビット幅のフォーマットに重みを格納することで、これらの要求を効果的に低減することができる。しかし、標準的な均一量子化は、特に低ビットシナリオにおいて、顕著な性能劣化をもたらすことが多い。本稿では,学習可能な生成行列によって定義された格子コードブックを各重みのグループに割り当てる,GLVQ(Grouped Lattice Vector Quantization)フレームワークを提案する。量子化過程の非微分性に対処するため, 学習中の近接格子点探索にババイラウンドを採用することにより, 生成行列の安定な最適化を実現する。一度訓練すると、復号化は単純な行列ベクトル乗法に還元され、効率的で実用的な量子化パイプラインが得られる。複数のベンチマーク実験から,提案手法は既存のトレーニング後の量子化ベースラインよりもモデルサイズと精度のトレードオフを良好に達成し,厳密なリソース制約下での大規模モデルのデプロイの有効性を浮き彫りにしている。ソースコードはGitHubリポジトリで入手できます。

論文の概要: Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression

関連論文リスト