Fugu-MT 論文翻訳(概要): Nonlinear Bipolar Compensation: Handling Outliers in Post-Training Quantization

論文の概要: Nonlinear Bipolar Compensation: Handling Outliers in Post-Training Quantization

arxiv url: http://arxiv.org/abs/2605.16423v1
Date: Thu, 14 May 2026 14:55:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:46.392727
Title: Nonlinear Bipolar Compensation: Handling Outliers in Post-Training Quantization
Title（参考訳）: 非線形双極補償:後の量子化における外乱の扱い
Authors: Peilin Sun, Jianxin Wu,
Abstract要約: ネットワーク量子化は、浮動小数点数を低ビット表現にマッピングすることで、モデルのメモリと計算消費を大幅に削減する。既存の量子化法は通常、速度精度のトレードオフと限定的な一般化に悩まされる。本稿では, 非線形補償を導入し, 外れ値の影響を低減させるポストトレーニング量子化手法であるBipolar Compensation (NBC)を提案する。
参考スコア（独自算出の注目度）: 11.76317251442695
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Network quantization has emerged as one of the most practical model compression techniques, which significantly reduces a model's memory and compute consumption by mapping floating-point numbers to low-bit representations. However, existing quantization methods typically suffer from the speed-accuracy tradeoff and limited generalization. To address these issues, recent compensation-based methods offer an efficient yet general solution by introducing additional lightweight linear layers into the quantized network. However, the accuracy of these methods suffers from their limited compensation capability and high sensitivity to outliers. In this paper, we propose Nonlinear Bipolar Compensation (NBC), a post-training quantization approach that introduces nonlinear compensation to reduce the effect of outliers. We further design Bipolar Logarithmic Transformation (BLT), which compresses outliers by mapping both the quantized input and the quantization error into a transformed space. A simple linear layer is then applied for compensation in the transformed space, preserving the efficiency of our method. Extensive experiments across various tasks, models, and quantization methods confirm the effectiveness, efficiency, robustness, and generality of our NBC approach.
Abstract（参考訳）: ネットワーク量子化は、浮動小数点数を低ビット表現にマッピングすることで、モデルのメモリと計算消費を大幅に削減する、最も実用的なモデル圧縮手法の1つである。しかし、既存の量子化法は通常、速度精度のトレードオフと限定的な一般化に悩まされる。これらの問題に対処するため、近年の補償に基づく手法は、量子化されたネットワークに新たな軽量線形層を導入することにより、効率的で汎用的な解決策を提供する。しかし, これらの手法の精度は, 限られた補償能力と, 外れ値に対する高い感度に悩まされている。本稿では, 非線形二極補償(NBC)を提案する。この手法は, 非線形補償を導入し, 外れ値の影響を低減させる。さらに、量子化入力と量子化誤差の両方を変換空間にマッピングすることで、外周を圧縮する双極対数変換(BLT)を設計する。次に、簡単な線形層を変換空間の補償として適用し、本手法の効率性を保つ。様々なタスク、モデル、量子化手法にわたる大規模な実験により、NBCのアプローチの有効性、効率性、堅牢性、一般性が確認された。

論文の概要: Nonlinear Bipolar Compensation: Handling Outliers in Post-Training Quantization

関連論文リスト