Fugu-MT 論文翻訳(概要): On Low-Bit Quantization Errors in Speaker Verification: Diagnostic and Mitigation

論文の概要: On Low-Bit Quantization Errors in Speaker Verification: Diagnostic and Mitigation

arxiv url: http://arxiv.org/abs/2606.08078v1
Date: Sat, 06 Jun 2026 09:55:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.763677
Title: On Low-Bit Quantization Errors in Speaker Verification: Diagnostic and Mitigation
Title（参考訳）: 話者検証における低ビット量子化誤差について:診断と緩和
Authors: Hugo Leguillier, Driss Matrouf, Guillaume Lechien, Mickael Rouvier,
Abstract要約: ResNet-36とResNet-200の低ビット量子化学習について検討した。我々は,FP32閾値付近に集中するスコアドリフトと有害な決定フリップを有する2ビットの明確な膝点を同定した。 2ビットでほとんどの試行を解消し、曖昧なケースのみをエスカレートする校正多重精度カスケードを提案する。
参考スコア（独自算出の注目度）: 6.8167913328808405
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Although low-bit quantization provides practical means to deploy speaker verification on resource-constrained devices, its effects on speaker verification performance remain poorly understood. In this paper, we study uniform K-means quantization-aware training of ResNet-36 and ResNet-200 through joint layer-wise and score-level analyses. Our layer-wise analysis highlights fragile components and shows that score degradation is not fully explained by weight distortion alone. We identify a clear knee point at 2 bits, with larger score drift and harmful decision flips concentrated near the FP32 threshold. Our score-level analysis reveals where and how score errors emerge under extreme quantization. Building on these findings, we propose a calibrated multi-precision cascade that resolves most trials at 2 bits and escalates only ambiguous cases, achieving performance close to FP32 while preserving the efficiency benefits of low-bit inference with substantially lower compute and memory costs.
Abstract（参考訳）: 低ビット量子化は、リソース制約のあるデバイスに話者検証をデプロイする実用的な手段を提供するが、その話者検証性能への影響はよく分かっていない。本稿では,ResNet-36とResNet-200のK平均量子化学習について,ジョイント層とスコアレベル解析を用いて検討する。筆者らの層別分析では, 脆性成分が強調され, 重量歪みだけでは, スコア劣化が完全に説明できないことが示された。我々は,FP32閾値付近にスコアドリフトと有害な決定フリップが集中した2ビットの明確な膝点を同定した。我々のスコアレベル分析は、極端量子化の下でスコアエラーが出現する場所と方法を明らかにする。これらの結果に基づいて,2ビットでほとんどの試行を解消し,不明瞭なケースのみをエスカレートし,FP32に近い性能を実現し,計算コストとメモリコストを大幅に低減した低ビット推論の効率性を維持した校正マルチ精度カスケードを提案する。

論文の概要: On Low-Bit Quantization Errors in Speaker Verification: Diagnostic and Mitigation

関連論文リスト