Fugu-MT 論文翻訳(概要): EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization

論文の概要: EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization

arxiv url: http://arxiv.org/abs/2205.04180v1
Date: Mon, 9 May 2022 10:44:23 GMT
ステータス: 翻訳完了
システム内更新日: 2022-05-10 18:08:13.107411
Title: EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization
Title（参考訳）: EF-BV:分散最適化におけるバイアス圧縮とアンバイアス圧縮の誤差フィードバックとばらつき低減の統一理論
Authors: Laurent Condat, Kai Yi, Peter Richt\'arik
Abstract要約: 分散最適化と学習では、異なるコンピュータユニット間の通信がボトルネックとなることが多い。圧縮演算子には2つのクラスがあり、それを利用するアルゴリズムは別々である。本稿では,特にDIANAとEF21を復元する新しいアルゴリズムを提案する。
参考スコア（独自算出の注目度）: 7.691755449724637
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In distributed or federated optimization and learning, communication between the different computing units is often the bottleneck, and gradient compression is a widely used technique for reducing the number of bits sent within each communication round of iterative methods. There are two classes of compression operators and separate algorithms making use of them. In the case of unbiased random compressors with bounded variance (e.g., rand-k), the DIANA algorithm of Mishchenko et al. [2019], which implements a variance reduction technique for handling the variance introduced by compression, is the current state of the art. In the case of biased and contractive compressors (e.g., top-k), the EF21 algorithm of Richt\'arik et al. [2021], which implements an error-feedback mechanism for handling the error introduced by compression, is the current state of the art. These two classes of compression schemes and algorithms are distinct, with different analyses and proof techniques. In this paper, we unify them into a single framework and propose a new algorithm, recovering DIANA and EF21 as particular cases. We prove linear convergence under certain conditions. Our general approach works with a new, larger class of compressors, which includes unbiased and biased compressors as particular cases, and has two parameters, the bias and the variance. These gives a finer control and allows us to inherit the best of the two worlds: biased compressors, whose good performance in practice is recognized, can be used. And independent randomness at the compressors allows to mitigate the effects of compression, with the convergence rate improving when the number of parallel workers is large. This is the first time that an algorithm with all these features is proposed. Our approach takes a step towards better understanding of two so-far distinct worlds of communication-efficient distributed learning.
Abstract（参考訳）: 分散最適化やフェデレーション最適化や学習では、異なる演算ユニット間の通信がボトルネックとなり、勾配圧縮は各通信ラウンド内の反復メソッドで送信されるビット数を減らすために広く使われているテクニックである。圧縮演算子には2つのクラスがあり、それを利用するアルゴリズムは別々である。有界分散を持つ非バイアスランダム圧縮機(例:rund-k)の場合、MishchenkoらによるDIANAアルゴリズムが用いられる。圧縮によって引き起こされる分散を扱う分散低減技術を実装した[2019]が現在の技術である。バイアスのある圧縮機(例えばトップ-k)の場合、EF21アルゴリズムはRicht\'arik et al である。圧縮によって導入されたエラーを処理するエラーフィードバック機構を実装した[2021]が,その最先端技術である。これら2つの圧縮スキームとアルゴリズムのクラスは異なる解析と証明技術を持つ。本稿では,それらをひとつのフレームワークに統合し,新しいアルゴリズムを提案し,特にDIANAとEF21を復元する。我々はある条件下で線形収束を証明する。我々の一般的な手法は、特に非バイアス圧縮機とバイアス圧縮機を含む、より大規模な圧縮機で機能し、バイアスと分散の2つのパラメータを持つ。これにより、より細かい制御が可能になり、バイアスド圧縮機(実際に優れた性能が認識されている)が使用できる2つの世界のベストを継承することができます。そして、圧縮機の独立したランダム性は圧縮の影響を緩和し、並列作業者の数が大きい場合に収束速度が向上する。これら全ての特徴を持つアルゴリズムが提案されたのはこれが初めてである。本稿では,コミュニケーション効率のよい分散学習の2つの世界に対する理解を深める。

論文の概要: EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization

関連論文リスト