NeurIPS 2019 Schedule

Poster

Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #128

Double Quantization for Communication-Efficient Distributed Optimization

In Optimization -- Stochastic Optimization

Yue Yu · Jiaxiang Wu · Longbo Huang

Modern distributed training of machine learning models often suffers from high communication overhead for synchronizing stochastic gradients and model parameters. In this paper, to reduce the communication complexity, we propose double quantization, a general scheme for quantizing both model parameters and gradients. Three communication-efficient algorithms are proposed based on this general scheme. Specifically, (i) we propose a low-precision algorithm, AsyLPG, with asynchronous parallelism; (ii) we explore integrating gradient sparsification with double quantization and develop Sparse-AsyLPG; and (iii) we show that double quantization can be accelerated by the momentum technique and design accelerated AsyLPG. We establish rigorous performance guarantees for the algorithms, and conduct experiments on a multi-server test-bed with real-world datasets to demonstrate that our algorithms can effectively save transmitted bits without performance degradation, and significantly outperform existing methods that quantize only the model parameters or only the gradients.
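To make the general idea concrete, the following is a minimal sketch of quantizing both directions of communication: the parameters sent to a worker and the gradient sent back. It is not the AsyLPG algorithm from the paper. It runs synchronously in a single process, and the quantizer (unbiased stochastic rounding onto a uniform grid), the function names `stochastic_quantize` and `double_quantized_step`, and the toy quadratic objective are all assumptions chosen for illustration.

```python
import random


def stochastic_quantize(vec, bits, rng):
    """Round each entry of vec onto a uniform grid with 2**bits levels.

    Rounding up or down is randomized so the result is unbiased in
    expectation. Generic illustration only; the paper's quantizer may differ.
    """
    lo, hi = min(vec), max(vec)
    if hi == lo:
        # All entries identical: nothing to quantize.
        return list(vec)
    levels = 2 ** bits - 1
    step = (hi - lo) / levels
    out = []
    for v in vec:
        pos = (v - lo) / step
        base = int(pos)
        frac = pos - base
        # Round up with probability equal to the fractional part.
        idx = base + (1 if rng.random() < frac else 0)
        idx = min(idx, levels)  # guard against floating-point overshoot
        out.append(lo + idx * step)
    return out


def double_quantized_step(params, grad_fn, lr, param_bits, grad_bits, rng):
    """One SGD-style step with both messages quantized (hypothetical helper)."""
    # "Server -> worker": the worker only sees low-precision parameters.
    q_params = stochastic_quantize(params, param_bits, rng)
    # The worker evaluates the gradient at the quantized parameters.
    grad = grad_fn(q_params)
    # "Worker -> server": the gradient is quantized before being sent back.
    q_grad = stochastic_quantize(grad, grad_bits, rng)
    # The server keeps full-precision parameters and applies the update.
    return [p - lr * g for p, g in zip(params, q_grad)]


def quadratic_loss(params, target):
    """Toy objective 0.5 * ||params - target||^2, used only for the demo."""
    return 0.5 * sum((p - t) ** 2 for p, t in zip(params, target))


# Toy usage: minimize the quadratic with 8-bit parameters and gradients.
target = [1.0, 1.0, 1.0, 1.0]
params = [4.0, -3.0, 2.0, 0.0]
rng = random.Random(0)
for _ in range(20):
    params = double_quantized_step(
        params,
        lambda q: [qi - ti for qi, ti in zip(q, target)],
        lr=0.5,
        param_bits=8,
        grad_bits=8,
        rng=rng,
    )
print("final loss:", quadratic_loss(params, target))
```

In this sketch the server holds the full-precision parameters and only the transmitted messages are low-precision, which is one common way to arrange such schemes. The asynchronous parallelism, sparsification, and momentum variants named in the abstract are not modeled here.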