M22: RATE-DISTORTION INSPIRED GRADIENT COMPRESSION

Yangyi Liu (McMaster University); Sadaf Salehkalaibar (McMaster University); Stefano Rini (NYCU); Jun Chen (McMaster University)

07 Jun 2023

In federated learning (FL), the communication constraint between the remote learners and the Parameter Server (PS) is a crucial bottleneck. This paper proposes M22, a rate-distortion inspired approach to model update compression for distributed training of deep neural networks (DNNs). In particular, (i) we propose a family of distortion measures referred to as the M-magnitude weighted L2 norm, and (ii) we assume that gradient updates follow an i.i.d. distribution with two degrees of freedom, either generalized normal or Weibull. To measure gradient compression performance under a communication constraint, we define the per-bit accuracy as the optimal improvement in accuracy that one bit of communication brings to the centralized model over the training period. Using this performance measure, we systematically benchmark the choice of gradient distribution and distortion measure. We provide substantial insights into the role of these choices and argue that significant performance improvements can be attained using such a rate-distortion inspired compressor.
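The following is a minimal, illustrative Python sketch of the two ingredients named in the abstract, not the authors' implementation: fitting the two candidate two-parameter gradient models (generalized normal and Weibull) to a vector of gradient entries, and evaluating a magnitude-weighted squared-error distortion between an original and a quantized gradient. The specific weighting |g|^M used here is an assumption for illustration; the precise definition of the M-magnitude weighted L2 norm is given in the paper.

    # Illustrative sketch only; the exact distortion definition is in the paper.
    import numpy as np
    from scipy import stats

    def fit_gradient_distributions(grad):
        """Fit the two candidate models named in the abstract to a flattened gradient."""
        g = np.ravel(grad)
        beta, loc, scale = stats.gennorm.fit(g)                  # generalized normal (shape beta)
        c, _, wscale = stats.weibull_min.fit(np.abs(g), floc=0)  # Weibull fit on magnitudes
        return {"gennorm": (beta, loc, scale), "weibull": (c, wscale)}

    def magnitude_weighted_l2(grad, grad_hat, m=1.0):
        """Assumed magnitude-weighted squared error: larger entries are penalized more."""
        g, gh = np.ravel(grad), np.ravel(grad_hat)
        w = np.abs(g) ** m                                       # assumed weighting |g|^m
        return float(np.sum(w * (g - gh) ** 2) / max(np.sum(w), 1e-12))

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        grad = rng.standard_t(df=4, size=10_000) * 1e-3          # heavy-tailed stand-in for a DNN gradient
        grad_hat = np.round(grad / 1e-4) * 1e-4                  # crude uniform quantizer as the "compressor"
        print(fit_gradient_distributions(grad))
        print("distortion:", magnitude_weighted_l2(grad, grad_hat, m=1.0))

In this reading, the fitted distribution would drive the design of the quantizer/entropy coder, while the weighted distortion determines which gradient entries must be reproduced accurately under a given bit budget.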
