BYZANTINE-ROBUST AND COMMUNICATION-EFFICIENT DISTRIBUTED NON-CONVEX LEARNING OVER NON-IID DATA
Xuechao He, Heng Zhu, Qing Ling
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:10:32
Motivated by the emerging federated learning applications, we jointly consider the problems of Byzantine-robustness and communication efficiency in distributed non-convex learning over non-IID data. We propose a compressed robust stochastic model aggregation (C-RSA) method, which applies the idea of robust stochastic model aggregation to achieve Byzantine-robustness over non-IID data, while compresses the transmitted messages so as to achieve communication efficiency. Utilizing the tools of Moreau envelope and proximal point projection, we establish the convergence of C-RSA for distributed non-convex learning problems. Numerical experiments on training a large-scale neural network demonstrate the effectiveness of the proposed C-RSA method.