Scalable and Secure Federated XGBoost

Quang M Nguyen (Massachusetts Institute of Technology); Nhan Khanh Le (TUM); Lam M Nguyen (IBM Research, Thomas J. Watson Research Center)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

06 Jun 2023

Federated learning (FL) is the distributed machine learning framework that enables collaborative training across multiple parties while ensuring data privacy. Practical adaptation of XGBoost, the state-of-the-art tree boosting framework, to FL remains nascent due to high cost incurred by conventional privacy-preserving methods. Such limitations can be attributed to the lack of formal analytical model to enable new privacy methods well customized to federated XGBoost. To this end, we propose a novel formulation, termed splitting matrix, in the context of federated XGBoost that mathematically characterizes the role of passive party (PP) having been neglected in the literature. This new formulation facilitates our novel adoption of secure matrix multiplication protocol into federated XGBoost to propose FedXGBoost as a framework for secure XGBoost in federated setting with lossless accuracy and negligible overhead. Extensive experiments on both synthetic and real datasets exhibit our algorithm's empirical outperformance over known methods in the literature.

Tags:

Distributed/Federated learning

Scalable and Secure Federated XGBoost

Quang M Nguyen (Massachusetts Institute of Technology); Nhan Khanh Le (TUM); Lam M Nguyen (IBM Research, Thomas J. Watson Research Center)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

DPP-based Client Selection for Federated Learning with Non-IID Data

FFedCL: Fair Federated Learning with Contrastive Learning

Multi-Agent Adversarial Training Using Diffusion Learning

Join the IEEE Signal Processing Society