Online Weight Pruning Via Adaptive Sparsity Loss
George Retsinas, Athena Elafrou, Georgios Goumas, Petros Maragos
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:08:16
Pruning neural networks has regained interest in recent years as a means to compress state-of-the-art deep neural networks and enable their deployment on resource-constrained devices. In this paper, we propose a robust sparsity controlling framework that efficiently prunes network parameters during training with minimal computational overhead. We incorporate fast mechanisms to prune individual layers and build upon these to automatically prune the entire network under a user-defined budget constraint. Key to our end-to-end network pruning approach is the formulation of an intuitive and easy-to-implement adaptive sparsity loss used to explicitly control sparsity during training, enabling efficient budget-aware optimization.