Neural Collapse in Deep Homogeneous Classifiers and the role of Weight Decay
Akshay Rangamani, Andrzej Banburski-Fahey
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:10:06
Neural Collapse is a phenomenon recently discovered in deep classifiers where the last layer activations collapse onto their class means, while the means and last layer weights take on the structure of dual equiangular tight frames. In this paper we present results showing the role of weight decay in the emergence of Neural Collapse in deep homogeneous networks. We show that certain near-interpolating minima of deep networks satisfy the Neural Collapse condition, and this can be derived from the gradient flow on the regularized square loss. We also show that weight decay is necessary for neural collapse to occur. We support our theoretical analysis with experiments that confirm our results.