49
9

Nonconvex Regularization for Network Slimming:Compressing CNNs Even More

Abstract

In the last decade, convolutional neural networks (CNNs) have evolved to become the dominant models for various computer vision tasks, but they cannot be deployed in low-memory devices due to its high memory requirement and computational cost. One popular, straightforward approach to compressing CNNs is network slimming, which imposes an 1\ell_1 penalty on the channel-associated scaling factors in the batch normalization layers during training. In this way, channels with low scaling factors are identified to be insignificant and are pruned in the models. In this paper, we propose replacing the 1\ell_1 penalty with the p\ell_p and transformed 1\ell_1 (T1\ell_1) penalties since these nonconvex penalties outperformed 1\ell_1 in yielding sparser satisfactory solutions in various compressed sensing problems. In our numerical experiments, we demonstrate network slimming with p\ell_p and T1\ell_1 penalties on VGGNet and Densenet trained on CIFAR 10/100. The results demonstrate that the nonconvex penalties compress CNNs better than 1\ell_1. In addition, T1\ell_1 preserves the model accuracy after channel pruning, and 1/2,3/4\ell_{1/2, 3/4} yield compressed models with similar accuracies as 1\ell_1 after retraining.

View on arXiv
Comments on this paper