Abstract
Pruning is a popular approach for compressing large-scale, computationally expensive neural network models. However, most existing methods rely on hand-crafted pruning criteria, which demand considerable human effort to determine a reasonable pruning strength. One of the main reasons is that different parts of a network exhibit different levels of pruning sensitivity. Our goal is to develop compression methods that adapt to this sensitivity distribution, avoiding severe architectural damage caused by unnecessary pruning. In this paper, we propose a filter texture distribution that influences network training and analyze the sensitivity of the diverse states of this distribution. We first apply a multidimensional penalty method that analyzes the potential sensitivity implied by this texture distribution, producing a pruning-friendly sparse environment. We then construct a lightweight dynamic threshold container to prune the sparse network. By assigning each filter its own suitable threshold at low cost, the number of parameters is reduced substantially without diminishing the contribution of pruning-sensitive layers to the network as a whole. In our experiments, the two texture-distribution-adaptive methods were applied to ResNet and VGG-16 deep neural networks (DNNs) on the CIFAR-10/100 and ImageNet datasets, achieving excellent results in comparison with state-of-the-art pruning methods. Code is available at https://github.com/wangyuzhe27/CDP-and-DTC.
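To make the per-filter thresholding idea concrete, the following is a minimal illustrative sketch, not the paper's actual method: the helper name prune_per_filter, the use of the standard deviation of each filter's absolute weights as the statistic, and the scaling factor alpha are all assumptions for illustration; the paper's texture-distribution criterion and dynamic threshold container are defined in the full text.

import torch
import torch.nn as nn

def prune_per_filter(conv: nn.Conv2d, alpha: float = 0.5) -> torch.Tensor:
    """Zero out weights below a per-filter threshold (illustrative sketch only).

    Each filter receives its own threshold, here alpha times the standard
    deviation of that filter's absolute weights; this statistic and alpha are
    assumptions for illustration, not the paper's texture-distribution rule.
    """
    with torch.no_grad():
        w = conv.weight.data                                        # (out_ch, in_ch, k, k)
        flat = w.view(w.size(0), -1)                                # one row per filter, shares storage with w
        thresholds = alpha * flat.abs().std(dim=1, keepdim=True)    # one cutoff per filter
        mask = (flat.abs() >= thresholds).float()                   # keep/zero decision per weight
        flat.mul_(mask)                                             # prune in place
    return mask.view_as(w)

# Example: prune a single convolution and report its sparsity.
layer = nn.Conv2d(64, 128, kernel_size=3)
mask = prune_per_filter(layer, alpha=0.5)
print(f"sparsity after pruning: {1.0 - mask.mean().item():.2%}")

Because the threshold is derived from each filter's own statistics rather than a single global cutoff, the pruning strength adapts per filter, which mirrors the motivation behind the dynamic threshold container described above.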
