Abstract
Deep convolutional neural networks (CNNs) are difficult to deploy on mobile and portable devices because of their large numbers of parameters and floating-point operations (FLOPs). To tackle this problem, we propose a novel channel pruning method. We use modified squeeze-and-excitation blocks (MSEB) to measure the importance of the channels in the convolutional layers. The unimportant channels, together with the convolutional kernels related to them, are pruned directly, which greatly reduces both the storage cost and the number of computations. For ResNets with basic blocks, we propose an approach that prunes all residual blocks in the same stage consistently, so that the compact network remains dimensionally correct. After pruning, we retrain the compact network from scratch to restore its accuracy. Finally, we evaluate our method on CIFAR-10, CIFAR-100, and ILSVRC-2012. The results indicate that the compact network outperforms the original network when the pruning rate is small; even when the pruning rate is large, the accuracy is maintained or decreases only slightly. On CIFAR-100, when the parameters and FLOPs are reduced by up to 82% and 62%, respectively, the accuracy of VGG-19 even improves by 0.54% after retraining. The source code is available at https://github.com/JingfeiChang/UCP.
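To make the idea concrete, below is a minimal PyTorch sketch of SE-style channel importance scoring followed by channel pruning. This is not the authors' MSEB implementation (see the linked repository for that); the reduction ratio, the keep-ratio thresholding rule, and the helper names `channel_importance` and `prune_conv` are illustrative assumptions.

```python
# Minimal sketch (assumed, not the paper's code): reuse SE excitation
# weights as per-channel importance scores, then drop low-scoring
# output channels and their kernels from a convolution.
import torch
import torch.nn as nn


class SEBlock(nn.Module):
    """Standard squeeze-and-excitation block; its excitation outputs
    double as per-channel importance scores."""

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        s = x.mean(dim=(2, 3))                   # squeeze: global average pool -> (N, C)
        w = self.fc(s)                           # excitation weights in (0, 1)
        return x * w.unsqueeze(-1).unsqueeze(-1)


@torch.no_grad()
def channel_importance(se: SEBlock, feats: torch.Tensor) -> torch.Tensor:
    """Average the excitation weights over a batch of feature maps to
    obtain one importance score per channel."""
    s = feats.mean(dim=(2, 3))
    return se.fc(s).mean(dim=0)                  # shape (C,)


@torch.no_grad()
def prune_conv(conv: nn.Conv2d, scores: torch.Tensor,
               keep_ratio: float = 0.5) -> nn.Conv2d:
    """Keep the highest-scoring output channels and copy their kernels.
    (The paper retrains the compact network from scratch, so copying
    the surviving weights is optional.)"""
    k = max(1, int(conv.out_channels * keep_ratio))
    keep = scores.topk(k).indices
    pruned = nn.Conv2d(conv.in_channels, k, conv.kernel_size,
                       conv.stride, conv.padding,
                       bias=conv.bias is not None)
    pruned.weight.copy_(conv.weight[keep])
    if conv.bias is not None:
        pruned.bias.copy_(conv.bias[keep])
    return pruned
```

Note that pruning a layer's output channels also changes the input channel count of the next layer, which is why, as the abstract states, residual blocks within a stage must be pruned consistently to keep the network dimensionally correct.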
