Application of Multi-Column Heterogeneous Convolutional Neural Networks in image classification

Abstract

Image classification is an important research direction of computer vision. Convolutional neural network is a deep feedforward neural network model. It uses the deep learning idea and shows good performance in multiple image classification fields such as speech recognition, face recognition, motion analysis, and medical diagnosis. However, a single-structure convolutional neural network is prone to overfitting problems. The main reason for the overfitting problem is that the learning model overfits the training set and results in the lack of generalization performance, which affects the feature extraction and judgment of the test set.

This paper presents a structure model for Multi-Column Heterogeneous Convolutional Neural Networks. Multi-Column Heterogeneous Convolutional Neural Networks are used in image classification. We construct several convolutional neural networks with different structures by setting different size of convolution kernels and different number of feature maps. Image features are learned from multiple perspectives. Each convolutional neural network model is trained on the training set, and the different network models are fitted to the training set. Finally, through the sliding window, the output of each network is fused to obtain a relatively better prediction result. Experiments show that Multi-Column Heterogeneous Convolutional Neural Networks reduce the overfitting problem to a certain extent, and the accuracy of object recognition is improved compared to the single structure convolutional neural network.

Keywords

Image classification Multi-Column Heterogeneous Convolutional Neural Network convolutional neural network

Get full access to this article

View all access options for this article.

References

Szegedy

Liu

Jia

et al., Going deeper with convolutions, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Conference on Boston, USA (2015), 1–9.

Lowe

D.G.

, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision 60(2) (2004), 91–110.

Ciresan

Meier

and Schmidhuber

, Multi-column deep neural networks for image classification, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Providence (2012), 3642–3649.

Rosten

and Drummond

, Machine Learning for High-speed Corner Detection, Computer Vision-ECCV 2006, Springer Berlin Heidelberg, 2006, 430–443.

Liu

H.M

and Zheng

Z.Q

, Color classification method based on linear classifier for hybrid spatial lookup table, Chinese Journal of Image Graphics 13(1) (2008), 104–108.

Tang

F.X.

and Yang

Y.F.

, Research of color image segmentation algorithm based on asymmetric kernel density estimation, Journal of Computational Methods in Sciences and Engineering 17(3) (2017), 455–462.

Bay

Tuytelaars

and Gool

L.V

, SURF: Speeded Up Robust Features, European Conference on Computer Vision, Springer-Verlag, 2006, 404–417.

Simon

, Neural Network: A Comprehensive Foundation, Neural Networks: A Comprehensive Foundation, Prentice Hall PTR, 1994, 71–80.

K.M

Zhang

X.Y

Ren

S.Q

et al., Deep Residual Learning for Image Recognition, IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, 2016, 770–778.

10.

Simonyan

and Zisserman

, Very deep convolutional networks for large-scale image recognition, Computer Science, 2014.

11.

Luan

L.H.

and Ji

G.L.

, Research on Decision Tree Classification Technology, Computer Engineering 30(9) (2004), 94–96.

12.

Sonka

Hlavac

and Boyle

, Image processing, analysis, and machine Vision, Beijing: Posts & Tecom Press, 2003.

13.

Haralick

R.M

, Texture features for image classification, IEEE Transactions on Systems Man & Cybemetics, smc-3(6) (1973), 610–621.

14.

Joachims

, Making Large-Scale SVM Learning Parctical, Technische Universitat Dortmund, Sonderforschungsbereich Komplexitatsreduktion in multivariaten Datenstrukturen, 1998, 499–526.

15.

Xue

X.R.

Wang

J.P.

Xiang

and Wang

H.F.

, An efficient method of SAR image segmentation based on texture feature, Journal of Computational Methods in Sciences and Engineering 16(4) (2016), 855–864.

16.

Meng

X.Z.

, Concept-based shape classification and recognition, Computer Engineering and Applications 43(3) (2007), 166–167.

17.

Lecun

Bottou

Bengio

et al., Gradient-based learning applied to document recognition, Proceedings of the IEEE 86(11) (1998), 2278–2324.