Abstract
Deep Neural Networks (DNNs) have powerful recognition abilities and can classify many kinds of objects. Although DNN models can reach very high accuracy, even beyond the human level, they are regarded as black boxes that lack interpretability. During training, DNNs automatically extract abstract features from high-dimensional data such as images. However, the extracted features are usually mapped into a representation space that is not aligned with human knowledge. In some applications, such as medical diagnosis, interpretability is necessary. To align the representation space with human knowledge, this paper proposes a class of DNNs, termed Conceptual Alignment Deep Neural Networks (CADNNs), which produce interpretable representations in their hidden layers. In CADNNs, some hidden neurons are selected as conceptual neurons and trained to extract human-formed concepts, while the remaining hidden neurons, called free neurons, are trained without constraints. All hidden neurons contribute to the final classification result. Experiments demonstrate that CADNNs match the accuracy of standard DNNs despite the extra constraints on the conceptual neurons, and also reveal that in some cases the free neurons learn concepts aligned with human knowledge.
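The following is a minimal sketch of the architecture described above, assuming PyTorch. The single hidden layer, the sigmoid activation, the binary concept targets, and the trade-off weight lam are illustrative assumptions for this sketch, not the paper's exact formulation.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class CADNN(nn.Module):
        """Hidden layer split into conceptual and free neurons."""
        def __init__(self, in_dim, n_concept, n_free, n_classes):
            super().__init__()
            self.hidden = nn.Linear(in_dim, n_concept + n_free)
            self.out = nn.Linear(n_concept + n_free, n_classes)
            self.n_concept = n_concept

        def forward(self, x):
            h = torch.sigmoid(self.hidden(x))
            # The first n_concept units are the "conceptual neurons",
            # supervised to match human-labeled concepts; the remaining
            # units are the "free neurons", trained without constraints.
            concepts = h[:, : self.n_concept]
            # All hidden neurons feed the final classifier.
            logits = self.out(h)
            return logits, concepts

    def cadnn_loss(logits, concepts, y, concept_targets, lam=1.0):
        # Classification loss plus an alignment penalty on the
        # conceptual neurons; lam is a hypothetical trade-off weight.
        cls = F.cross_entropy(logits, y)
        align = F.binary_cross_entropy(concepts, concept_targets)
        return cls + lam * align

Under these assumptions, the alignment term only constrains the conceptual slice of the hidden layer, while the classification loss back-propagates through all hidden neurons, so the free neurons remain unconstrained except by the task itself.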
