Ananthi M and Vijayakumar K (2021) Stock market analysis using candlestick regression and market trend prediction (CKRM). Journal of Ambient Intelligence and Humanized Computing 12(5): 4819–4826.
Chollet F (2015) Keras. Available at: https://keras.io (accessed 5 August 2020).
Clevert DA, Unterthiner T and Hochreiter S (2015) Fast and accurate deep network learning by exponential linear units (ELUs). arXiv preprint arXiv:1511.07289.
Dey S, Koshinaka T, Motlicek P, et al. (2018) DNN based speaker embedding using content information for text-dependent speaker verification. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), Calgary, AB, Canada, 15–20 April 2018, pp.5344–5348. New York, NY: IEEE.
Dua D and Graff C (2019) UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science. Available at: http://archive.ics.uci.edu/ml (accessed 5 August 2020).
Duchi J, Hazan E and Singer Y (2011) Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 12(7): 2121–2159.
Glorot X, Bordes A and Bengio Y (2011) Deep sparse rectifier neural networks. In: Fourteenth international conference on artificial intelligence and statistics, Fort Lauderdale, FL, USA, vol. 15, pp.315–323. PMLR.
Hahnloser RH, Sarpeshkar R, Mahowald MA, Douglas RJ and Seung HS (2000) Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature 405(6789): 947–951.
He K, Zhang X, Ren S, et al. (2015) Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification. In: IEEE international conference on computer vision, Santiago, Chile, pp.1026–1034. New York, NY: IEEE.
Hendrycks D and Gimpel K (2016) Gaussian error linear units (GELUs). arXiv preprint arXiv:1606.08415.
Hochreiter S (1991) Untersuchungen zu dynamischen neuronalen Netzen [Investigations on dynamic neural networks]. Diploma Thesis, Technische Universität München 91(1).
Hochreiter S, Bengio Y, Frasconi P, et al. (2001) Gradient flow in recurrent nets: The difficulty of learning long-term dependencies. In: Kolen JF and Kremer SC (eds) A Field Guide to Dynamical Recurrent Neural Networks. New York, NY: IEEE Press, pp.237–244.
Kadam V, Jadhav S and Vijayakumar K (2019) Breast cancer diagnosis using feature ensemble learning based on stacked sparse autoencoders and softmax regression. Journal of Medical Systems 43: 263.
Kingma DP and Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Klambauer G, Unterthiner T, Mayr A, et al. (2017) Self-normalizing neural networks. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S and Garnett R (eds) Advances in Neural Information Processing Systems. NY, USA: Curran Associates Inc., pp.971–980.
Krizhevsky A, Sutskever I and Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Pereira F, Burges CJC, Bottou L and Weinberger KQ (eds) Advances in Neural Information Processing Systems. NY, USA: Curran Associates Inc., pp.1097–1105.
Kurian B and Jyothi V (2021) Breast cancer prediction using an optimal machine learning technique for next generation sequences. Concurrent Engineering 29(1): 49–57.
LeCun Y, Bengio Y and Hinton G (2015) Deep learning. Nature 521(7553): 436–444.
Maas AL, Hannun AY and Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th international conference on machine learning (ICML), Atlanta, GA, USA. PMLR.
Mangasarian OL, Street WN and Wolberg WH (1995) Breast cancer diagnosis and prognosis via linear programming. Operations Research 43(4): 570–577.
Menon A, Mehrotra K, Mohan CK, et al. (1996) Characterization of a class of sigmoid functions with applications to neural networks. Neural Networks 9(5): 819–835.
Nair V and Hinton GE (2010) Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th international conference on machine learning (ICML), Haifa, Israel.
Nwankpa C, Ijomah W, Gachagan A, et al. (2018) Activation functions: Comparison of trends in practice and research for deep learning. arXiv preprint arXiv:1811.03378.
Pedamonti D (2018) Comparison of non-linear activation functions for deep neural networks on MNIST classification task. arXiv preprint arXiv:1804.02763.
Qin Y, Wang X and Zou J (2018) The optimized deep belief networks with improved logistic sigmoid units and their application in fault diagnosis for planetary gearboxes of wind turbines. IEEE Transactions on Industrial Electronics 66(5): 3814–3824.
Ramachandran P, Zoph B and Le QV (2017b) Searching for activation functions. arXiv preprint arXiv:1710.05941.
Reddi SJ, Kale S and Kumar S (2019) On the convergence of Adam and beyond. arXiv preprint arXiv:1904.09237.
Rose JD, Jaspin K and Vijayakumar K (2021) Lung cancer diagnosis based on image fusion and prediction using CT and PET image. In: Priya E and Rajinikanth V (eds) Signal and Image Processing Techniques for the Development of Intelligent Healthcare Systems. Singapore: Springer, pp.67–86.
Schaul T, Antonoglou I and Silver D (2013) Unit tests for stochastic optimization. arXiv preprint arXiv:1312.6055.
Srivastava N, Hinton G, Krizhevsky A, et al. (2014) Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research 15(1): 1929–1958.
Street WN, Wolberg WH and Mangasarian OL (1993) Nuclear feature extraction for breast tumor diagnosis. In: Biomedical image processing and biomedical visualization, 29 July 1993, vol. 1905, pp.861–870. International Society for Optics and Photonics.
Sutskever I, Martens J, Dahl G, et al. (2013) On the importance of initialization and momentum in deep learning. In: 30th international conference on machine learning (ICML-13), pp.1139–1147.
Szegedy C, Liu W, Jia Y, et al. (2015) Going deeper with convolutions. In: IEEE conference on computer vision and pattern recognition, Boston, USA, pp.1–9. New York, NY: IEEE.
Tieleman T and Hinton G (2012) Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. Lecture. COURSERA: Neural Networks for Machine Learning.
Wolberg WH, Street WN and Mangasarian OL (1994) Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Letters 77(2–3): 163–171.
Zeiler MD (2012) ADADELTA: An adaptive learning rate method. arXiv preprint arXiv:1212.5701.
Zhang C and Woodland PC (2015) Parameterised sigmoid and ReLU hidden activation functions for DNN acoustic modelling. In: Sixteenth annual conference of the international speech communication association, Dresden, Germany, 6–10 September 2015.
Zhang C and Woodland PC (2016) DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), Shanghai, China, 20–25 March 2016, pp.5300–5304. New York, NY: IEEE.