Abstract
In this study, we propose a vision-based mouse controller capable of manipulating on-screen objects from a distance via hand gestures. The proposed hybrid model comprises hand detection, prediction of hand state and direction, and, with the aid of a deep learning algorithm, systematic interpretation of hand gestures to reposition objects on the computer screen. The system is explicitly designed to control the mouse during formal presentations. Moving the hand up and down or right to left moves the mouse pointer, and signals are sent to the system based on the state of the hand: a closed hand sets the mouse button to active mode, while an open hand releases it. The hybrid model consists of two modules: a Single Shot MultiBox Detector (SSD) is used to detect the hand, while a Convolutional Neural Network (CNN) is used for state prediction. For comparison, we performed a similar experiment in which SSD is used for hand detection while a Radial Basis Function Network (RBFN) predicts hand states. In the comparative results of hand-state prediction, SSD+CNN greatly outperformed SSD+RBFN. Because the proposed hybrid model is vision-based, it requires no additional hardware to perform its task. The overall performance of the framework shows that the system is accurate and robust.
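To make the control logic concrete, the following minimal Python sketch maps detected hand-state transitions and hand position to mouse events, as described above (a closed hand presses the button, an open hand releases it, hand motion moves the pointer). The state labels and function are illustrative assumptions, not the paper's implementation; a real system would forward these events to an OS-level mouse API.

```python
from typing import List, Tuple

# Assumed hand-state labels as produced by the state classifier (hypothetical).
OPEN, CLOSED = "open", "closed"

def gesture_to_events(prev_state: str, state: str,
                      pos: Tuple[int, int]) -> List[str]:
    """Translate a hand-state transition and hand position into mouse events."""
    events = [f"move:{pos[0]},{pos[1]}"]  # hand movement drives the pointer
    if prev_state == OPEN and state == CLOSED:
        events.append("press")            # closing the hand activates the button
    elif prev_state == CLOSED and state == OPEN:
        events.append("release")          # opening the hand releases the button
    return events
```

For example, an open-to-closed transition at screen position (120, 45) yields a pointer move followed by a button press.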
