Abstract
Hand Gesture Recognition (HGR) has become a vital approach for enabling medical professionals to monitor patients and mitigate health risks. Advanced deep learning architectures have recently been applied widely to recognize hand gesture signs. Despite these advancements, balancing accuracy and efficiency remains a major constraint for current models. Hence, advanced object detection methods, such as You Only Look Once (YOLO), have been increasingly adopted to bridge this gap. To this end, this work designs a lightweight hand gesture recognition model by developing a feature extraction strategy and a hybrid metaheuristic optimization for the YOLOv5s network. First, key features are extracted from RGB, depth, and skeleton hand gesture images, covering inter-frame, intra-frame, and finger features. Second, the backbone of the YOLOv5s network is replaced with ResNet50 to maintain the trade-off between accuracy and efficiency through concise learning of gesture patterns. It captures multiple dimensions of finger features, such as direction, shape, and count, to improve gesture sign detection. Finally, the proposed model employs a novel hybrid metaheuristic that combines a Genetic Algorithm with the Crow Search Algorithm (GCSA), significantly increasing speed and improving quality by selecting an optimal set of hyperparameters. Experimental results on the Praxis hand gesture dataset show the superiority of the proposed YOLOv5s-based model.
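The abstract does not specify the internals of the GCSA hyperparameter search, but its general structure (a GA phase of selection, crossover, and mutation followed by a Crow Search phase of memory-guided moves) can be sketched as below. This is a minimal illustration under assumptions: the hyperparameter names (lr, momentum, weight_decay), their ranges, the placeholder fitness function, and all algorithmic constants (mutation rate, flight length, awareness probability) are hypothetical and not taken from the paper; in practice the fitness would be the validation accuracy or mAP of the ResNet50-backbone YOLOv5s trained with the candidate hyperparameters.

```python
import random

# Hypothetical hyperparameter ranges for YOLOv5s training (assumed, not from the paper).
BOUNDS = {"lr": (1e-4, 1e-1), "momentum": (0.6, 0.98), "weight_decay": (0.0, 1e-3)}

def random_candidate():
    return {k: random.uniform(lo, hi) for k, (lo, hi) in BOUNDS.items()}

def clip(cand):
    return {k: min(max(v, BOUNDS[k][0]), BOUNDS[k][1]) for k, v in cand.items()}

def fitness(cand):
    # Placeholder objective; in the paper this would be the validation mAP of
    # the ResNet50-backbone YOLOv5s trained with these hyperparameters.
    return -((cand["lr"] - 0.01) ** 2
             + (cand["momentum"] - 0.937) ** 2
             + (cand["weight_decay"] - 5e-4) ** 2)

def crossover(p1, p2):
    # GA phase: uniform crossover between two parent candidates.
    return {k: random.choice([p1[k], p2[k]]) for k in BOUNDS}

def mutate(cand, rate=0.2):
    # GA phase: Gaussian mutation applied to each gene with probability `rate`.
    out = dict(cand)
    for k, (lo, hi) in BOUNDS.items():
        if random.random() < rate:
            out[k] += random.gauss(0.0, 0.1 * (hi - lo))
    return clip(out)

def crow_move(cand, memory, flight_length=2.0, awareness=0.1):
    # CSA phase: follow another crow's remembered position, or relocate
    # randomly if that crow "notices" it is being followed.
    if random.random() < awareness:
        return random_candidate()
    return clip({k: cand[k] + random.random() * flight_length * (memory[k] - cand[k])
                 for k in BOUNDS})

def gcsa(pop_size=20, generations=30):
    population = [random_candidate() for _ in range(pop_size)]
    memories = list(population)  # each crow remembers its best-known position
    for _ in range(generations):
        # GA phase: keep the better half as parents, then recombine and mutate.
        parents = sorted(population, key=fitness, reverse=True)[: pop_size // 2]
        offspring = [mutate(crossover(random.choice(parents), random.choice(parents)))
                     for _ in range(pop_size)]
        # CSA phase: each offspring follows the memory of a randomly chosen crow.
        population = [crow_move(c, random.choice(memories)) for c in offspring]
        # Update memories wherever a new position improves on the stored one.
        memories = [p if fitness(p) > fitness(m) else m
                    for p, m in zip(population, memories)]
    return max(memories, key=fitness)

if __name__ == "__main__":
    print("Best hyperparameters found:", gcsa())
```

With a real objective plugged in, the returned dictionary would supply the hyperparameters used to train the final detector.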
