Optimized Three-Fold Deep Learning Model and ROI Identification for Video-Based Human Activity Recognition

Abstract

To address these issues in human activity recognition (HAR), a novel three-fold deep learning model is developed. First, the collected raw video undergoes pre-processing, which comprises video-to-frame conversion, Wiener filtering, and contrast enhancement based on contrast limited adaptive histogram equalization (CLAHE). Features including monogenic binary coding, binary pattern of phase congruency, local Gabor transitional pattern, and chessboard median binary pattern are then extracted from the identified region of interest (ROI). The optimal features are selected from the extracted set using artificial ecosystem customized bald eagle optimization. In the activity classification phase, the three-fold deep learning model is trained on the selected optimal features. This phase combines three deep learning models: a convolutional neural network (CNN), an optimized recurrent neural network (optimized RNN), and a bidirectional long short-term memory network (Bi-LSTM). The proposed model attains the highest detection accuracy, with 94.3%, 94.39%, 92.2%, and 94.03% at learning percentages of 60, 70, 80, and 90, respectively. Two datasets are used: the UCF101 Action Recognition Data Set and the Human Action Clips and Segments (HACS) Dataset for Recognition and Temporal Localization. One limitation is that the lengthy frame-by-frame video processing and feature extraction procedure may degrade real-time performance. Finally, the proposed approach is compared with conventional models to demonstrate its effectiveness.
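The pre-processing stage named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes frames are already decoded into an 8-bit grayscale NumPy array of shape (T, H, W). The function names `box_mean`, `wiener_denoise`, `clipped_equalize`, and `preprocess` are hypothetical, as are the window size and clip limit. The contrast step is a simplified stand-in for CLAHE: it clips and equalizes one histogram per whole frame, whereas full CLAHE works on tiles and interpolates between them.

```python
import numpy as np

def box_mean(img, k):
    """Mean over a k x k window (k odd), with edge padding."""
    pad = k // 2
    padded = np.pad(img, pad, mode="edge")
    # Summed-area table with a leading row/column of zeros.
    sat = np.pad(padded, ((1, 0), (1, 0))).cumsum(0).cumsum(1)
    h, w = img.shape
    total = (sat[k:k + h, k:k + w] - sat[:h, k:k + w]
             - sat[k:k + h, :w] + sat[:h, :w])
    return total / (k * k)

def wiener_denoise(img, k=3, noise_var=None):
    """Local adaptive Wiener filter based on local mean and variance."""
    img = img.astype(np.float64)
    mean = box_mean(img, k)
    var = np.maximum(box_mean(img * img, k) - mean * mean, 0.0)
    if noise_var is None:
        # Assumption: estimate the noise power as the average local variance.
        noise_var = var.mean()
    gain = np.maximum(var - noise_var, 0.0) / np.maximum(var, max(noise_var, 1e-12))
    return mean + gain * (img - mean)

def clipped_equalize(img, clip_limit=2.0, bins=256):
    """Global contrast-limited histogram equalization (simplified, untiled)."""
    img = np.clip(np.rint(img), 0, bins - 1).astype(np.int64)
    hist = np.bincount(img.ravel(), minlength=bins).astype(np.float64)
    limit = clip_limit * hist.mean()
    excess = np.maximum(hist - limit, 0.0).sum()
    # Clip tall bins and spread the clipped mass evenly over all bins.
    hist = np.minimum(hist, limit) + excess / bins
    cdf = hist.cumsum() / hist.sum()
    lut = np.rint(cdf * (bins - 1)).astype(np.uint8)
    return lut[img]

def preprocess(frames):
    """Denoise, then enhance contrast, frame by frame."""
    return np.stack([clipped_equalize(wiener_denoise(f)) for f in frames])
```

Calling `preprocess(frames)` on a `(T, H, W)` array returns a `uint8` array of the same shape. The per-frame loop also shows why the abstract flags frame-by-frame processing as a real-time bottleneck: the cost grows linearly with the number of frames.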

Keywords

HAR; improved median binary pattern (I-MBP); improved watershed algorithm; three-fold deep learning model

