Abstract
Emotion recognition models infer human thoughts, feelings, and emotions from facial images. The wide variability of facial expressions makes it challenging to extract emotions from face images. The main focus of this research is to extract emotions from facial images and emotional speech using deep learning models. Previously proposed methods suffer from performance degradation caused by poor layer selection as well as low accuracy. In the proposed model, data is gathered and preprocessed to improve image quality for more accurate emotion recognition. Region extraction is carried out using a Faster Region-based Convolutional Neural Network (Faster R-CNN) with a standard ResNet-101 backbone. A pretrained feature extractor, termed ResLeNet, is then built by concatenating features from the standard ResNet-101 and GoogLeNet models. To classify emotions accurately, an activational attention layer coupled deep learning model (ALNN-EmR) is proposed, built on a bald hawks-based deep convolutional neural network (bald hawks-deep CNN). Using the ResLeNet features, the ALNN-EmR model recognizes emotions, with its weights and biases tuned by the bald hawk optimization (BHO) algorithm. The proposed ALNN-EmR model is implemented, and its effectiveness is demonstrated through analysis of emotional speech and video-based data.
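The ResLeNet feature fusion described in the abstract can be sketched as a simple concatenation of the two backbone feature vectors. This is a minimal illustration, not the authors' implementation: the feature dimensions (2048 for ResNet-101's pooled output, 1024 for GoogLeNet's) follow the standard pretrained backbones, and the random vectors below merely stand in for real extracted features.

```python
import numpy as np

# Hypothetical pooled feature vectors for one face image, simulated here
# with random values; in practice these would come from the penultimate
# layers of pretrained ResNet-101 (2048-d) and GoogLeNet (1024-d).
rng = np.random.default_rng(0)
resnet_features = rng.standard_normal(2048)
googlenet_features = rng.standard_normal(1024)

def reslenet_features(res_feat, goog_feat):
    """Concatenate the two backbone feature vectors into a single
    'ResLeNet' descriptor, as the abstract describes."""
    return np.concatenate([res_feat, goog_feat])

fused = reslenet_features(resnet_features, googlenet_features)
print(fused.shape)  # (3072,)
```

The fused 3072-dimensional descriptor would then be fed to the attention-coupled classifier; how the paper normalizes or weights the two feature sets before concatenation is not specified in the abstract.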
