Abstract
Facial expressions play a vital role in non-verbal communication, conveying a wide range of emotions and messages. Although prior research has achieved notable advances through architecture design or dataset-specific optimization, few studies have integrated multiple advanced techniques into a unified facial expression recognition (FER) pipeline. Addressing this gap, we propose a comprehensive approach that combines (i) multiple pre-trained CNNs, (ii) MTCNN-based face detection for improved facial region localization, and (iii) Grad-CAM-based interpretability. While MTCNN enhances the quality of face localization, it may slightly affect classification accuracy by focusing on cleaner yet more challenging samples. We evaluate four pre-trained models (DenseNet121, ResNet-50, ResNet-18, and MobileNetV2) on two datasets: RAF-DB and Cleaned-FER2013. The proposed pipeline demonstrates consistent improvements in interpretability and overall system robustness. The results emphasize that integrating face detection, transfer learning, and interpretability techniques within a single framework can significantly enhance the transparency and reliability of FER systems. Combining FER with EEG-based systems significantly enhances the emotional intelligence of brain-computer interfaces, enabling more adaptive and personalized user experiences. In this way, the paper bridges the gap between affective computing and cognitive neuroscience, aligning closely with EEG-centered interaction methodologies. In addition, understanding the relationship between facial expressions of emotion and EEG signals will be an important contribution to the literature.
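The sketch below is a minimal illustration of the three-stage pipeline the abstract describes (MTCNN face detection, a pre-trained CNN backbone, and Grad-CAM visualization), assuming PyTorch with the facenet-pytorch MTCNN implementation and a torchvision ResNet-18. The seven-class output head, the choice of `layer4` for Grad-CAM, and the input file name are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sketch: MTCNN face crop -> pre-trained CNN -> Grad-CAM heatmap.
# Assumptions (not from the paper): 7 expression classes, Grad-CAM on
# ResNet-18's layer4, and an input image "face.jpg".
import torch
import torch.nn.functional as F
from facenet_pytorch import MTCNN
from torchvision import models
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"

# (ii) MTCNN face detection: localize and crop the facial region.
detector = MTCNN(image_size=224, margin=20, device=device)

# (i) Pre-trained CNN backbone with a fine-tuned 7-class expression head.
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.fc = torch.nn.Linear(model.fc.in_features, 7)
model.eval().to(device)

# (iii) Grad-CAM: hook activations and gradients at the last conv block,
# then weight each feature map by its spatially pooled gradient.
acts, grads = {}, {}
model.layer4.register_forward_hook(lambda m, i, o: acts.update(v=o))
model.layer4.register_full_backward_hook(lambda m, gi, go: grads.update(v=go[0]))

def grad_cam(face_batch):
    logits = model(face_batch)
    cls = int(logits.argmax())          # explain the predicted class
    model.zero_grad()
    logits[0, cls].backward()
    weights = grads["v"].mean(dim=(2, 3), keepdim=True)  # pooled gradients
    cam = F.relu((weights * acts["v"]).sum(dim=1))       # weighted sum + ReLU
    cam = cam / (cam.max() + 1e-8)                       # normalize to [0, 1]
    return cls, cam

img = Image.open("face.jpg")            # hypothetical input image
face = detector(img)                    # aligned crop tensor, or None
if face is not None:
    pred, heatmap = grad_cam(face.unsqueeze(0).to(device))
    print("predicted class:", pred, "CAM shape:", tuple(heatmap.shape))
```

Upsampling `heatmap` to the crop size and overlaying it on the detected face would reproduce the kind of interpretability visualization the pipeline targets; a trained FER head, rather than the randomly initialized one above, would be needed for meaningful maps.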
