Abstract
As the primary operators of vehicles, drivers play a decisive role in traffic safety. Numerous studies have identified driver behavior as a major contributing factor in traffic crashes, with distracted driving among the most frequent and dangerous causes. To mitigate these risks, recent work has focused on real-time driver-monitoring systems built on vision-based models. However, existing models often rely solely on global image features and struggle to balance recognition accuracy with inference efficiency. In this study, we propose a novel pose-guided multilevel fusion network (PG-MFNet). Specifically, driver pose features are introduced to direct attention toward behavior-relevant local regions near body keypoints while also modeling the spatial relationships among body parts. A multilevel fusion strategy then progressively integrates low-level geometric contours, mid-level structural patterns, and high-level semantic cues, enabling comprehensive behavioral understanding from fine-grained detection to global interpretation. Moreover, we introduce a feature conditional attention module that dynamically adjusts class-specific representations based on inter-class differences, enhancing discriminability across behavior classes. Furthermore, to support training under varied real-world scenarios, we construct SAA13, a large-scale dataset that aggregates diverse drivers, driving contexts, and sensor viewpoints from multiple sources. Experimental results show that PG-MFNet achieves 92.16% accuracy at 68.1 frames per second (FPS), outperforming state-of-the-art (SOTA) models in balancing performance and efficiency. These advances offer a practical and scalable solution for real-time distracted-driving detection and driver monitoring, providing reliable behavioral tracking for intelligent transportation systems.
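The progressive fusion idea described above can be sketched in a few lines. The snippet below is only an illustrative toy, not the authors' implementation: all layer widths, weight shapes, and function names (`project`, `multilevel_fuse`) are invented for the example. It shows the general pattern of folding lower-level features into successively higher-level ones before a final classification over behavior classes.

```python
import numpy as np

rng = np.random.default_rng(0)

def project(x, w):
    """Linear projection followed by ReLU (hypothetical fusion layer)."""
    return np.maximum(x @ w, 0.0)

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def multilevel_fuse(low, mid, high, w_low, w_mid, w_high):
    """Progressively integrate low-, mid-, and high-level features."""
    f = project(low, w_low)                         # geometric contours
    f = project(np.concatenate([f, mid]), w_mid)    # + structural patterns
    f = project(np.concatenate([f, high]), w_high)  # + semantic cues
    return f

# Toy dimensions (assumed): 16-d low, 32-d mid, 64-d high features,
# and 13 behavior classes (matching the SAA13 naming, purely illustrative).
d = 64
w_low  = rng.standard_normal((16, d)) * 0.1
w_mid  = rng.standard_normal((d + 32, d)) * 0.1
w_high = rng.standard_normal((d + 64, d)) * 0.1
w_cls  = rng.standard_normal((d, 13)) * 0.1

fused = multilevel_fuse(rng.standard_normal(16),
                        rng.standard_normal(32),
                        rng.standard_normal(64),
                        w_low, w_mid, w_high)
probs = softmax(fused @ w_cls)  # probability over the 13 behavior classes
```

In practice the fused vector would feed the conditional attention module before classification; here a plain linear head stands in for it.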
