Abstract
Traffic sign detection is a fundamental component of intelligent transportation systems, yet remains challenging due to the small size of signs, visual occlusions, and complex environmental conditions. In this paper, we propose a novel YOLO-based architecture enhanced with multi-scale attention and Transformer modules to address these limitations. Specifically, a Convolutional Block Attention Module (CBAM) is employed to refine spatial and channel-wise features, while a C3 Transformer (C3TR) module introduces multi-head self-attention to capture global contextual information. The proposed enhancements significantly improve the model's ability to detect small and visually degraded traffic signs. Evaluated on the German Traffic Sign Detection Benchmark (GTSDB), our model achieves a mAP@0.5 of 96.75%, mAP@0.5:0.95 of 81.18%, precision of 97.05%, and recall of 95.07%. Compared to YOLOv5s, this reflects relative gains of +11.2% in mAP@0.5, +26.6% in mAP@0.5:0.95, +1.6% in precision, and +20.0% in recall, with a 41.9% reduction in model size. It also outperforms YOLOv8, YOLOv7-tiny, and Faster R-CNN, particularly for degraded signs. For real-time deployment on embedded systems, the model is optimized using NVIDIA TensorRT. This optimization significantly reduces inference latency and computational load while preserving high detection accuracy, making the model well-suited for ADAS and autonomous driving applications.
