NTLPCBLM: Non-motorized traffic lane participants crossing behavior language model

Abstract

The crossing behavior of non-motorized traffic lane participants plays a critical role in both personal safety and overall traffic efficiency. This study proposes a language-model-based framework for analyzing the crossing behaviors of non-motorized lane users. First, images are preprocessed with the YOLOv8 algorithm to detect key body parts, including the head, hands, and legs. An optimized prompting strategy is then integrated with a visual large language model (LLM) to adapt it to pedestrian-related tasks. In addition, a chain-of-thought inference module is incorporated to strengthen the model's reasoning ability, thereby improving behavior classification and risk assessment. Experimental results show that, under zero-shot learning, the proposed model improves accuracy by 11.74% compared with other LLMs, and under few-shot learning, it surpasses traditional neural networks by 2.97%. These findings indicate that the method improves the accuracy and robustness of pedestrian crossing behavior recognition and maintains strong performance in data-scarce scenarios, offering support for improving road safety.
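As a rough illustration of how detected body parts might feed a chain-of-thought prompt, the following minimal Python sketch assembles such a prompt from a list of detections. It is not the authors' implementation: the `Detection` structure, the `build_cot_prompt` function, the confidence threshold, and the wording of the reasoning steps are all assumptions made for this example, and the detector and the visual LLM call are left out entirely.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Detection:
    """One detected body part (hypothetical structure, not from the paper)."""
    label: str                      # e.g. "head", "hand", "leg"
    confidence: float               # detector score in [0, 1]
    box: Tuple[int, int, int, int]  # (x1, y1, x2, y2) in pixels


# Illustrative reasoning steps; the paper's actual prompt is not given here.
COT_STEPS = [
    "Describe the posture and orientation of each detected body part.",
    "Infer what the person is doing while crossing.",
    "Classify the crossing behavior.",
    "Rate the risk as low, medium, or high and justify the rating.",
]


def build_cot_prompt(detections: List[Detection], min_conf: float = 0.5) -> str:
    """Assemble a chain-of-thought prompt from detections scoring >= min_conf."""
    kept = [d for d in detections if d.confidence >= min_conf]
    lines = ["Detected body parts:"]
    for d in kept:
        lines.append(f"- {d.label} at {d.box} (confidence {d.confidence:.2f})")
    lines.append("Reason step by step:")
    for i, step in enumerate(COT_STEPS, start=1):
        lines.append(f"{i}. {step}")
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [
        Detection("head", 0.91, (120, 40, 160, 85)),
        Detection("hand", 0.78, (100, 130, 125, 160)),
        Detection("leg", 0.32, (110, 200, 150, 300)),  # dropped: below threshold
    ]
    print(build_cot_prompt(demo))
```

The resulting text would be passed, together with the image, to whichever visual LLM is in use; numbering the steps explicitly is one simple way to encourage the model to reason before it classifies.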

Keywords

prompt template; logical chain of reasoning; large language model; behavior analysis; risk assessment
