Abstract
Safe and efficient decision-making for autonomous vehicles in complex dynamic scenarios requires a decision-making system with human-like cognitive ability; however, existing deep reinforcement learning methods suffer from poor generalization to unknown scenarios and from an imbalance between exploration and exploitation. To address these problems, this study proposes a novel prefrontal cortex (PFC) decision-making model that builds a multi-module synergistic cognitive architecture by modeling the spatio-temporal reasoning of the lateral prefrontal cortex (LPFC), the reward prediction of the medial prefrontal cortex (MPFC), and the adaptive adjustment function of the anterior cingulate cortex (ACC). The innovations are: fusing a graph convolutional network (GCN) and a long short-term memory network (LSTM) to capture vehicle interaction features; introducing unsupervised clustering and a deep belief network (DBN) to achieve metacognitive planning of action-reward causal associations; proposing a dynamic exploration-rate regulation mechanism based on alertness; and realizing strategy optimization in complex scenarios through a dopamine-based reward prediction error. This study evaluates the method in highway and roundabout scenarios and compares it with existing deep reinforcement learning (DRL) and graph reinforcement learning (GRL) methods. The experimental results show that the PFC model can perform spatio-temporal and task reasoning, makes better decisions in complex and changing scenarios, and significantly improves traffic efficiency. This work can serve as a reference for the development of biologically inspired neural models and promote their application in dynamic traffic interaction scenarios.
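The abstract names two mechanisms without giving formulas: a dopamine-based reward prediction error and an alertness-driven exploration rate. A minimal sketch of how these could fit together, assuming the reward prediction error is the standard temporal-difference (TD) error and that "alertness" grows with the magnitude of recent prediction errors (both function names and the saturating mapping are hypothetical, not taken from the paper):

```python
import math

def td_error(reward: float, v_s: float, v_next: float, gamma: float = 0.99) -> float:
    """Dopamine-style reward prediction error, modeled here as the TD error:
    delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * v_next - v_s

def alertness_epsilon(eps_min: float, eps_max: float, delta: float, k: float = 1.0) -> float:
    """Hypothetical alertness-based exploration rate: a large |delta| (surprising
    outcome) raises alertness, which pushes epsilon toward eps_max; when
    predictions are accurate (|delta| near 0), epsilon decays toward eps_min."""
    alertness = 1.0 - math.exp(-k * abs(delta))  # saturating map into [0, 1)
    return eps_min + (eps_max - eps_min) * alertness

# Example: a fully predicted reward yields zero error, so exploration stays minimal.
delta = td_error(reward=1.0, v_s=0.5, v_next=0.0, gamma=0.5)
eps = alertness_epsilon(eps_min=0.05, eps_max=0.5, delta=0.0)
```

This is only an illustration of the exploration-exploitation coupling the abstract describes; the paper's actual regulation mechanism may use a different alertness signal.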
