Abstract
Cooperative adaptive cruise control (CACC) can improve the traffic efficiency and safety of a platoon on the road. Most traditional CACC methods rely on accurate mathematical models, while those based on deep reinforcement learning (DRL) suffer from long training times and poor convergence. In this context, this study proposes a CACC framework based on imitation learning (IL) and DRL, which aims to improve the car-following efficiency and long-platoon stability of connected autonomous vehicles (CAVs) in a mixed traffic environment. The method combines the optimization ability of model predictive control (MPC) with the adaptive learning characteristics of the soft actor-critic (SAC) algorithm. MPC serves as the expert, and a pre-trained policy network is obtained through imitation learning. This pre-trained network is then used to initialize SAC's actor network, which improves the training efficiency of the SAC algorithm. Numerical simulation results show that the improved DRL algorithm converges better during training. Compared with the baseline model, the proposed framework achieves higher reward, lower tracking error, and better platoon stability in the evaluation. In addition, the proposed model efficiently completes the car-following task under different CAV penetration rates.
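The two-stage idea described above (imitation learning from an MPC expert, then warm-starting the SAC actor) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the state layout, the linear-feedback stand-in for the MPC expert, and the network sizes are all assumptions.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

STATE_DIM = 3   # assumed: gap error, relative speed, ego acceleration
ACTION_DIM = 1  # longitudinal acceleration command

def expert_action(state: torch.Tensor) -> torch.Tensor:
    # Linear feedback controller standing in for the MPC expert
    # (hypothetical gains, not the paper's controller).
    k = torch.tensor([0.5, 0.3, -0.1])
    return (state @ k).unsqueeze(-1)

def make_policy() -> nn.Module:
    # Small MLP; the same architecture is reused for the SAC actor's mean head.
    return nn.Sequential(nn.Linear(STATE_DIM, 32), nn.ReLU(),
                         nn.Linear(32, ACTION_DIM))

# --- Stage 1: imitation learning (behavior cloning on expert rollouts) ---
policy = make_policy()
opt = torch.optim.Adam(policy.parameters(), lr=1e-2)
states = torch.randn(512, STATE_DIM)      # stand-in for recorded states
targets = expert_action(states)           # expert (MPC) actions to imitate
for _ in range(300):
    loss = nn.functional.mse_loss(policy(states), targets)
    opt.zero_grad()
    loss.backward()
    opt.step()

# --- Stage 2: initialize the SAC actor with the pre-trained weights ---
# SAC training (critics, entropy temperature, replay buffer) would then
# fine-tune this warm-started actor; that loop is omitted here.
actor_mean = make_policy()
actor_mean.load_state_dict(policy.state_dict())
```

The warm start matters because a randomly initialized SAC actor spends many early episodes exploring unsafe or inefficient car-following behavior; starting from a cloned MPC policy skips that phase, which is the source of the faster convergence the abstract reports.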
