Abstract
Faced with rapidly growing demand for instant delivery, traditional logistics delivery modes struggle to keep pace because of capacity constraints. Autonomous delivery vehicles (ADVs) can compensate for the shortage of human labor. Because ADVs rely on batteries for propulsion, they must occasionally return to battery-swapping stations during delivery to maintain their state of charge. To apply ADVs to instant delivery, we use agent-based modeling to define behavioral rules for the customers, the ADVs, and the distribution center, thereby creating an instant-delivery scheduling simulation environment. A vehicle routing problem with time windows (VRPTW) model is formulated and solved with an adaptive large neighborhood search (ALNS) heuristic to optimize delivery scheduling. Because environmental conditions change dynamically, we use the Dueling Double Deep Q-Network (D3QN) deep reinforcement learning algorithm, which adapts to such changes, to train ADVs to make autonomous battery-swapping decisions. The proposed model is compared with several benchmark policies, including threshold-based strategies, alternative reinforcement learning algorithms, and a fixed strategy in which the ADV swaps its battery on every return to the distribution center. Simulation experiments based on real-world cases show that the proposed model achieves better results: it reduces delay time by approximately 17.55% relative to the average delay of the benchmark policies and decreases the number of battery swaps by approximately 49.06%. Furthermore, the model adapts well to the dynamically changing simulation environment.
