Bridging gaps in intelligent truck dispatching: An underexplored PPO-based model with expanded feature integration within open pit mines

Abstract

The mining industry is increasingly adopting intelligent systems to address challenges such as declining ore grades, rising operational costs and sustainability demands. This study presents a Proximal Policy Optimisation (PPO)-based truck dispatching model designed to enhance operational efficiency in open-pit mining. Addressing two key research gaps – limited integration of dispatching features and underutilisation of advanced reinforcement learning (RL) algorithms – the proposed model incorporates 19 critical features and is evaluated against the conventional Fixed Schedule (FS) method. A discrete event simulation environment was developed to emulate an open pit case study with heterogeneous trucks and shovels. The PPO model demonstrated convergence within 3.5 h and outperformed the FS baseline across multiple key performance indicators, including a 5.7% increase in total production, 4.2% improvement in plant delivery, and 13.2% higher truck utilisation. Compared to widely used RL algorithms in this domain, the PPO approach achieved faster convergence despite handling a more complex feature set. These findings highlight the potential of PPO as a robust and scalable solution for intelligent dispatching, offering practical benefits for Mining 4.0 initiatives.

Keywords

Truck dispatching, simulation, PPO, fleet management, open pit mining
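
For readers unfamiliar with PPO, the clipped surrogate objective at the core of the algorithm can be sketched in a few lines. This is a generic illustration of standard PPO, not the dispatching model described in the abstract: the function name `ppo_clip_objective`, its arguments, and the clipping value `clip_eps=0.2` are assumptions chosen for the example, since the abstract does not report hyperparameters or implementation details.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch (to be maximised).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy. advantages: advantage estimates.
    clip_eps is an assumed value, not one taken from the study.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic choice: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

The clipping removes the incentive to move the policy far from the one that collected the data in a single update, which is the property usually credited for PPO's stable training.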

