Sage Journals: Discover world-class research

Abstract

Pushback optimisation, an early phase of strategic open pit planning, shapes the mining phases that guide extraction over the life of the mine and strongly influences operational feasibility and economic value. The task unfolds in two steps: first, a set of nested pits is generated; second, a subset of these pits is chosen to define the pushbacks. Selecting pushbacks is difficult because the selection must satisfy several operational criteria and affects the production schedule, and thus the project's net present value (NPV). Existing tools offer partial help but still depend on manual judgment and rough schedules, which often prevents optimal solutions. This study presents an automated pushback-selection method built on reinforcement learning (RL). An RL agent learns, through interactions with the nested-pit environment, to select the pushback set that maximises NPV while respecting key constraints: minimum mining width (MMW), waste-to-ore ratio, balanced tonnage swings, and full use of mining and processing capacities. The framework is tested on the publicly available McLaughlin mine dataset, with emphasis on maintaining the MMW constraint. Results show that the RL approach is efficient and economically superior: it produces pushbacks whose NPV is 9% higher than that obtained by mining every nested pit sequentially. These outcomes underscore RL's promise for automated, constraint-aware pushback optimisation.
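To make the second step of the task concrete, the following minimal Python sketch scores candidate pushback selections on a toy problem. It is not the authors' method: exhaustive enumeration stands in for the learned RL agent, and every number and name in it (`INCREMENTS`, `RATE`, `MIN_WIDTH`, the one-year-per-increment schedule, the width-sum feasibility check) is a hypothetical assumption chosen for illustration, not data from the McLaughlin mine.

```python
from itertools import combinations

# Hypothetical increments between consecutive nested pits (illustrative only):
# (cash flow in $M, width in m). Each increment is assumed to take one year to mine.
INCREMENTS = [(50.0, 30.0), (40.0, 60.0), (30.0, 30.0), (20.0, 60.0)]
RATE = 0.10        # assumed annual discount rate
MIN_WIDTH = 50.0   # assumed minimum mining width (MMW) in metres


def pushbacks_from(boundaries):
    """Group increments into pushbacks; `boundaries` are the selected pit indices."""
    groups, start = [], 0
    for end in boundaries:
        groups.append(INCREMENTS[start:end + 1])
        start = end + 1
    return groups


def is_feasible(groups):
    """Toy MMW check: the combined width of every pushback must reach MIN_WIDTH."""
    return all(sum(w for _, w in g) >= MIN_WIDTH for g in groups)


def npv(groups):
    """Discount cash flows, spreading each pushback's value evenly over its years."""
    total, year = 0.0, 1
    for g in groups:
        yearly = sum(cf for cf, _ in g) / len(g)
        for _ in g:
            total += yearly / (1.0 + RATE) ** year
            year += 1
    return total


def best_selection():
    """Enumerate pit subsets that end at the ultimate pit; keep the best feasible one."""
    last = len(INCREMENTS) - 1
    best = None
    for k in range(last + 1):
        for chosen in combinations(range(last), k):
            boundaries = list(chosen) + [last]
            groups = pushbacks_from(boundaries)
            if is_feasible(groups):
                value = npv(groups)
                if best is None or value > best[0]:
                    best = (value, boundaries)
    return best


if __name__ == "__main__":
    value, boundaries = best_selection()
    print(f"selected pits: {boundaries}, NPV: {value:.2f} $M")
```

In this toy setting, using every nested pit as its own pushback violates the assumed width threshold, so the search has to merge increments. Enumeration is only workable for a handful of pits; the number of subsets doubles with each additional nested pit, which is one reason a learned selection policy is attractive.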

Keywords

pushback selection, optimisation, reinforcement learning, strategic mine planning, open pit mining



Automating and optimising pushback selection using reinforcement learning
