Abstract
Bearings are essential elements of rotating machinery, and their malfunction can cause considerable operational interruptions and financial loss. This paper investigates Proximal Policy Optimization (PPO), a reinforcement learning (RL) technique, for formulating data-driven bearing-maintenance policies. A custom OpenAI Gym environment was developed to replicate the maintenance decision-making process using experimental vibration data from normal bearings as well as bearings with ball, inner-race, and outer-race faults. The RL agent was trained to select appropriate maintenance actions (inspection, repair, and replacement) so as to reduce total cost and prevent breakdowns. Training performance was evaluated using key metrics: cumulative reward, policy loss, KL divergence, and value loss. The experimental findings show that the PPO agent achieved 94.2% decision-making accuracy within 10 epochs, with limited improvement from additional training. However, the method exhibited instability in policy updates and value loss, as well as sensitivity to the sparse-reward structure. These findings indicate that PPO holds considerable potential for vibration-based condition-based maintenance (CBM), although its performance in real-world operational environments remains highly dependent on reward design and hyperparameter tuning. This research presents a balanced evaluation of PPO's strengths and limitations in bearing maintenance and provides a foundation for future studies on hybrid and alternative reinforcement-learning strategies.
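To illustrate the kind of decision process the abstract describes, the following is a minimal sketch of a Gym-style maintenance environment. The state labels, action costs, degradation probabilities, and failure penalty are all illustrative assumptions, not the authors' actual values; in the paper, observations are derived from experimental vibration data rather than the raw state index.

```python
import random

# Hypothetical sketch of a bearing-maintenance environment.
# All numeric values (costs, probabilities, penalty) are assumed.
STATES = ["normal", "ball_fault", "inner_race_fault", "outer_race_fault"]
ACTIONS = ["inspect", "repair", "replace"]
ACTION_COST = {"inspect": 1.0, "repair": 10.0, "replace": 25.0}
FAILURE_PENALTY = 100.0  # sparse penalty for running a faulty bearing to failure


class BearingMaintenanceEnv:
    """Gym-style environment: the agent pays maintenance costs and is
    penalized when a faulty bearing is allowed to fail."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.state = "normal"

    def reset(self):
        self.state = "normal"
        return self._observe()

    def _observe(self):
        # In the paper the observation comes from vibration features;
        # here we simply expose the state index.
        return STATES.index(self.state)

    def step(self, action_idx):
        action = ACTIONS[action_idx]
        reward = -ACTION_COST[action]
        if action == "replace":
            self.state = "normal"      # replacement always restores health
        elif action == "repair" and self.state != "normal":
            self.state = "normal"      # repair clears an existing fault
        # Degradation: a healthy bearing may develop a random fault.
        if self.state == "normal" and self.rng.random() < 0.1:
            self.state = self.rng.choice(STATES[1:])
        # Sparse failure event while operating with a fault.
        done = False
        if self.state != "normal" and self.rng.random() < 0.05:
            reward -= FAILURE_PENALTY
            done = True
        return self._observe(), reward, done, {}
```

A PPO agent (e.g., from a library such as Stable-Baselines3) could then be trained on this environment once it is wrapped in the standard Gym interface; the sparse failure penalty above is the kind of reward structure the abstract identifies as a source of training sensitivity.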
