Abstract
Robust and safe industrial operations emphasize asset criticality, reliability and availability through effective maintenance which minimizes failures and costly overhauls. Existing maintenance strategies often fall short of interpreting the underlying mechanisms driving equipment degradation patterns. Prescriptive maintenance advances beyond failure prediction, generating tailored, granular and actionable strategies as knowledge-driven decision supports. Causality discovery is crucial in prescriptive maintenance through investigating root degradation causes and failure propagation, enabling evidence-grounded proactive interventions. However, systematic methodologies for uncovering and quantifying causality driving sensor data dependencies, temporal degradation progression and early warning indicator relationships remain underdeveloped. This paper presents a new causal discovery framework to facilitate prescriptive maintenance. It applies: (1) Transfer Entropy to identify features exhibiting temporal causal influence on degradation, (2) Dynamic Time Warping-based clustering to form “symptom groups” representing shared degradation behaviors, and (3) a deep learning architecture revealing intra-cluster causal relationships and temporal delays. Case studies demonstrate framework utility of transforming multimodal truck sensor data into explainable causal knowledge. Results indicate failure-prone interconnections on individual trucks and fleet-wide diagnostics where key triggers and their evolving roles across different degradation stages are presented. Optimized with expert calibration, this work delivers a prescriptive decision-support foundation to enhance equipment safety and promote operation sustainability.
Keywords
Background
Equipment maintenance is an integral component of industrial operations. It is recognized as a cooperative partner and a profit contributor in organizational planning, 1 given its direct impact on operational safety, asset life longevity, and cost control. Over the decades, maintenance strategies have expanded and diversified, transitioning from corrective maintenance where interventions occur only after failures, to preventive maintenance that follows scheduled servicing typically determined by equipment manufacturers, industry standards and engineer experience. The advent of Internet of Things (IoT), artificial intelligence (AI), and big data analytics has further propelled this evolution, leading to the rise of predictive maintenance. This strategy leverages data modeling and analytics to perform proactive health condition diagnosis and forecast potential failure occurrence, mitigating operational risk and elevating operational efficiency.2,3 However, its scope remains primarily on temporal failure predictions. This narrow focus, although valuable, limits its ability to provide insights into failure mechanisms and translate raw maintenance data into actionable maintenance solutions. To address this gap, prescriptive maintenance emerges as the most advanced paradigm.4,5 It represents an analytical framework that integrates sensor diagnostics, machine learning algorithms, and decision support systems. Beyond failure prognosis, its core capability lies in generating high-granular, contextually-aware maintenance directives that consider both equipment-level needs and broad operational constraints.
A building block of prescriptive maintenance framework lies in systematic approach to maintenance knowledge extraction. Prescriptive maintenance is also referred to as knowledge-based maintenance.6–8 Knowledge discovery synthesizes diverse data sources-sensor readings, past failure and repair records, the factors surrounding equipment, maintenance histories, operational parameters, and domain expertise to establish comprehensive understanding into equipment behavioral patterns. Given that engineering systems require quick and efficient repair or replacement to minimize the loss associated with production losses, lower-level knowledge discovery (i.e. part or component level beyond system level) is particularly important. The resultant knowledge base enables not only pattern identification and anomaly detection but, crucially, reveals the underlying mechanisms of degradation and deterioration across equipment pieces. This knowledge-driven characteristic rationalizes the implemented maintenance activities, as it ensures that the proposed recommendations are technically sound and operationally justifiable. From the organizational management perspective, the capacity of prescriptive maintenance to explain its directives through knowledge richness enhances stakeholder trust and enables continuous refinement of maintenance strategies, ultimately supporting sustained asset reliability.
Causality constitutes an important form of maintenance knowledge which directly maps to the physical mechanisms of failure propagation. It describes a directional relationship where one state (known as the cause) exerts a deterministic or probabilistic influence on another state (known as the effect), with this influence persisting under intervention or manipulation.
9
Since industrial equipment sensor data is typically time series data characterized by continuous temporal measurements and complex mechanical interactions, the “temporal precedence” principle,9,10 that is, the cause must precede its effect in time, should be used to describe causality in the present context. Granger
11
introduces a widely accepted definition of causality for time series data based on predictability: A variable
Despite the presence in various domains such as healthcare, 15 mineral economics, 16 and transportation, 17 causal discovery analysis in the industrial maintenance field remains a new area.18,19 Notably, studies on sensor data-driven causal discovery approaches is limited in scope.20–22 On the specific subject of equipment health monitoring and maintenance, research on causal discovery is comparatively thin as many present studies prioritize failure prediction over explainable causality for maintenance guidance. Pang and Lodewijks 23 introduces a Bayesian inference-based causal modeling method combined with fuzzy logic to diagnose failures and support maintenance decision-making in for conveyor belt braking system. Zhou et al. 24 introduces CausalKG, a knowledge graph integrating spot-inspection data and causal relationship knowledge, and develops the CausalKG-ALBERT model to enhance root cause analysis of equipment failures. Vanderschueren et al. 25 proposes a causal inference-based approach to optimize preventive maintenance frequency by learning maintenance effect on overhaul and failure rates followed by individualized and cost-effective maintenance schedules. Nadim et al. 26 combines machine learning, process mining, and human expertise to generate multi-level causal models for diagnosing abnormal events on real-world mill machine data. Gui et al. 27 develops a causal-factors-aware attention network to improve equipment fault prediction by incorporating causal contribution weights of features and time-aware attention mechanisms. However, the notion of causality here is primarily associated with feature contribution to prediction rather than uncovering degradation mechanisms. In general, existing methodologies face limitations. Many approaches are tailored to specific domains or rely heavily on predefined structural models, which constrains their generalizability and adaptability to diverse, data-driven maintenance contexts. The reliance on static causal structures, event data and rule-based frameworks limits their capacity to capture dynamic, time-varying causal relationships in high-dimensional sensor data that is critical for understanding progressive system degradation. Furthermore, methods that integrate probabilistic updates or intervention-based frameworks often focus on high-level causal reasoning without adequately modeling granular, temporal interactions between variables, and thus overlooking subtle degradation patterns. Some approaches also rely on strong prior assumptions or domain expertise before obtaining model-generated causal relationships, which can introduce bias and reduce robustness when applied to complex, multi-component systems with limited prior knowledge. Another critical problem is the lack of explicit mechanisms to compute causal delays between cause and effect. This practice is vital as it quantifies the temporal propagation of causal dependencies among degradation parameters, which is beneficial for early detection of degradation patterns. This in turn optimizes maintenance schedules and minimizes downtime. With respect to prescriptive maintenance, notably, related studies so far are mainly associated with the formation of theoretical structure and the design of a holistic analytical model.6,28–30 Systematic causal discovery approaches for temporal sensor data in prescriptive maintenance contexts remain underexplored.
This paper proposes a causal discovery framework to support sensor-based equipment prescriptive maintenance. The goal is to reveal underlying degradation mechanism from the provided data and build an evidence-based maintenance knowledge base facilitating maintenance decision-making. In essence, the framework integrates three complementary analytical techniques: (1) Transfer Entropy (TE), 31 (2) an clustering algorithm driven by Dynamic Time Warping (DTW)32,33 and (3) Temporal Causal Discovery Framework (TCDF), 34 a deep learning causal discovery architecture based on convolutional neural networks with attention mechanism. Each method has exhibited individual promise across various domains. TE has proven effective for causal detection in industrial maintenance settings,35–37 cryptocurrency forecasting, 38 neuroscience, 39 etc. DTW, originally developed to handle temporal distortion for speech recognition, has expanded to image analysis, 40 seismology, 41 human gait capture, 42 and maintenance-related studies.43–45 TCDF excels in temporal causal discovery for complex multi-variable time series, finding applications in both maintenance46–48 and other domains.49,50 However, effective prescriptive maintenance knowledge discovery presents multifaceted analytical requirements that individual methods would struggle to fulfill. Modern equipment generates high-dimensional sensor data characterized by complex temporal dependencies, nonlinear interactions, and asynchronous degradation patterns across assemblies, components, and parts. Prescriptive maintenance demands systematic extraction of actionable causal knowledge: identifying which sensor parameters causally influence degradation, understanding failure patterns and their organization, and quantifying specific directional causal relationships among these subsystems to enable informed, targeted, and proactive interventions. Deploying these techniques in isolation encounter inherent limitations when addressing these requirements. TE, while powerful for detecting information transfer, may identify relationships that lack practical relevance to failure dynamics when applied to unfiltered high-dimensional sensor streams, and provides limited insight into temporal pattern structures within identified causal features. DTW effectively captures temporal similarity but fundamentally lacks causal detection mechanisms. TCDF alone offers advanced causality discovery capabilities with delay quantification but could face computational and interpretability challenges when processing large-scale, heterogeneous sensor datasets without strategic feature reduction and organization. This methodological fragmentation leaves a critical gap: no systematic knowledge discovery approach exists for cohesively addressing causal feature identification while reducing dimensionality, temporal pattern organization, and granular causal relationship extraction within a cohesively arranged workflow tailored to prescriptive maintenance requirements.
To tackle the challenges aforementioned, this paper proposes a new modular approach establishing synergistic integration of the three aforementioned techniques. Specifically, the contributions are as follows:
A multi-stage causality mining framework that unifies TE, DTW, and TCDF within a unified methodological workflow, where each technique complementarily builds upon the preceding analysis, thereby transforming fragmented techniques into a scalable methodology;
A dual-scale causal investigation strategy that employs TE for causal feature identification with respect to system health index computed via a proposed approach, and TCDF for local causality analysis, where complex non-linear temporal dynamics among sensor measurements are captured, and causal delay information between cause and effect is computed, thereby supporting proactive maintenance planning;
A DTW-based clustering algorithm serving as a strategic intermediary that enables systematic hierarchical transition from macro to micro-scale causal analysis while effectively accommodating heterogeneous sensor response patterns, thereby structuring high-dimensional data into interpretable groups for targeted causality discovery; and
An application to real-world heavy truck sensor data, the SCANIA Component X dataset, 51 representing a new case study focused on causal discovery tasks, with results validating framework applicability and enabling the development of interpretable, degradation class-specific truck causal knowledge that systematically transforms multi-dimensional sensor data into structured, actionable decision supports based on extracted causal features, cluster plots, causal network graphs, and patterns.
The remainder of the paper is organized as follows. The second section provides detailed descriptions of the methodologies. The third section presents the case study along with results. The fourth section discusses the interpretation of the results and their practical implications. The final section concludes the paper and suggests avenues for future research.
Methodology
The developed causal discovery framework operates on structural, equal-sized, multi-feature datasets collected from equipment monitoring systems. Mathematically, one can denote the dataset
where

Diagrammatic illustration of the proposed causal discovery framework.
Identification of causal features for equipment health via transfer entropy
The first step selects features causally linked to system health degradation. These features are crucial for prescriptive actions as they directly influence system behavior when structural equipment failure datasets contain numerous parameters. Hence, this step simultaneously performs dimensionality reduction to enhance computational efficiency. In general, two strategies exist to reach this goal: transformation-based techniques and feature selection techniques. The former, such as Principal Component Analysis, 52 Independent Component Analysis, 53 and Non-Negative Matrix Factorization, 54 reduce computational burden through lower-dimensional representations but typically create new composite features through mathematical transformations of the original variables. This limits interpretability of individual parameter contributions—a consideration particularly relevant in maintenance applications where tracing system behavior back to specific components is crucial. As such, this research employs a direct reduction of the number of original features. However, common methods in this category such as Pearson correlation, 55 Spearman rank correlation, 56 or Mutual Information57,58 are primarily designed to capture statistical associations. Pearson correlation measures linear relationships between variables, Spearman rank correlation captures monotonic relationships, and Mutual Information detects both linear and non-linear dependencies between variables. These methods however do not reveal the causal relationships between features and the target variable as they do not account for the directionality or temporal dynamics required to infer causality.
This research applies TE to identify causal features. Introduced by Schreiber, 31 TE is based on the concept of Shannon entropy 58 to quantify directional information transfer between data series. It evaluates additional information gained about the target’s future by considering the source’s past, effectively isolating the source’s distinct causal contribution from the target’s inherent temporal dynamics. This explainability characterizes the causal influence from source to target. TE offers a non-parametric approach eliminating distributional assumptions, enabling analysis of complex system dynamics without predefined constraints. Unlike correlation-based methods or Granger Causality 11 which assumes static linear relationships and requires parametric models, TE quantifies directional information flow and captures non-linear, temporal dependencies, providing robust identification of influential features in high-dimensional equipment systems.
Mathematically, define
In the context of causal feature discovery, let
Then, TE from
The expanded form of equation (5) can be written as equation (6):
In fact,
Clustering causal features based on dynamic time warping
While TE reduces dimensionality and uncovers initial causality with respect to system degradation, a core challenge is sensor parameter measurements can respond asynchronously to system-wide failures. Sensor signals exhibit high variability due to diverse operational dynamics—components operating under varying demands, conditions, loads, or environments—complicating data comparison and interpretation. Additionally, they are often unaligned due to measurement errors, sensor failures, or data processing inconsistencies, which can result in partial signal loss, unsynchronized timestamps, or irregular patterns. Maintenance engineers typically diagnose by examining individual sensor outputs or following standardized troubleshooting protocols, but this strategy is labor-intensive and may fail to extract critical causal chains explaining system degradation, as these relationships remain hidden in noisy data streams.
To address these problems and optimize diagnosis management, this paper introduces a clustering module following causal feature selection. An algorithm empowered by DTW32,33 to cluster causal features based on temporal patterns is proposed. Since sensor data exhibit temporality and continuity, they contain trends and numerical patterns emerging during degradation. This paper argues that similar features with trends and patterns constitute a “symptom group” that conveys partial but unique and augmented diagnostic information linked to degradation. Each cluster is composed of signal sets presenting similar temporal behaviors and collectively responding to system degradation. In practical terms, the cluster can correspond to a group of performance parameters from the same system where the underlying time series show shared trends and patterns from task initiation to repair intervention. Alternatively, the cluster can include sensor series from different assemblies, subassemblies or parts that are physically distinct yet exhibit similar responses to the system-wide anomaly. For instance, data transmitted from the brakes, the radiator, and the cooling fan of a truck may display comparable patterns during an engine overheating event even though these parts are functionally separate. DTW-based clustering is chosen over conventional methods (e.g. k-means, hierarchical clustering) specifically for its capacity to handle temporal misalignment and varying degradation rates across sensors. Standard distance metrics fail to capture similarity in temporally distorted signals, whereas DTW’s elastic matching aligns degradation patterns regardless of temporal shifts. This clustering step assists maintenance teams in structurally exploring degradation-related causal relationships embedded within multisource noisy data through intra- and cross-cluster sensor examination. In what follows, the DTW method and the clustering algorithm are presented.
Dynamic time warping
DTW determines the similarity between two data series that are usually asynchronous through finding a globally optimal alignment, a.k.a. an optimal warping path. Also returned along with this path is the shortest distance between the two sequences, which quantifies the minimum cumulative cost of warping one series to align with the other. This distance effectively captures their similarity while accommodating potential shifts and distortions in time. The optimal warping path and the corresponding distance are coexistent; the path indicates how the two sequences are aligned, while the distance reflects the degree of similarity.
While DTW can accommodate series of different lengths, for simplicity, this paper considers series of equal lengths. As such, define in equation (7) the collection of causal features
where
The next step is to align the two series by computing a warping path that quantifies their similarity. Denote a warping path as a sequence
The total distance of
Here,
The minimum distance is computed using a dynamic programming approach, where a cumulative cost matrix
Effectively, the cumulative cost at each point is determined by adding the current distance between
The proposed clustering algorithm
The following introduces the proposed clustering algorithm:
DTW distance matrix establishment
Once the DTW distances between each series pair are obtained, these values form a distance matrix
Cluster formation through optimal number of neighbors
The algorithm now performs causal feature clustering via a neighbor-based approach. In particular, it automatically determines the optimal number of neighbors evaluated by silhouette score.
60
The silhouette score informs the quality of clusters by assessing how similar an element is to its own cluster compared to other existent clusters. For a given series
where
The algorithm begins by initializing the best silhouette score to −1 and the optimal number of neighbors to 1. The maximal number of neighbors
The algorithm performs initial clustering based on the current
A merging process refines the initial clusters to eliminate redundancies. This step employs a transitivity-based approach, wherein clusters exhibiting overlapping members are combined to generate coherent groupings.
The quality of the merged clusters is evaluated using the average silhouette score via the function
If only a single cluster is formed during any iteration, the algorithm terminates early by returning the current optimal configuration. Otherwise, if the calculated average silhouette score exceeds the current optimal score, the algorithm updates the clustering configuration accordingly. This process continues iteratively, and after all values of
The pseudocode in Figure 2 summarizes the algorithm. It is important to note that this algorithm is motivated by the proposed clustering algorithm in Mitici and De Pater 44 The present approach introduces several distinctions. The previous work focused on the clustering of health indicators for degradation modeling and Remaining Useful Life (RUL) estimation using normalized DTW distances based on graph-theory, where the number of neighbors is assigned statically. In this research, the algorithm is designed for grouping sensor measurements for a sounding causality discovery process. It develops an iterative process that allows that the definition of the number of the neighbors is based on user preference. Furthermore, the optimal number of neighbors is determined automatically based on a quantitative clustering quality computation.

Pseudocode of the proposed clustering algorithm.
Intra-cluster deep causality learning via attention-based convolutional neural networks
Following causal feature cluster obtention, the next step extracts actionable insights to guide maintenance activities and improve health monitoring. This research argues that identified features within each cluster are inherently linked through causal relationships due to shared causal responses to system failure evolution. These relationships can be obscured in raw sensor data by high feature numbers, data noise, and non-linear dependencies. Relationship complexity varies—causative interactions can be binary, ternary, or more complex causality chains. Causality chains involve multiple sensor series, systematically explaining how changes propagate over time. This cascading effect occurs sequentially, reflecting the sequential nature of failure propagation. Quantifying these temporal delays is critical for prescriptive maintenance, as delay values indicate the available intervention windows between detecting causal triggers and observing their downstream effects. For example, haul truck tire pressure abnormalities temporally precede braking system pressure increases, followed by brake pad deviations as pads wear rapidly due to increased demand. This simple example suggests a causative chain with time delay where the nature and timing of sensor responses to the fault mechanism(s) can vary. To illustrate further, in a hypothetical symptom group cluster identified in an excavator hydraulic system, a drop in hydraulic pressure is anterior to a reduction in actuator velocity, which in turn may lead to an increase in motor current as the system compensates for the loss in efficiency. Discovering such cascading mechanisms and examining temporal order among features is critical for understanding how factors influence one another, revealing event sequences contributing to equipment degradation.
Figure 3 depicts an illustrative graph of the causal relationships among four hypothetical sensor parameters. Each vertex (P1, P2, P3, and P4) represents a causal feature. Arrowheads indicate causal influence direction, while edge values indicate time delays between cause and effect. Zero implies instantaneous causal effects. For example, P1 causatively affects P2 with a delay of three timesteps. it is important to note that not all features within the same cluster would contribute equally to failure progression. Some features are primary drivers directly influencing system behavior, while others present less significance, only amplifying effects. This distinction can equally exist across observed clusters. Although clusters are based on temporal similarities, features from different clusters may share functional similarities or monitoring categories. Therefore, effective causality analysis should identify dominant factors or recurring elements most critical for maintenance from cross-cluster interactions. This refines explainability of extracted causal features, ensuring relevant elements are emphasized such that maintenance teams can concentrate efforts on critical aspects with improved efficiency.

An example of causal relationship graph.
To address the above challenges, this research applies attention-based convolutional neural networks known as TCDF introduced by Nauta et al. 34 This approach is selected due to its maintenance-specific advantages. Constraint-based methods (e.g. PC algorithm 61 ) can struggle with high-dimensional time series and cannot compute delays. Classical statistical approaches—whether requiring pre-specified causal structures such as Structural Equation Modeling 62 or assuming linear functional forms such as Granger causality—impose parametric assumptions incompatible with the complex, non-linear, and exploratory nature of machinery degradation dynamics. Conversely, the deep learning architecture of TCDF allows to automatically learn complex non-linear patterns from sensor data without parametric constraints. The attention mechanism quantifies feature causal contribution, while causal validation distinguishes true causality from spurious associations. Furthermore, the temporal delay computation uniquely provides the intervention timing windows essential for proactive maintenance. TCDF is a composite architecture of convolutional neural networks (CNN) 63 that can extract causative relationships from multivariate data. A CNN is a deep learning model built on convolutional layers. CNNs apply filters (i.e. kernels) to detect temporal patterns and dependencies in the data, capturing both local and global behaviors over time. This structure allows efficient processing of multi-dimensional data and identification of pertinent patterns. The ability to automatically learn features without manual extraction makes CNNs particularly suitable for detecting subtle temporal relationships. Additionally, CNNs benefit from parameter sharing, which reduces computational load when handling complex data streams. The following provides an overview of the core TCDF mechanism in the context of this research. Readers are encouraged to consult Nauta et al. 34 for more details.
TCDF aims at analyzing the potential causal relationship among multiple sensor features and determining the time delays between causes and effects, if any. For each CNN used under TCDF, the input is complete cluster-specific causal features. Let
Here,
Figure 4 illustrates the causal discovery workflow of TCDF using “TCDF 1” from Figure 1 as an example.

Mechanism by which TCDF achieves deep causal discovery (adapted from Nauta et al. 34 ).
Case studies
This paper applies the proposed causality discovery methodology to the publicly available SCANIA Component X dataset 51 to validate its effectiveness in a real-world context.
Introduction to dataset
SCANIA Component X is a large-scale, multivariate dataset recently published in the field of maintenance prognosis. This dataset involves a range of data types that provide a comprehensive view of health, performance, and maintenance history of an engine component, all collected from SCANIA heavy trucks operating under varying conditions. In essence, the dataset covers information of three perspectives explained as follows:
Due to proprietary reasons, some data handling measures have been applied in the original dataset. For example, sensor feature names in the operational records are fully anonymized, each truck is assigned a unique ID instead of its actual identity number, and data are subjected to scaling and transformations.
Study design and multi-stage data process
General preparation
The case studies aim to discover features causally influencing Component X degradation in each investigated truck, identify symptom group clusters, and explore key factors and causal relationships reflecting how these features interact and contribute to the degradation process. This requires run-to-failure data—temporal records capturing equipment’s full operational lifecycle. Among available training, validation, and test datasets, the training set is selected since it includes uninterrupted operational records for each truck and exclusively possesses failure time records. To align with study objectives, only operational sensor records from trucks experiencing recorded failures (repair indicator = 1) are retained, resulting in 105,852 temporal sensor records from 2272 failed trucks. Component X is a complex system recorded by 105 anonymized features: eight numeric counters providing accumulative condition measurements, and 97 features derived from six histogram variables aggregating sensor readings into discrete bins based on predefined operational conditions or measurement ranges. Each bin feature captures measurement frequency at each timestep, offering integrated operational pattern views over time. The preprocess continues with low-variance feature removal followed by NaN value imputation using linear interpolation, which preserves degradation trends without introducing bias. Low-entropy features are then removed by calculating entropy based on equation (2). While low-variance removal targets minimal temporal variation, low-entropy removal focuses on insufficient value distribution diversity. These strategies altogether preserve only informative, dynamic degradation-related features, with the low-entropy threshold set at entropy value 10 based on Figure 5.

Feature entropy distribution.
A key task is to categorize the trucks in the training dataset into distinct classes based on the temporal proximity of the last operational readout to the failure event. In fact, Kharazian et al. 64 introduces a classification scheme for the validation set. The scheme divides the failed trucks into five categories defined by specific time windows leading up to the failure. These time windows are as follows: (1) more than 48 timesteps, (2) 48 to 24 timesteps, (3) 24 to 12 timesteps, (4) 12 to 6 timesteps, and (5) 6 to 0 timesteps before the failure. Each case corresponds to a class label ranging from 0 to 4. By adopting the same scheme, the difference between the duration of operational time and the last recorded timestep is computed for each truck in the training set, followed by the assignment of correct class to the truck. The distribution of severity classes of 2272 trucks is plotted in Figure 6. For the case studies, Class 0 truck units are excluded due to the smallest size and the lowest severity. For the remaining classes, 4 trucks from Class 1, 6 trucks from Class 2, 10 trucks from Class 3 and 23 trucks from Class 4 are randomly selected. More trucks are selected from Classes 3 and 4 than from Classes 1 and 2 to reflect the greater criticality of later classes due to their proximity to failure. This selection approach prioritizes methodological clarity and practicality while ensuring sufficient representation of critical degradation-related cases. Table 1 lists the IDs of the selected trucks. The causal discovery framework will be applied to the trucks within each class independently to ensure detailed examinations of the degradation patterns and causal relationships specific to each subset.

Class distribution of trucks in the training set.
Trucks selected for the case studies.
Truck-wise health quantification
Due to noise and high variability in the raw data, Health Indicator (HI) can be used to quantitatively define the degree of tear and wear of the system within a given timeframe. This research considers virtual HI through fusing multi-sensor signals. 65 Initially, the Root Mean Square (RMS) values for each feature series are computed. RMS is a commonly adopted approach to build HI.66,67 It quantifies the average energy content in the signal. 68 RMS is not sensitive to sudden signal fluctuation and increases as failure progresses, 69 which renders this approach appropriate to simulate long-term declining trend. Mathematically, RMS for a discrete signal input can be defined in equation (22) as 69 :
where
To construct the final HI, an Exponentially Weighted Moving Average (EWMA) technique is applied. EWMA assigns higher weights to recent RMS values while accounting for historical fluctuations through exponentially decaying weights, enabling the HI to capture both short-term variations and long-term degradation trends. A smoothing factor
where
Figure 7 illustrates HI versus time-before-failure (TbF) for trucks 14470 and 21007 as examples.

HI versus TbF for truck 14470 and truck 21007.
Monotonicity and trendability of the two trucks.
Table 2 lists the monotonicity and trendability values for the two trucks; the calculations are based on normal time progression instead of TbF. Results confirm the viability of the proposed HI calculation strategy.
Stationarity test
TE calculation traditionally requires that data is stationary
31
— time series with statistical properties unchanged over time. Since the SCANIA Component X dataset exhibits non-stationary characteristics with observable rising trends approaching failure, a systematic workflow is designed in this study to transform data into stationary forms. Three complementary tests are involved: Augmented Dickey-Fuller (ADF, null hypothesis: non-stationary),
71
Kwiatkowski-Phillips-Schmidt-Shin (KPSS, null hypothesis: stationary),
72
and Levene’s test (detecting variance instability).
73
Series are classified as stationary only when passing both ADF (p<0.05) and KPSS (p>0.05) tests which primarily focus on trends and autocorrelation detection. For non-stationary series, Levene’s test (p
Framework implementation and results presentation
The elaborated operational data of each selected truck is passed to the developed causality discovery framework. In the first module, TE is calculated as an averaged variant of local TE using Python library PyInform. 75 This TE realization embeds the past states of the target series (the HI series in this research) over a chosen number of history lags specified by the user, whereas only a lag of 1 is embedded for each source series. 31 Due to the anonymization of the dataset and the absence of additional domain-specific insights that could guide the selection of optimal lag values, the case studies examine target history lengths between 2 and 5 for each truck for methodological demonstration purposes. the case studies assume a lag history between 2 and 5 for each truck for methodological demonstration purposes. Determined through empirical testing, this range is found to be able to retain approximately 10%–30% of the features from each truck’s initial dataset, which is generally considered reasonable in the sense of achieving a balance between dimensionality reduction and meaningful information preservation.
Posterior to TE computation on each feature with respect to the target series, a significance test is required to eliminate bias or spurious relations 59 in order to determine genuine causal features. The null hypothesis states that there is no causal relationship between the source and the target. The significance test follows the procedure described in Lizier et al. 76 For each feature, surrogate datasets are created through random permutation (1,000 times in this research) to simulate the null hypothesis. This approach is valid given the history embedding of 1 in the source features. 59 Next, TE is computed for each surrogate dataset, and as such, a reference distribution under the null hypothesis is established. This step is followed by the derivation of the p-value by comparing the observed TE against the surrogated distribution. This way features with p-values below the defined significance level (0.1 in this research) are considered statistically significant causal features.
Afterwards, the obtained causal features are clustered through the proposed algorithm. In the case study, DTW computations are realized via DTAIDistance library 77 which defaults to squared Euclidean distance metric. In the clustering stage, the maximum number of neighbors to be tested is set as 5. Figure 8 show four examples as the result of the algorithm execution, where certain causal feature clusters associated with four trucks from each class are visualized.

DTW-aligned causal feature Cluster 1 of truck 14470 from Class 1, Cluster 6 of truck 3420 from Class 2, Cluster 1 of truck 16327 from Class 3, and Cluster 1 of truck 17107 from Class 4.
During the intra-cluster causality discovery phase, the TCDF is trained for 1,000 epochs on 70% of the dataset for each truck, with the remaining 30% used for prediction evaluation. All experiments employ Adam optimizer with a learning rate of 0.01. The kernel size and the dilation coefficient are both set as 2. For each model, a single hidden layer is considered to enhance prediction performance (compared to models without any hidden layer) while minimizing the risk of overfitting (compared to models having deeper architectures). The mean absolute scaled error (MASE) averaged over all feature series and its standard deviation (SD) are adopted as the evaluation metrics for model prediction. When validating significant causal features, the study follows the recommended

Causal relationship graph of Cluster 2 of truck 14370.

Causal relationship graph of Cluster 1 of truck 3420.

Causal relationship graph of Cluster 1 of truck 16327.

Causal relationship graph of Cluster 1 of truck 5865.
Tables 3 to 6 present the complete results from applying the three modules of the proposed framework to all trucks investigated in each class, with each table corresponding to one class (e.g. Table 3 for Class 1, Table 4 for Class 2, and so on). Furthermore, the case studies propose two metrics that are used to evaluate the influence of each feature series on the obtained causal network in each class. Essentially, the metric in equation (27) evaluates the proportion of a feature series acting as a cause in the obtained complete causal relationships, whereas the metric in equation (28) quantifies the overall involvement of a feature series in the causal network by considering its contributions as both a cause and an effect. For each class, the distributions of feature series contribution based on the two metrics are plotted in four figures from Figures 13 to 16 respectively. Also can be found are summaries that highlight the key discoveries for each class, including influential feature series, recurring causal patterns, and notable trends derived from the causal analysis.
Causal feature identification, clustering, and intra-cluster analysis results for Class 1 trucks.
Causal feature identification, clustering, and intra-cluster analysis results for Class 2 trucks.
Causal feature identification, clustering, and intra-cluster analysis results for Class 3 trucks.
Causal feature identification, clustering, and intra-cluster analysis results for Class 4 trucks.

Contribution of feature series as causes, and total contribution of feature series as causes and effects, for selected Class 1 trucks.

Contribution of feature series as causes, and total contribution of feature series as causes and effects, for selected Class 2 trucks.

Contribution of feature series as causes, and total contribution of feature series as causes and effects, for selected Class 3 trucks.

Contribution of feature series as causes, and total contribution of feature series as causes and effects, for selected Class 4 trucks.
The 459 series serves as the secondary key factor in the extracted causal relationships. Contributing 29.63% as causes, its features exert equal causal influence on the 397 series and within the series, each accounting for around 11% of the total causal relationships. In truck 1732, causal interactions are reported among 459_15, 459_16, and 459_17 with no delays (459_15 causes 459_16 and vice versa, and 459_17 causes 459_16). In the same truck it is also found that 459_15 causes 397_30 with a delay of one timestep. Additionally, 459_11 causes 291_0 with zero timestep in the first cluster of truck 14370, and 459_0 causes 100_0 and 397_27 with no delay in its second cluster. These findings illustrate the dual role of the 459 series in causally driving both intra-series and inter-series interactions across clusters.
The remaining series play less impactful roles, with the 291 series being 11.11% and the others being less than 4% in terms of their cause-effect contribution rates. The 167 and 666 series are only observed in serving as effects in the extracted causal relationships in this class.
A large portion of the remaining causal relationships involve the 459, 272, and 167 feature series, each with a cause-effect contribution rate of approximately 11% of all discovered relations. Notably, compared to Class 1, the cause-effect contribution rates of the 272 and 167 series have increased by over 7% each in Class 2. Their interactions mainly consist of causalities within each series and between the two series. Conversely, the cause-effect contribution rate of the 459 series decrease by nearly 16%. The 459 series primarily exhibits local causality instead of system-wide interaction. With the 397 series, some instances include 459_5 causing 397_13 in truck 1872 and 459_5 causing 397_24 in truck 4686. Additionally, 459_16 shows mutual causality with 397_21 in truck 3420, alongside interactions with the 291 series in trucks 1872 and 4686.
Class 2 introduces three new feature series—171, 837, and 427—each with a cause and cause-effect contribution rate of approximately 2.78%, 1.39%, and 1.39%, respectively, in the overall causal relationships. These series may offer additional insights into the component X’s dynamics in this particular class.
Key feature series-wise, the 167 series demonstrates an increasing impact among the extracted causal relationships, with its presence as causes rising to 15.31% and its overall presence rising to 14.80%. Notably, it is observed that the 167 series features often function as causal intermediaries that causally link multiple features in various contexts. For example, in the second cluster of truck 721, 167_2 causes 272_1 and 167_2 causes 272_0 without delay. Simultaneously, 272_1 causes 167_2 without delay. Another example is in the first cluster of truck 16327, where 167_3 causes 158_9 with a delay of one step and 167_4 causes 167_3 with no delay, forming a causal chain. Nevertheless, this relaying role is not universal, as in some cases the 167 series features act as a localized or terminal node, such as a bidirectional causality between 167_1 and 272_0 in the first cluster of truck 3880, or a unidirectional causality from 397_35 to 167_8 with three-step delay in the fifth cluster of truck 14639.
The 291 series becomes increasingly noticeable role in Class 3 causal relationships. It shows a nearly 10% increase in its cause-effect contribution rate, and its role as causes is further strengthened with a percentage rise from around 5.56% in Class 2 to 16.33% in Class 3. Examples include 291_0 causing 100_0 in the second cluster of truck 370, as well as 291_10 causing 171_0 in truck 23507. Additionally, mutual causality loops involving the 291 series are observed in various trucks such as truck 1472, truck 3880, and truck 14639.
The 397 series shows a decline in influence compared to Class 2. Its presence as causal roles drops to around 28.57% and its cause-effect contribution rate drops to around 29.08%. The majority of 397 series features function as causes in the extracted causal relationships. For example, in the first cluster of truck 721, 397_20 causes 158_5 with no delay. In truck 3880, 397_15 causes 456_16 with a delay of three steps and 837_0 with no delay. Some inter-series relations are also observed, such as mutual causality between 397_21 and 397_15 in truck 3880, and additional interactions in Truck 16327.
The 459 series shows a slight decrease in its overall presence, dropping from 11.81% in Class 2 to 10.20% in Class 3. Nevertheless, its influence experiences a substantial reduction compared to its nearly 28% cause-effect contribution rate and its nearly 30% cause contribution rate in Class 1. In contrast, the contribution of the 158 series remains stable at around 9.69% in overall causal relationships and around 9.18% as causes, which shows little change from its combined cause and effect contribution rate of 9.72% in Class 2. Compared to Class 1, the 158 series plays a more critical role in the causal network, with both its cause and cause-effect contribution rates nearly 6% higher than those observed in Class 1. Features from the 158 series appear in multiple trucks, such as 158_5 causally influencing both 397_20 and 397_22 in truck 721, and 158_9 in truck 16327 causally influencing 397_0 and 291_4.
Compared to Class 2, two new feature series appear in the discovered relations: the 309 and 835 series. These series play a minor role as both their cause and cause-effect contribution rates are approximately 1.02% of the total relations.
Recurring causal relationships in Class 4 trucks.
In respect of key feature series, the 397 series once again plays a dominant role. Its cause-effect contribution rate reaches 43.41%, markedly higher than that in Class 3. In particular, the feature 397_28 has the highest appearance of 17 times in the discovered causal relationships. In general, it is found that the 397 series principally interact with 459 and 167 feature series, such as 397_22 causing 167_8 with a delay of three timesteps in the first cluster of truck 5865, 397_24 causing 167_2 with a delay of one timestep in the second cluster of truck 9639, and 397_27 causing 459_2 with no delay in the third cluster of truck 18564. Furthermore, inter-series causalities are observed.
The 158 series demonstrates a unique prominence in Class 4. Its cause contribution rate and cause-effect contribution rate increases to 14.09% and 12.50%, respectively, which surpass those in the other three classes. Its features mainly causally drive those from the 397 series, the 167 series, and the 158 series itself, which in total account for 25 causal relationships out of 220 relations reported. Furthermore, their role as effects is also captured, particularly in the relations that are associated with the features from the 666, 459, and 272 series. Notably, features from 158 series often function as a bidirectional influencer, such as the mutual causality between 158_1 and 167_3 as well as 397_7 in truck 4426, and that between 158_8 and 459_10 in truck 26703.
The 167 series shows a decrease in its overall presence in Class 4 compared to Class 3, but its role as a connector in fault propagation is more pronounced, since its features are more embedded in extensive causal networks. For instance, in the first cluster of truck 4426, 167_6 being a coordinating node causes 158_9 and 272_5, with bidirectional interactions also observed between 167_6 and 272_5. In the second cluster of truck 9639, 397_25, 397_24, and 291_6 cause 167_2 with delays of two, one, and zero steps, respectively, whereas 167_2 causes 397_24 with a delay of two steps. In addition, in the second cluster of truck 31303, 167_2 and 272_1 share bidirectional causal links; meanwhile, 167_2 also causes 272_0, which indicates its influence extends across related features.
The 291 series shows a sharp decline in its presence, contributing only 5.23% to the extracted causal relationships, an apparent reduction from its higher cause-effect contribution rates in Class 1 (11.11%), Class 2 (8.33%), and Class 3 (17.35%). While this series continues to act as a local driver, interactions with other series are less frequent, with exceptions such as 158_9 causing 291_10 with no delay in truck 9997 and 100_0 causing 291_4 with a two-step delay in truck 12862. On the contrary, Class 4 witnesses a resurgence in the influence of the 459 series. Its cause contribution rate and cause-effect contribution rate rise to 18.18% and 17.05%, respectively, regaining its status as the second most critical feature series. Most of the 459 series features continue to exhibit within-group causalities, typical examples being the sixth cluster of truck 25117 and the sole cluster of truck 33056 where all discovered causal relationships include 459 series features. Inter-feature interactions involving the 459 series are less common. Examples include 459_11 causing 837_0 with no delay in the second cluster of truck 3929, 167_0 causing 459_17 with a delay of one step, and 397_12 causes 459_14 with a delay of three timesteps in the first cluster of truck 9639.
In addition, the case studies find some causal relationships that are present in more than one class. They are reported in Table 8.
Recurring causal relationships existing in more than one class.
To summarize, the 397, 459, 291, 167, and 158 series in the present case studies appear to be the principal players in the extracted causal relationships of the selected trucks in four classes. They may be treated as warning indicators for maintenance personnels. The 397 series takes the lead on the whole in terms of its cause and cause-effect contributions spanning both intra- and inter-series interactions, despite a diminished existence in Class 3. The 459 series is also involved in intra- and inter-series interactions, but its influence declines in the first three classes before regaining the status as secondary contributor to the causal relationships in Class 4. The 291 series shifts from overall moderate contributions in classes 1–3 to a more limited participant in Class 4. Its surging cause and cause-effect contribution rates and active involvement in causal chains in Class 3, however, may denote its importance as a signature series for that class. The 167 series transitions from being a minor contributor in early classes to a crucial connector to later classes where its role as a bridge enables causality pathways. Lastly, the 158 series shows a continuously rising causality contributions across the four classes with the peak appearing in Class 4. As such, it may act as an identifier of last-phase degradation dynamics of Component X. With respect to the given tasks and data, maintenance teams may opt to prioritize these five series and their associated recurring patterns for further diagnostic purposes while not overlooking the potential effect of less influential series (e.g. 171 and 837 series) on the failure of Component X.
Result discussion and implication
Results from TE application demonstrate the method’s capability in both dimensionality reduction and causal feature identification, while exhibiting variability in lag value selection and identified feature numbers. Higher lag values generally identify more causal features (evident in Classes 1–2), although Classes 3 and 4 show more complex patterns. For instance, in Class 3, truck 3880 with lag 3 identifies 18 features, surpassing truck 721 with lag 5 finding 14 features. Similarly, in Class 4, truck 9639 with lag 5 identifies 23 features, yet trucks 25117 and 5865 with lag 3 achieve comparable results. This suggests diminishing returns beyond certain thresholds. Higher lags capture deeper dependencies but risk introducing noise or ignoring recent critical patterns, while lower values preserve short-term patterns but sacrifice historical information. The same logic applies to causal feature quantity: comprehensive sets provide richer diagnostic signals but may introduce redundancy, whereas targeted subsets enable focused interventions but limit explanatory power. While computing the ideal lag value and causal feature number is not the aim of this research, this variability implies that no universal parameterization exists. Effective TE causal feature selection requires adaptation to specific operational needs, maintenance records, and desired analytical granularity for each system, even within homogenous fleets. For prescriptive maintenance, this adaptability requirement suggests effective strategies should be tailored to both equipment characteristics and organizational constraints: fewer features may enable rapid response protocols, while comprehensive sets support detailed degradation mechanism analysis for strategic planning.
The clustering performance showcases both strengths and limitations of the DTW-based algorithm. The algorithm manages to isolate symptom groups with shared temporal patterns, correctly aligning abnormal oscillations (e.g. sharp fluctuations in the final 50–100 time units before failure in trucks 3420, 16327, and 17107) that potentially signify subsystem responses to degradation. The algorithm maintains functionality despite noise and signal variability However, limitations appear in such cases as trucks 14470 and 17107 where signals cluster imperfectly (e.g. feature 272_4) or exhibit weak synchronization, reflecting sensor data realities due to inherent response differences, operating condition variations, and measurement errors (e.g. sensor malfunctions). Nonetheless, maintenance applications prioritize overall trend identification over perfect alignment, and the algorithm robustly captures broader trends essential for degradation characterization and subsequent causality analysis. Numerically, silhouette score evaluation quantify clustering quality, revealing strategic implications for maintenance planning, Higher scores (e.g. 0.88 for truck 1659) indicate cohesive, well-separated clusters ideal for identifying clear subsystems or fault mechanisms. Lower scores (e.g. 0.08 for truck 1872) reflect weaker cohesion or overlap possibly caused by data variability or inferior data quality. Cluster counts ranging from one to seven reveal relationship complexity. This clustering diversity translates directly into strategical differentiation: in practice, fewer clusters with high scores suggest densely correlated sensor behaviors pointing to simpler diagnostic processes on smaller subsystems, while more clusters with low scores indicate diverse sensor responses requiring additional scrutiny to identify failure patterns. While not all clusters are perfectly defined and cluster numbers may not align directly with physical components, this does not undermine utility. This intermediary provides maintenance personnel with a new computational perspective. It augments domain knowledge by revealing hidden signal relationships that support systematic causal discovery and improved diagnostics.
TCDF’s causal discovery demonstrates strong prediction performance with low MASE and SD values across most clusters, validating identified relationships. Exceptions with elevated metrics (e.g. first cluster of truck 1732 in Class 1) indicate instances where noise or unpredictability limits temporal dependency learning. Varying causal relationship counts within clusters provide prescriptive insights: high numbers (e.g. 13 in truck 3880’s first cluster) represent complex cascading mechanisms requiring careful maintenance prioritization. Conversely, fewer relationships may indicate isolated or straightforward propagation pathways, simpler to diagnose but potentially not capturing complete causal flows. Importantly, not every TE-identified feature induces intra-cluster causality, which demonstrates TCDF’s complementary refinement power in isolating truly relevant features. This interpretable prediction-validation mechanism aligns with the viewpoint of Pashami et al. 78 that explainability within predictive models fosters confidence in prescriptive maintenance solutions.
Causal relationship interpretation is crucial for prescriptive maintenance realization. These relationships constitute the knowledge base that explains failure formation and propagation, showing how sensor behaviors transmit across subsystems with observable delays and supporting proactive, targeted maintenance strategies. Three relationship types emerge as pivotal. Unidirectional relations reveal sequential fault propagation and clear causal chains useful for identifying primary triggers and intervention points. Bidirectional relations represent mutual causality indicating interdependence between sensor-monitored components. They can result from genuine feedback loops or shared but hidden confounders, which requires more judicious examination of possible shared systemic influences. Self-causal relationships signify autoregressive tendencies or persistent fault states useful for prioritizing inspection, recalibration, or early warning systems. Recurring relationships across multiple units represent diagnostic commonalities that, despite low recurrence frequencies (majority being 2, maximum being 3), effectively highlight problematic sources and local causal chains across machines with varying service conditions, benefiting fleet health management and part inventory optimization.
Temporal delays provide physically interpretable intervention timing for maintenance planning. Delay values can be interpreted as mechanical response times, wear progression speeds or cascading failure sequences depending on the systems and sensor types involved. Non-zero delays (e.g. 397_28 causes 459_17 with two-timestep lag shown in Table 7) could suggest a measurable lag in how changes in one subsystem manifest in another, providing intervention windows for inspection or corrective action before downstream effects occur to reduce further damage. Zero-delay relationships (e.g. 167_2 causes 272_1 in Table 8) could indicate tightly coupled components requiring simultaneous monitoring, which would necessitate real-time response protocols or increased sampling rates for early detection as causality propagates faster than the measurement frequency can resolve. The delay information also informs maintenance resource allocation—longer delays permit scheduled interventions during planned downtime, whereas shorter delays require rapid-response capabilities. However, misinterpretation of delays can lead to unnecessary resource consumption (too early) or untimely intervention (too late), emphasizing the need for context-driven, human-supervised maintenance rather than exclusive computational reliance. For effective prescriptive maintenance, these data-driven delay patterns should be interpreted alongside domain expertise, system dynamics, and historical maintenance records to determine whether they align with known mechanical response times, buffering effects, or cascading failure mechanisms in the equipment under study.
The multi-stage architectural design delivers comprehensive diagnostics through interdependent module functionality. Case study demonstrates that this layered approach manages to reveal both fleet-wide commonalities and individual nuances that would otherwise remain obscured through isolated single-method approaches. However, the framework utility adapts to dataset dimensionality and problem complexity in real-world applications. While high-dimensional systems could benefit from the complete workflow’s systematic feature reduction and structured causality discovery, strategic module deployment becomes essential when sensor parameters are limited. For instance, practitioners may choose to bypass TE for direct clustering and causality analysis when domain knowledge sufficiently identifies causal relationships. The decision is ultimately constrained by organizational maintenance objectives, available computational resources, and the criticality of equipment-specific failure mechanisms. Regardless of dimensionality, the framework’s prescriptive value lies in systematic temporal relationship characterization that augments human expertise, ensuring maintenance interventions integrate computational insights with domain knowledge for optimized timing and resource allocation.
The current implementation establishes proof-of-concept viability for the proposed approach within the scope of the provided data tasks. Success with data subject to confidential treatment highlights the framework’s adaptability and suggests its promise for broader use. At the same time, parameter optimization and how the resultant knowledge should be interpreted can vary when applied to systems with differing characteristics, such as higher dimensionality, different sampling frequencies, or more complex interaction patterns among components. validation on a wider scale across diverse equipment types and operational environments would therefore strengthen generalizability claims and enable systematic benchmarking against alternative causal discovery methodologies under controlled conditions, thereby allowing continuous modular refinement. It is also important to note that the confidential data treatment in this study limits direct domain validation and exemplifies a common constraint in industrial research where ground truth causal relationships are rarely documented. Consequently, the interpretative aspects of discovered relationships necessitate reliance on domain-agnostic validation approaches such as those employed in this study. Future developments could benefit from establishing systematic parameter selection protocols and developing standardized evaluation frameworks for maintenance-oriented causal discovery tasks. Additionally, as noted in Chatterjee and Dethlefs 46 integrating human-in-the-loop mechanisms would enhance deployment confidence. These considerations collectively highlight pathways for evolving the current analytical foundation into a comprehensive prescriptive maintenance solution supporting real-time monitoring and automated decision-making capabilities.
Conclusion
This paper proposes a causality discovery framework as a novel approach to understanding and managing sensor data-tracked equipment health in maintenance applications. The framework establishes a foundation for equipment prescriptive maintenance in industrial IoT environments through three interrelated modules. The first module adopts TE to determine features causally influencing degradation processes. The second module employs a DTW-driven algorithm decomposing causal feature space into clusters based on temporal pattern similarity in response to system degradation. The third module applies the TCDF deep learning network to identify causal relationships and compute delay times through attention mechanisms and causality validation. Case studies implementing the framework on real-world SCANIA truck datasets demonstrate functionality and performance across individual trucks, proposing generalizable maintenance insights. Results show framework adaptability in causality discovery at varying granularity levels and potential for fleet-wide applications. The paper emphasizes that the framework requires expert discretion in parameter design and result interpretation, considering equipment-specific and organizational factors to elevate credibility, avoid over-simplification, and minimize execution outcome misinterpretation. From a practical perspective, the systematic causality discovery approach enables proactive interventions that enhance equipment safety and reliability through evidence-based, targeted interventions, thereby increasing asset availability and preventing catastrophic failures.
Future work is expected to address the following aspects. User interactivity can be enhanced by enabling domain experts to customize DTW clustering beyond silhouette score reliance, allowing maintenance specialists to adjust configurations based on domain knowledge and diagnostic requirements. Real-time capability should be pursued through adaptive systems leveraging historical causal patterns while continuously updating causal feature space, symptom groupings, and delay estimations as new data becomes available, facilitating immediate anomaly detection and shifting from retrospective analysis to proactive monitoring. Additionally, automation of maintenance decision-making should translate discovered causal relationships and temporal delays into actionable recommendations via intelligent advisory systems providing guidance on optimal timing for replacements, resource allocation priorities, and targeted repair strategies. This would extend the current analytical framework into a comprehensive prescriptive maintenance solution ready for industrial deployment.
Footnotes
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is funded by the Natural Sciences and Engineering Research Council of Canada (NSERC; No: NSERC RGPIN-2019-04763). The authors are grateful for this support.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
