Sage Journals: Discover world-class research

Abstract

In vibration-based structural health monitoring (SHM), it is well known that environmental and operational variations (EOVs) affect the dynamic response of the structure of interest. This fact makes it difficult to distinguish between structural changes caused by damage and those caused by EOVs. In SHM, this issue is addressed by data normalisation, whereby machine learning techniques are commonly applied. However, ensuring their accuracy necessitates capturing a comprehensive range of EOVs within the training data. Collecting these is inherently challenging for real-world applications, especially with new EOV states emerging due to climate change. This study’s unique contribution is applying and comparing two grey-box models based on Gaussian process (GP) regression to remove EOVs with low data coverage and demonstrate their efficiency for damage detection with an open-access benchmark dataset. To this end, the first two bending mode natural frequencies – used as damage-sensitive features – of the Leibniz University Test Structure for Monitoring (LUMO), an outdoor lattice tower, are mapped. Two approaches to embedding physical knowledge in the GP are investigated. The first approach incorporates knowledge through the mean function, while the second involves the selection or design of a kernel. Subsequently, the two approaches are compared with a black-box and a white-box model. For long-term SHM, the repair of a damage mechanism is accounted for by the normal-condition alignment scheme, and damage detection is performed using the Mahalanobis distance. The study demonstrates that applying grey-box models contributes to a more reliable representation of the variations caused by unknown EOVs than pure black-box models, thereby improving damage detectability with sparse and incomplete training data due to climate change. However, as the dependencies modelled for LUMO are primarily linear, further research is required to assess the applicability of these findings to structures where dependencies are expected to be non-linear.

Keywords

Structural health monitoring grey-box model Gaussian process regression data normalisation repair problem damage detection

Introduction

Structural health monitoring (SHM) employs various strategies to detect and identify structural damage. The damage identification process results in a hierarchical, five-level approach.¹ The lowest two levels are concerned with making reliable statements about whether damage has occurred and its location. The higher levels involve identifying the type of damage and prognosis for the remaining lifetime. A distinction is generally made between unsupervised SHM, which addresses the lower two levels, and supervised SHM for the higher levels. Following Farrar et al.,² an SHM process generally involves the steps (a) operational assessment, (b) data collection, (c) feature extraction and (d) statistical model development for feature classification, whereby these steps are usually summarised by the statistical pattern recognition paradigm. During the feature extraction step, damage-sensitive features (DSFs), which are expected to correlate with damage-related changes in the structure, are extracted from the measurement data. The choice of DSFs is application specific.³ Once appropriate DSFs have been selected, the subsequent feature classification step employs algorithms to determine the current state of the monitored structure. Unfortunately, for most civil engineering infrastructure, only data from the structure’s healthy state are available, resulting in an unsupervised learning SHM problem. Here, the DSFs are continuously extracted from the measurement data and compared with the healthy state of the structure. Any significant deviation can then be considered as indicating damage. However, environmental and operational variations (EOVs) can affect the structure’s behaviour and mask the DSFs. It can be stated that the more sensitive a DSF is to damage, the more sensitive it is to changing EOVs.⁴ Handling these effects is commonly referred to as data normalisation and remains a challenge in the SHM research community. Considering the effect of climate change is crucial for a reliable data normalisation algorithm, as it will change the EOVs under which a structure operates over time.⁵ Therefore, appropriate handling of the variations due to EOVs not included in the training data is essential to ensure the reliability of the SHM system.⁶

In the case of global vibration-based SHM, it is assumed that the vibration characteristics under ambient, unknown excitation (output only, e.g., due to wind) of the considered structure change due to damage and can thus be utilised to obtain DSFs. A benefit of using a global method for damage detection is that the measurement locations do not have to be close to the damage. Here, natural frequencies are commonly employed as DSFs because they are physically tractable and can be extracted from dynamic measurements in near-real time due to the availability of established identification algorithms such as stochastic subspace identification (SSI) or Bayesian operational modal analysis (BAYOMA). Beyond this, spectral functions such as power spectra,⁷ mode shapes,⁸ transmissibilities,⁹ autoregressive (AR) coefficients¹⁰ and subspace-based indicators¹¹ can also serve as DSFs for vibration-based SHM methods. Although natural frequencies are not always sufficiently sensitive to damage,⁴ they are selected as DSFs for this study as the focus is on data normalisation rather than on choosing the most sensitive DSFs.

Many authors have investigated the influence of EOVs on the natural frequencies of different structures. Farrar et al.¹² reported that the natural frequencies of the Alamosa Canyon Bridge vary by approximately 5% during 24 h. Peeters et al.¹³ also indicated for the Z-24 bridge that for 1 year of monitoring, differences ranging from 14% to 18% in natural frequencies could be explained by EOVs. Ubertini et al.¹⁴ reported variations in the natural frequencies of a bell tower due to temperature effects. Jonscher et al.¹⁵ observed EOV-dependent variations in the natural frequencies for an operating onshore concrete-steel hybrid wind turbine tower of up to 7%. Similar variations due to EOVs were also identified for the natural frequencies of an offshore Vestas V90-3MW turbine on a monopile foundation, as reported in Weijtjens et al.¹⁶ All these studies indicate that natural frequencies are susceptible to changing EOVs, which makes distinguishing between structural changes caused by damage and those due to changing EOVs challenging. Climate change will be another source of increasing variations in DSFs due to changing EOVs. Changing temperature and other dominant factors, such as flooding and high winds, significantly influence this variation.¹⁷ However, the exact magnitude of the change is still unclear, whereby it is unquestionable that our climate is changing.¹⁸ Therefore, numerous studies have investigated climate change’s potential impact on infrastructures.^19–21 However, Figueiredo et al.⁵ stated a research gap in the literature regarding the effect of unknown EOVs caused by climate change on long-term damage detection. Therefore, the authors of this study believe that vibration-based damage identification procedures will only become feasible in the face of climate change with adequate accounting for the new variations due to EOVs using robust data normalisation algorithms.

Several strategies have been proposed for developing a DSF insensitive to EOVs.²² Currently, there are two well-known approaches for data normalisation. The first approach, and the one used in this study, is to measure the parameters related to EOVs and the structural response. The extracted DSFs of interest are then modelled as a function of the EOVs assumed to cause their variation. The second approach does not rely on measurements of the EOVs. Here, algorithms are applied directly to the DSFs to account for the influence of EOVs on them. Examples regarding the first approach are given by Dervilis et al.,²³ who employed multivariate linear regression analysis to model the variation of natural frequencies with respect to EOVs for bridge monitoring. To compensate for EOVs using multivariate non-linear regression, Roberts et al.²⁴ modelled the natural frequencies of an operational Vestas V27 wind turbine blade based on temperature and wind speed. Other non-linear regression approaches use machine learning techniques such as artificial neural networks to normalise DSFs by modelling them as a function of EOVs.²⁵ Beyond this, Gaussian process (GP) regression has also been used in regression-based data normalisation. As a powerful Bayesian tool, GPs provide several desirable characteristics, including the ability to make predictions and probability distributions without specifying a particular parametric functional form, requiring only a few a priori inputs, and being capable of modelling signals with high levels of noise.²⁶ In this context, Worden and Cross²⁷ utilise an extended GP to map the variations in natural frequencies due to temperature for the SHM of the Z24 bridge. Jonscher et al.⁸ used a heteroscedastic GP for data normalisation of the modal parameters of an onshore wind turbine tower to account for input-dependent noise.

As mentioned above, the second approach for data normalisation relies solely on the DSFs and does not require measurements of the EOVs. Here, Lucà et al.²⁸ apply the Mahalanobis squared distance to multiple modal parameters to filter out variations due to EOVs and enable damage detection as a multivariate outlier detection problem. Yan et al.²⁹ leveraged local principle component analysis as an extended version of principle component analysis (PCA) to cluster natural frequencies into linear regions based on their temperature conditions. Cointegration was also applied to spectral lines from the frequency spectrum to make these insensitive to temperature-induced variations.³⁰ In addition to this, Figueiredo et al.³¹ compared an auto-associative neural network, factor analysis, Mahalanobis distance and singular value decomposition to distinguish between changes in the AR coefficients caused by EOVs and those caused by damage. Here, the Mahalanobis distance-based algorithm was identified as the most effective overall approach to data normalisation.

However, despite the availability of large amounts of structural and EOV data, it is often impossible to fully capture all possible states of a structure under the influence of those EOVs. Instead, the data provide information across a specific window of EOVs. This especially applies to the advanced machine learning approaches in regression-based data normalisation. Although these are powerful tools for extracting unknown correlations from data, predictions for unknown EOV states are challenging. As climate change progresses and the average temperature changes over time,³² robust algorithms will become even more critical for dealing with those changes.³³ Here, Figueiredo et al.⁵ analysed the effects of climate change on the SHM of the Z-24 bridge and stated that traditional machine learning algorithms may not be robust enough to deal with those effects. The second category of data normalisation approaches—data projection methods—is more robust against limited EOVs’ variability in the training data, as they model the relationships between the DSFs themselves rather than directly modelling the influence of the changing EOVs on the DSFs. Therefore, it seems reasonable to extend the purely data-based regression approaches for data normalisation with knowledge about the primary correlations between EOVs and DSFs to lessen the reliance on fully capturing the complete EOV data and to make the subsequent damage detection robust to climate change effects using regression-based data normalisation.

In this work, prior knowledge is incorporated into a GP to form a grey-box model.³⁴ There are a wide range of ways to incorporate prior knowledge. For example, a physics-informed kernel can be derived to introduce prior knowledge,³⁵ or constraints can be incorporated into the GP since boundary conditions are often known in advance in many engineering fields.³⁶ A detailed overview of the different approaches is given in Cross et al.³⁷ In the context of SHM, several authors have previously demonstrated that grey-box modelling can significantly improve predictive accuracy despite this research area’s relative novelty. Jones et al.³⁸ presented a constrained kernel regarding the geometry of the structure of interest for damage localisation. The study of Haywood-Alexander et al.³⁹ shows how physical knowledge can be utilised by constructing different kernels to model ultrasonic-guided waves. Pitchforth et al.⁴⁰ presented a grey-box model for wave loading prediction by combining Morison’s equation and a GP. Möller et al.⁴¹ explicitly encoded known dependencies between inputs and outputs into the GP via a prior mean function. Zhang et al.⁴² also incorporate a linear relationship between cable extension and temperature as prior knowledge to predict the deck displacement of a cable-stayed bridge for monitoring the structure’s performance. To create a physically meaningful probabilistic power curve model of an operating wind turbine, Mclean et al.⁴³ also employed bounded GPs to account for the normalised predicted power constrained to a maximum rated power.

In conclusion, with regard to the application of grey-box models in SHM, these models show promising results. However, their applicability for damage detection has yet to be thoroughly investigated. This study extends the investigations of the grey-box modelling approach proposed by Möller et al.⁴¹ for the data normalisation of the DSFs in terms of the natural frequencies of a lattice tower exposed to changing EOVs. Here, the Leibniz University Test Structure for Monitoring (LUMO) serves as the case study on which the investigations of this work are carried out.⁴⁴ This work’s primary contribution is applying and comparing two different grey-box approaches to remove EOVs in case of low EOV coverage in the training data to enhance the resilience to new EOVs, for example, due to climate change, for damage detection. The classification performance of the models is demonstrated using an outdoor benchmark structure, and the grey-box models are compared against a black-box and a white-box model, whereby the rates of true positive and false negative indications of damage determine the classification performance.

The remainder of the study is organised as follows: section ‘GP regression’ briefly surveys GP regression as a solely data-driven method (or black-box model). After presenting GP regression, the introduction of GP-based grey-box modelling is initiated. Here, two approaches are discussed to embed physical knowledge as proposed solutions to model the effects of the EOVs on the natural frequencies selected as DSFs. The first approach is to incorporate prior knowledge into the GP by designing or selecting an appropriate kernel to embed a belief of which kind of functions the solution of the GP is drawn. The second approach is to include prior knowledge in the prior mean of the GP. Throughout the paper, LUMO is used to demonstrate the efficiency of the proposed models. In the section ‘Case study’, details of LUMO are offered, and its natural frequencies are presented as DSFs that show variability due to EOVs in terms of changing temperature and wind speed. In order to compensate for the influence of the EOVs on the DSFs, a comparative study of the black, white and grey-box models, using sparse and incomplete training data, is conducted. The section concludes with an investigation of the efficiency of the models for three different damage scenarios carried out on LUMO. Finally, the section ‘Conclusion and outlook’ presents a conclusion and discussion of the analysis carried out in this study.

GP regression

A GP model is designed for regression problems of the form $y = f (x) + N (0, σ_{n}^{2})$ . Here, a target $y$ is modelled as a function $f$ of a given set of inputs $x$ , corrupted by additive noise with zero mean and constant variance $σ_{n}^{2}$ . As defined by Rasmussen and Williams,²⁶ a GP forms a prior probability distribution over possible functions that fit a given set of training points. To describe a GP, only the mean function $m (x)$ (commonly set to zero in Rasmussen and Williams²⁶) and the covariance function $k (x, x^{'})$ (generally referred to as the kernel) are required

f (x) ~ G P (m (x), k (x, x')) .

(1)

To make predictions, Bayesian inference is used. The prior, formed by $m (x)$ and $k (x, x^{'})$ , is updated with observed data to produce a posterior distribution, which can be computed using the Bayes theorem. Given a set of training data $X = [x_{1}, x_{2}, \dots, x_{n}]$ and target values $y$ , the joint distribution of the observed targets $y$ and the predicted values $f_{*}$ for a query point $x_{*}$ is given by

\begin{matrix} (\begin{matrix} y \\ f_{*} \end{matrix}) ~ \\ N (\begin{matrix} [\begin{matrix} m (X) \\ m (x_{*}) \end{matrix}], & [\begin{matrix} K (X, X) + σ_{n}^{2} I & K (X, x_{*}) \\ K (x_{*}, X) & K (x_{*}, x_{*}) \end{matrix}] \end{matrix}), \end{matrix}

(2)

where $I$ denotes the identity matrix and $K (\cdot)$ represents the covariance matrix defined by $k (x, x^{'})$ .

The kernel is always chosen as a function of the inputs $x$ to define the similarity between $x$ and $x^{'}$ and, therefore, specifies the smoothness of the GP. A variety of kernels are available for use in GPs. The squared exponential kernel is a popular choice, but due to its smoothing property, it is often inappropriate for modelling physical processes.⁴⁵ Therefore, the Matérn 5/2 kernel is employed as the covariance function in this study. This kernel is defined as

k_{v = \frac{5}{2}} (r) = σ_{f}^{2} (1 + \frac{\sqrt{5} r}{l} + \frac{5 r^{2}}{3 l^{2}}) \exp (- \frac{\sqrt{5} r}{l}),

(3)

where $r = {| x - x^{'} |}^{2}$ . To make predictions, the joint distribution from Equation (2) is conditioned on the training data, yielding Equation (4)

p (f_{*} | x_{*}, y, x) ~ N (E [f_{*}], V [f_{*}]),

(4)

where the posterior predictive mean and variance are given by

E [f_{*}] = m (x_{*}) + K (x_{*}, X) {[K (X, X) + σ_{n}^{2} I]}^{- 1} (y - m (X)),

(5)

V [f_{*}] = K (x_{*}, x_{*}) - K (x_{*}, X) {[K (X, X) + σ_{n}^{2} I]}^{- 1} K (X, x^{*}) .

(6)

To predict the test targets $y_{*}$ , $σ_{n}^{2} I$ is added to the variance defined in Equation (6).

As shown in Equation (3), the Matérn 5/2 kernel has two hyperparameters: $σ_{f}^{2}$ , which defines the signal variance, and $l$ , defining the length scale. The hyperparameters of a GP with a Matérn 5/2 kernel are given by $Θ = [σ_{n}^{2}, σ_{f}^{2}, l]$ . To determine the optimal values of the hyperparameters $Θ$ to fit the training data, the likelihood of the predictions $p (y | X, Θ)$ is used. Consequently, the likelihood function describes the probability of observing the targets $y$ given the input data $X$ and the $Θ$ . During the training phase, the likelihood, expressed in the negative log form as

- \log p (y | X, Θ) = \frac{1}{2} y^{T} K_{y}^{- 1} y + \frac{1}{2} | K_{y} | + \frac{n}{2} 2 π,

(7)

where $K_{y} = K (X, X) + σ_{n}^{2} I$ , is minimised with respect to the set of hyperparameters $Θ$ .

When applying the exact inference for a GP on a data set of size $n$ , this has the drawback of a computational complexity $O (n^{3})$ . This complexity prohibits the application of GPs when a data set contains more than a few thousand data points.⁴⁶ This limits the use of GPs in the context of SHM, as large amounts of data from several years are often available. Therefore, methods are used to reduce the complexity. These methods construct an approximation based on a small set of $m$ (with $m < n$ ) support or inducing variables to reduce the complexity from $O (n^{3})$ to $O (n m^{2})$ . The inducing variables are values of the function $f$ evaluated at the points $X_{m} = [x_{1}, x_{2}, \dots, x_{m}]$ , where $X_{m}$ can be a subset of the training inputs or auxiliary pseudo-points.⁴⁷ Considering this, the mean and variance from Equation (4) can be approximated. Reference is made to⁴⁸ for a comprehensive review of fundamental sparse approximation methods.

For this study, the variational approach to inducing variables of Titsias⁴⁹ is utilised. Following this, the posterior GP can be described by the predictive Gaussian distribution

p (z | y) = \int p (z | f) p (f | y) d f,

(8)

where $z$ is any finite set of function points, $p (z | f)$ is the conditional prior of the GP, and $p (f | y)$ denotes the posterior distribution over the corresponding values of the latent function $f$ . The integral in Equation (8) is to be approximated by using a small set of $m$ inducing variables $f_{m}$ , evaluated at the inputs of $X_{m}$ . Equation (8) can therefore also be equivalently expressed as

p (z | y) = \int p (z | f_{m}, f) p (f | f_{m}, y) p (f_{m} | y) d f d f_{m} .

(9)

Assuming that $z$ and $f$ are independent for a given $f_{m}$ , Equation (9) results in

q (z) = \int p (z | f_{m}) ϕ (f_{m}) d f_{m} = \int q (z, f_{m}) d f_{m},

(10)

where $q (z) = p (z | y)$ and $ϕ (f_{m}) = p (f_{m} | y)$ . The predictive distribution from Equation (10) therefore only requires the $m$ inducing variables $f_{m}$ and can be computed in $O (n m^{2})$ . However, the assumption for $f_{m}$ is unlikely to hold true in general, so $q (z)$ is merely an approximation that needs to be optimised, whereby the quality is strongly dependent on the choice of the inducing variables $X_{m}$ .

According to,⁴⁹ it is useful for a good approximation to assume $ϕ (f_{m})$ to be a variational Gaussian distribution that depends on a mean vector $μ$ and a covariance matrix $A$ . To select the optimal quantities for $X_{m}$ and $ϕ$ , Titsias⁴⁹ presented a variational inference method to determine these jointly, whereby the inducing inputs are selected by minimising the Kullback-Leibler divergence (see Titsias⁴⁹ for further details).

All the models presented in this paper were written in Python and run using GPflow.⁵⁰

Incorporating prior knowledge

Following Cross et al.,³⁷ it is possible to incorporate prior knowledge into a GP in various forms. Here, it generally holds true that the more physical knowledge is incorporated, the less the predictions depend on the available data. Depending on how prior knowledge is integrated into the GP, the resulting grey-box model is more at the whiter end (much prior knowledge) of the spectrum of grey-box modelling possibilities or at the darker end (less prior knowledge, classical machine learning approaches).³⁷ For example, a GP with zero mean prior and a Matérn 5/2 kernel represents a black-box model and can be denoted as $G P (0, k_{ML})$ . The subscript ML indicates that this GP follows a purely data-based machine learning approach.

As seen in Figure 1(a), this approach maps the trend within test data similar to the training data (it can be described as interpolation) well. However, as soon as the test data is beyond the observation range of the training data (described as extrapolation), the purely data-based GP can no longer predict the trend appropriately and falls back to its mean value of 0.

Figure 1.

Interpolation and extrapolation of different grey-box approaches, including the 95% confidence interval (CI). (a) Shows the purely data-based GP: $G P (0, k_{ML})$ , (b) the GP with a kernel as a combination of knowledge-based and data-driven kernels: $G P (0, g (k_{P}, k_{ML}))$ , (c) a GP with a mean as a prior: $G P (m_{P}, k_{ML})$ and (d) a simple linear model. The data were generated by $Y = 3 + X \sin (π X + \exp (0.5 X)) \exp (0.04 X + 0.2) + 0.5 N (0, 1)$ .

This study investigates two possibilities for incorporating prior knowledge into a GP. On the one hand, a GP with a kernel that combines knowledge-based and data-driven parts is used. This is particularly appealing if only little or partial knowledge of the process of interest is available. The resulting GP can be denoted as $G P (0, g (k_{P}, k_{ML}))$ .³⁷ The subscript P denotes physics and indicates the white-box part of this grey-box model, while the subscript ML describes the black-box part.

On the other hand, a physics-based prior mean is utilised to account for prior knowledge. This approach becomes useful if parts of the mean behaviour can explicitly be expressed, and the GP is denoted by $G P (m_{P}, k_{ML})$ .³⁷ Here, the prior mean is additionally combined with the Matérn 5/2 kernel, which serves as the black-box part to account for additional variability. Throughout this study, the parameters of the mean function were set a priori by fitting a linear model to the training data. The prior mean thus describes the known relationships and cannot be adjusted during the hyperparameter optimisation of the GP. This strategy corresponds to a residual modelling approach.³⁷

Beyond the above-mentioned approaches, there is still a broad spectrum of methods for incorporating physical insights. For the example in Figure 1, it is conceivable to account for the periodic behaviour to obtain a better fit. Nevertheless, this can also lead to inaccurate predictions as the period and amplitude change. Thus, it seems reasonable to only map the underlying trend to minimise the risk of modelling errors. Therefore, the underlying trend is assumed to be linear. In the case of $G P (0, g (k_{P}, k_{ML}))$ , this relationship can be incorporated within a kernel by combining the Matérn 5/2 with a linear kernel, which is defined as

k_{LIN} (x, x') = σ_{f}^{2} (x^{⊤} x') .

(11)

By summing the kernels $g (k_{P}, k_{ML}) = k_{LIN} + k_{v = 5 / 2}$ , it is possible to model the data as a superposition of the two, now representing a more structured kernel.⁵¹ Figure 1(b) shows that such a GP can now extrapolate the underlying linear trend. However, when it comes to extrapolation, the confidence intervals of the prediction are still quite large compared to those for the interpolation, which indicates a certain degree of uncertainty in the prediction.

As illustrated in Figure 1(c), the grey-box model $G P (m_{P}, k_{ML})$ using a linear prior mean has the advantage that the CIs for the extrapolation are smaller than those from the kernel-based GP shown in Figure 1(b).

An essential idea behind machine learning is always to select the simplest model that best explains the underlying relation (Occam’s razor). Since a linear regression model is the simplest model that could be assumed for the example in Figure 1, this model can be used to evaluate the added value of the grey-box models under consideration. Figure 1(d) indicates that the linear model can capture the underlying linear trend, whereas non-linear effects cannot be captured by definition. In addition to the capacity to extrapolate the underlying linear trend, it is also vital to consider how well a model can cope with incomplete training data. For the evaluation, the $R^{2}$ score is utilised, which is defined as

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - f_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}},

(12)

where $y_{i}$ and $f_{i}$ are the measurements and predictions, respectively, and $\bar{y}$ is the mean of the measurement data. If the predictions exactly match the measurements, $R^{2} = 1$ holds. A model that always predicts the mean $\bar{y}$ has $R^{2} = 0$ , and models that are worse than this have a negative $R^{2}$ score.

To this end, Figure 2 shows how a reduction of the training set affects the prediction accuracy of the models considered. For both Figure 2(a) and (b), the training set was reduced by removing equally distributed samples, so the proportion of the training set used is only up to 10% of the original training set.

Figure 2.

Comparison of the $R^{2}$ scores. (a) Shows the $R^{2}$ scores using a test set with only known data. Here, the grey-box model scores are almost identical: interpolation and (b) illustrates the $R^{2}$ scores using a test set including data beyond the training data’s observation range: extrapolation.

It can be seen in Figure 2(a), where the test data cover the same range of observations as the training data, that the size of the training set only has a minor effect on the accuracy, as the $R^{2}$ scores are almost constant for all models. Concerning the GPs, this finding is also favoured by the fact that the GPs use sparse approximation, so the prediction is based on $m$ inducing variables. Concerning accuracy, the linear model always performs worse than the GP-based models, as just the underlying linear trend can be captured. In the case of interpolation, the purely data-driven GP also performs slightly better than the grey-box approaches, as this pure machine learning approach can map any trends for known data ranges due to its intrinsic flexibility.

However, as can be seen from Figure 2(b), such an approach cannot extrapolate, which resulted in negative $R^{2}$ scores. The grey-box models $G P (0, g (k_{P}, k_{ML}))$ and $G P (m_{P}, k_{ML})$ , as well as the linear model, can map the underlying linear trend when it comes to extrapolation. Here, the kernel-based approach performs slightly worse with a small training set than the GP with a prior mean and the linear model. This is because providing prior knowledge via the kernel is a less explicit form compared to the GP using a prior mean and the linear model.

Case study

The Leibniz University Test Structure for Monitoring (LUMO) represents a benchmark structure for SHM methods.⁵² The structure is a 9 m high steel lattice mast, shown in Figure 3(a). The mast consists of three identical sections with a length of 3 m each and weighs around 90 kg.

Figure 3.

Photograph of the LUMO structure (a), a schematic of the structure highlighting the nine acceleration measurement levels (ML) and the investigated damage locations (DAM) with the reference axes (b), and the damage location 6 (DAM6) displaying the reversible damage mechanisms (c).

LUMO’s key advantage is the possibility of introducing a reversible local damage mechanism at six levels of the structure. Figure 3(b) depicts the locations relevant to this work. Removing up to three bracings (weighing 55 g each) at each damage level to introduce localised stiffness and mass changes is possible. The damage mechanisms include an M10 threaded rod with a coupling nut on each end. Damage is introduced by loosing the coupling nuts or removing the entire bracing. In this study, all considered damage cases involve the complete removal of one bracing at a certain damage level. Figure 3(c) shows the damage mechanism at level 6 (DAM6).

The structure is equipped with 18 uniaxial IEPE accelerometers mounted in pairs using 90° mounting brackets to measure orthogonal directions in the horizontal plane and capture spatial motion. The data is acquired at a sampling rate of 1651.61 Hz and stored in 10-min data blocks and can be accessed through a repository at Leibniz University Hannover.⁴⁴ Figure 3(b) shows the positions of the accelerometers that are positioned at ML1-9. The material temperature is measured at ML10 using a Pt100-type temperature sensor. It should be noted that this temperature is not expected to be spatially constant.

Beside the sensors on the structure, the Institute of Meteorology and Climatology (IMUK) of Leibniz University Hannover provides additional meteorological data from a meteorology mast close to the test structure. This data includes the air temperature, maximum wind speed, relative humidity, global radiation and wind direction. This study investigates the period from mid-April 2021 to July 2021. The corresponding natural frequencies of the first two bending mode pairs are pictured in Figure 4.

Figure 4.

Evolution of natural frequencies for B1-x, B1-y (a) and B2-x, B2-y (b) for all states under study using 2-min intervals. The abbreviations B1 and B2 refer to the first and second bending modes, and x and y indicate the dominant orientation of the associated mode shape. Damage states are highlighted in grey.

This period covers a total of six different condition states of the LUMO. Descriptions of the condition state during this period are summarised in Table 1.

Table 1.

Health states of the structure and corresponding labels.

Condition of the structure	Period of dataacquisition	Label
Healthy	20.04.21 to 04.05.21	H
Damage state – one strutremoved	04.05.21 to 18.05.21	DAM6
Repair state after DAM6	18.05.21 to 28.05.21	R1
Damage state – one strutremoved	28.05.21 to 14.06.21	DAM4
Repair state after DAM4	14.06.21 to 25.06.21	R2
Damage state – one strutremoved	25.06.21 to 12.07.21	DAM3

The states DAM6, DAM4 and DAM3 specified in Table 1 correspond to the damage locations pictured in Figure 3(b).

Increasing temperatures can be observed in the damage cases, representing variations in environmental conditions due to climate change in this study. To investigate the influence of increasing temperatures, which are not included in the training data, the regression approaches used in this study are only trained with data from state H. This represents the challenges of unknown environmental conditions due to climate change. This section first examines the damage-sensitive features—the natural frequencies of the first two bending modes—and their environmental variations. Afterwards, the data normalisation approaches are applied, and damage detection is performed.

DSFs and environmental variations

In order to assess the condition of a structure, DSF in terms of natural frequencies needs to be identified from the measurement data. This study uses the frequency domain method BAYOMA for this purpose.⁵³

As in previous studies regarding LUMO, torsional modes are excluded from the subsequent analysis. Despite their sensitivity in assessing damage states for LUMO, as evidenced by,^54,55 torsional modes often do not present a high sensitivity to damage in operational civil structures, such as bridges and wind turbines. To develop an SHM system applicable to practical scenarios, it is reasonable to exclude them from this study. A similar finding regarding damage sensitivity also applies to higher natural frequencies, which tend to exhibit higher sensitivity to damage compared to lower frequencies.⁴ However, due to a lack of excitation, higher modes can often not be reliably identified, and generally, more sensors are necessary to detect them with high fidelity. For larger structures such as bridges or wind turbines, the spatial resolution of sensors is often limited. Even for LUMO, it has been indicated that higher modes cannot be reliably identified using the current sensor setup.⁴⁴ Therefore, it is reasonable to utilise only the lower modes of LUMO in the subsequent analysis to obtain reliable identifications and to facilitate the transfer of findings to larger structures. Taking this into account, this study considers the first two bending mode pairs (B1-x, B1-y, B2-x and B2-y) to provide a realistic case study. These DSFs have also been studied by Jonscher et al.⁵⁴ concerning EOVs and damage detectability.

Besides the excitation, the time interval used for operational modal analysis is crucial for the accurate identification of the natural frequencies, as the identification uncertainty of these depends on the number of oscillation periods within the data set.⁵⁶ A comparison of the identification uncertainties $σ$ of the first two bending mode pairs of the 2-min and 10-min intervals of state H is shown in Figure 5(a) and (b). As expected, a shorter time interval increases the uncertainties compared to 10-min intervals.

Figure 5.

Cumulative distribution function (CDF) of the standard deviation $σ$ of the natural frequencies for B1-x, B1-y (Figure 5a) and B2-x, B2-y (Figure 5b) from the identification results of BAYOMA during state H.

Hence, for the operational modal analysis of large structures such as wind turbines, 10-min intervals are commonly used.^15,57,58 Regarding LUMO, Kahrger et al.⁵⁹ employed 2-min intervals to ensure better comparability of the findings obtained on LUMO with larger structures in terms of the number of oscillation periods within a dataset used for operational modal analysis. Therefore, this study is also based on the identification results of 2-min intervals. Table 2 characterises the natural frequencies obtained using BAYOMA during state H for 2-min and 10-min intervals.

Table 2.

Characterisation of the mean, standard deviation ( $σ$ ), and coefficient of variation (CV) of the identified natural frequencies (B1-x, B1-y, B2-x, B2-y) based on 2-min intervals and averaged for state H. Values for 10-min intervals are given in brackets.

Identifier	Mean [Hz]		$σ$ [Hz]		CV [%]
B1-y	2.77	(2.77)	0.0037	(0.0015)	0.12	(0.05)
B1-x	2.82	(2.82)	0.0034	(0.0013)	0.13	(0.05)
B2-y	16.01	(16.01)	0.0078	(0.0031)	0.05	(0.02)
B2-x	16.34	(16.34)	0.0073	(0.0030)	0.04	(0.02)

As LUMO is located outdoors, the structure is exposed to varying EOVs. In Figure 6, the variations of the considered natural frequencies can be observed for state H.

Figure 6.

Evolution of natural frequencies for B1-x, B1-y (a) and B2-x, B2-y (b) during state H for a 2-min interval.

It can be assumed that material temperature and maximum wind speed significantly impact the structural behaviour of LUMO.⁵² It should be noted that other EOVs can also be considered, as their selection is an application-specific modelling choice. Correlation analysis, in combination with a physical plausibility check, is often suitable for choosing EOVs that significantly influence structural behaviour. For instance, relative humidity was additionally considered for data normalisation of the natural frequencies of LUMO in Möller et al.⁴¹ However, as it strongly correlates with temperature, it was not included in this study. Furthermore, in Kahrger et al.,⁵⁹ global radiation was considered, which proved particularly useful for modelling predictive variance (cf. Equation (6)) as a function rather than a constant using heteroscedastic GP regression. Nevertheless, this study does not utilise global radiation since heteroscedastic GPs are not considered.

As noted above, variations in the natural frequencies of LUMO are primarily caused by temperature cycles that occur daily and seasonally.⁵² Since Young’s modulus, $E$ , depends on the temperature, and the natural frequencies are proportional to the root of $E$ , this relationship can be assigned a physical foundation. The temperature dependence of the natural frequencies is shown for LUMO as an example in Figure 7(a) for the first bending mode in the y direction for the healthy state H.

Figure 7.

Correlation plots and histograms of the 2-min intervals for the natural frequencies B1-y versus material temperature(a) and wind speed (b) for state H.

Figure 7(a) clearly shows a linear relationship between the material temperature and natural frequency following the findings of Wernitz et al.⁵² In this study, the Pearson correlation coefficient $ρ$ is employed to measure the linear relationship between these, whereby a Pearson correlation of $+ 1$ or $- 1$ indicates a perfect linear correlation. For the relationship between material temperature and B1-y, $ρ$ results in −0.7125. It should be noted that structures constructed from materials other than steel may exhibit distinct behaviours. In the case of masonry towers, thermal expansion in the masonry can lead to the reversal of this dependency, as it causes the closing of micro-cracks in mortar layers.⁶⁰ However, this phenomenon does not alter the fundamental principle that material temperature maintains a linear relationship with natural frequencies. Moreover, freezing temperatures affect the stiffness of structures, thereby challenging the validity of the aforementioned linear assumption.⁶¹ For LUMO, it was observed that the natural frequencies varied significantly from November 2020 to March 2021 due to ice formation, which can either stiffen or soften the structure depending on the amount and location of the ice.⁵² However, no freezing effects were observed during the period under investigation in this study, and only data above 0°C were utilised.

As the wind speed measurements for the period under consideration were incomplete, the values were estimated using the methodology presented in Wernitz et al.⁵² Since Wernitz et al.⁵² noted that temperature variations mask the effects of wind speed, a linear regression was used to remove the fundamental linear trend of the temperature effects from the natural frequencies. Figure 7(b) depicts the influence of the maximum wind speed on the remaining residual $Δ$ B1-y. The residual correlates with the maximum wind speed, with a Pearson correlation coefficient of $ρ = - 0.2523$ . A reason for the dependency between wind speed and natural frequency could be aerodynamic damping that increases with stronger winds and results in a decrease in the natural frequencies.⁵²

In conclusion, the influence of temperature on the considered natural frequencies is essentially linear and can be explicitly modelled for further investigations. However, a relationship with wind speed also exists, but to a significantly lower extent. Therefore, it seems reasonable not to explicitly specify this relationship as a linear function for the following investigations to avoid modelling errors.

Grey-box enhanced data normalisation

This study aims to make the first two bending mode pairs (B1-x, B1-y, B2-x, B2-y) as insensitive as possible to unknown EOVs using different grey-box-based data normalisation approaches. Furthermore, the study seeks to identify damage instances using the proposed data normalisation scheme. The grey-box models should enable meaningful predictions for EOV states not adequately represented in the training data, obtaining potentially robust algorithms to deal with EOV variability due to climate change. As stated in the section ‘DSFs and environmental variations’, the influences of material temperature and maximum wind speed cause the main variations of the natural frequencies. Hence, the grey-box models, introduced in the section ‘Incorporating prior knowledge’ should be investigated in terms of their applicability to mapping the EOVs to the natural frequencies as accurately as possible, especially with sparse and incomplete data.

Here, purely data-based model $G P (0, k_{ML})$ maps the influences of material temperature and wind speed as a pure black-box model. The two grey-box models, as discussed in the section ‘Incorporating prior knowledge’, can incorporate the prior knowledge about the linear trends between material temperature or maximum wind speed and natural frequency. On the one hand, the linear trend between material temperature or maximum wind speed and natural frequency is introduced by superposing the Matérn 5/2 kernel (black-box) and the linear kernel (white-box), resulting in the grey-box model $G P (0, g (k_{P}, k_{ML}))$ . On the other hand, the relationship between material temperature and natural frequency is assumed to be reliably linear and can therefore be incorporated using the prior mean (white-box) (cf. section ‘Incorporating prior knowledge’). In contrast, the influence of maximum wind speed is incorporated as for $G P (0, g (k_{P}, k_{ML}))$ in order to account for potential modelling errors. Therefore, two modelling techniques were employed in this grey-box model to incorporate the linear trend for each model input, one more explicit (prior mean) and the other less explicit (kernel). The resulting grey-box model is referred to as $G P (m_{P}, g (k_{P}, k_{ML}))$ . Finally, a linear regression model represents the simplest possible model, where only the linear relationship between material temperature and natural frequency is mapped (serving as a benchmark model).

Initially, an investigation is carried out to compare the accuracy of the grey-box models with each other, as well as with the purely data-based GP and the linear model for different-sized training sets for the state H. To this end, two cases are considered. Firstly, different subsets of the training data were created by removing uniformly distributed samples from the training data set (cf. Figure 8(a)). Secondly, the subsets were created by only considering samples up to a maximum temperature in the training set (cf. Figure 8(b)). For both, the accuracy of the models in fitting a test data set was compared in terms of their $R^{2}$ scores.

Figure 8.

Evolution of the $R^{2}$ scores of the models for B2-y with sparse and incomplete data. (a) Shows the $R^{2}$ scores for a test set, where the training set was reduced by removing equally distributed samples: Good coverage and (b) depicts the $R^{2}$ scores for the same test set, whereby the training set only contains data up to a maximum temperature of 22.5°C, which corresponds to approximately 98.5% of the total amount of training data: Bad coverage. The test data covers temperatures up to 24°C.

Analysis of Figure 8(a) reveals that the $R^{2}$ scores of all models appear relatively constant across different training set sizes. This observation aligns with the findings in section ‘Incorporating prior knowledge’. However, a slight underperformance of the grey-box models is worth noting compared to the purely data-based GP when trained on only up to 35% of the original data set. A factor contributing to this discrepancy could be the assumption of linearity, which could lead to modelling errors when only a small amount of training data is available as the underlying trend is incorrectly captured. This observation is supported by the fact that the more explicit knowledge is introduced, the lower the $R^{2}$ scores become in the case of sparse training data. Therefore, such modelling errors are more likely to occur when there is limited training data, as the model’s black-box components cannot compensate for these errors adequately. This is an interesting finding and needs more consideration in further studies as it has not been discussed in the literature before.

Nevertheless, a closer examination of Figure 8(a) indicates that the models demonstrate equivalent or superior accuracy to the purely data-based model when trained on larger data sets. This shows the effectiveness of employing grey-box models, as evidenced by the linear model showing the worst fit. Table 3 gives the accuracy of each considered model when the training data are complete (i.e., in the case of good EOV coverage).

Table 3.

$R^{2}$ scores for the bending mode pairs based on a test set.

Model	B1-x	B1-y	B2-x	B2-y
$G P (0, k_{ML})$	0.645	0.559	0.771	0.657
$G P (0, g (k_{P}, k_{ML}))$	0.645	0.550	0.773	0.658
$G P (m_{P}, g (k_{P}, k_{ML}))$	0.644	0.550	0.773	0.658
Linear	0.601	0.527	0.752	0.569

Table 3 reveals that the GP-based models perform almost equally for all natural frequencies and that the linear regression model always performs slightly worse than the others.

Figure 8(b) illustrates that the purely data-based GP can only map the trend above a certain limit with incomplete training data. The other models show a better extrapolation capability with incomplete training data regarding temperature coverage. When the EOV coverage is limited, it is noteworthy that the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ initially falls back to its prior mean. Therefore, the $R^{2}$ score corresponds to that of the linear regression model. In the case of limited training data, the kernel-based grey-box model $G P (0, g (k_{P}, k_{ML}))$ is superior to the other approaches. When sufficient data are available, the effects of the EOVs are adequately mapped by the GP-based models and the $R^{2}$ scores of these are consequently higher than those of the linear model. Figure 8(a) and (b) indicate that the $R^{2}$ scores of the grey-box models appear relatively constant across different training set sizes in terms of the covered EOVs. Therefore, it is a valid assumption that the proposed grey-box approaches build a robust model with sparse and incomplete training data, which is essential for robust damage detection.

Furthermore, Figure 9 shows the predictions of the individual models for B2-y, whereby the training set only contains samples for which the material temperature is less than or equal to 11°C.

Figure 9.

Predictions (interpolation and extrapolation) of the investigated models for B2-y on training data with a maximum material temperature of 11°C. (a) Shows the purely data-based GP as a black-box model: $G P (0, k_{ML})$ , (b) the kernel-based grey-box model: $G P (0, g (k_{P}, k_{ML}))$ , (c) the grey-box mode with a prior mean and a combined kernel: $G P (m_{P}, g (k_{P}, k_{ML}))$ and (d) shows the linear regression model.

In Figure 9(a), it can be seen that the purely data-based GP cannot extrapolate adequately. This model achieves an $R^{2}$ score of 0.195, and for the temperatures in the test set that were beyond the temperatures within the training data, the $R^{2}$ score is −1.132.

In comparison with the purely data-based GP, the grey-box model $G P (0, g (k_{P}, k_{ML}))$ (cf. Figure 9(b)) performs better and reaches an overall $R^{2}$ score of 0.571. However, in the case of temperatures beyond the observations contained in the training data, it only has an $R^{2}$ score of 0.04. This shows that although the incorporation of prior knowledge improves predictions, the ability to extrapolate remains very limited.

The grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ (cf. Figure 9(c)) can capture the underlying linear trend and the $R^{2}$ score for the test set is 0.637. In cases where the temperature in the test data was not included in the training data, an $R^{2}$ score of 0.237 is achieved.

Using a linear model (cf. Figure 9(d)), the overall $R^{2}$ score is 0.572, and for unknown temperatures, the $R^{2}$ score is 0.05.

The findings presented above demonstrate that the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ exhibits superior accuracy and can make predictions beyond the temperatures contained in the training data compared to the grey-box model $G P (0, g (k_{P}, k_{ML}))$ . Under poor EOV coverage, it becomes apparent that the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ tends to revert to the prediction of its prior mean, thereby aligning with the linear model employed within this study. This observation is in accordance with the definition of such a GP and is also demonstrated in Figure 1(c). However, with an adequately large training set, the model exhibits improvement compared to the linear regression model. Consequently, using this grey-box model to account for EOV variability due to climate change consistently enhances the white-box component utilised. While the linear regression model may be suitable for simple structures like LUMO, it is not readily transferable to more complex structures. For instance, in the case of wind turbines, non-linear relationships, such as those due to nacelle-tower interaction or the effects of aerodynamic forces on the rotor, are expected. A linear regression model cannot map these relationships by definition, while a non-linear regression model often cannot adequately capture these relationships either because no closed-form expression exists. In contrast, the proposed grey-box approach overcomes this limitation as it can incorporate a known underlying trend represented by an arbitrary (non-linear) function on the one hand and map any unknown or hidden relationships based on the available data on the other hand.

Applicability in case of non-linear dependencies

As described in the previous section, the linear model performed well during the period considered in this study. This is mainly due to the linear relationships between the EOVs and DSFs. In general, however, non-linear correlations can also occur, as Peeters et al.,¹³ for example, found a bilinear behaviour between temperature and natural frequency for the Z-24 bridge, as the asphalt contributes significantly to the stiffness of the structure for negative temperatures. Wernitz et al.⁵² found a similar behaviour for the natural frequencies of LUMO from December 2020 to March 2021 due to ice formation.

To demonstrate the proposed models’ applicability to model non-linear dependencies between EOVs and DSF, Figure 10 shows the prediction of the models considered in this study. The training data covers the period from 01 January to 23 February 2021. For the training, 80% of the available data was randomly selected, and only the temperature range from −10°C to 10°C was considered. The entire available temperature range was used for the test set (20% of the original data), for which the corresponding predictions (interpolation and extrapolation) can be seen in Figure 10. Furthermore, for $G P (m_{P}, g (k_{P}, k_{ML}))$ and the linear model, it was assumed that the linear relationship between temperature and natural frequency can only be assumed for positive temperatures. For this reason, the linear functions are only trained on data with a positive temperature.

Figure 10.

Predictions (interpolation and extrapolation) of the models under consideration for B2-y, including negative temperatures, demonstrate their applicability to model non-linear dependencies. (a) Shows the purely data-based GP as a black-box model: $G P (0, k_{ML})$ , (b) the kernel-based grey-box model: $G P (0, g (k_{P}, k_{ML}))$ , (c) the grey-box mode with a prior mean and a combined kernel: $G P (m_{P}, g (k_{P}, k_{ML}))$ and (d) shows the linear regression model.

Figure 10(a) shows that the purely data-based GP can map the non-linear dependency between temperature and natural frequency ( $R^{2}$ score: 0.773) but is not able to extrapolate beyond the EOVs covered in the training data.

The same observation holds for the $G P (0, g (k_{P}, k_{ML}))$ , depicted in Figure 10(b). It can be observed that the linear trend captured by the model is wrong, and this grey-box model achieves a $R^{2}$ score of 0.55. This is because the kernel of this grey-box model is a composition of the Matérn 5/2 kernel and a linear kernel, whereby the same linear kernel is used for negative and positive temperatures. An elegant solution to overcome this was proposed by Pitchforth et al.⁶² They suggest a change-point kernel to switch between different kinds of kernels. However, the application of those types of kernel to improve the prediction of $G P (0, g (k_{P}, k_{ML}))$ for non-linear dependencies is beyond the scope of this study.

Figure 10(c) demonstrates that the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ can map non-linear dependencies as well as correctly extrapolate the linear trend for positive temperatures ( $R^{2}$ score: 0.794).

An interesting observation can also be seen in Figure 10. Here, the training data has two different trajectories around the freezing point, one steep and one less steep. The reason for this is the different behaviour of the structure for the thawing and freezing cases. All GPs used only map one of the cases. Strategies for mapping both cases can be the utilisation of a change-point kernel,⁶² but this is not the aim of this study. It must also be mentioned that the non-linear relationship must be included in the training data; otherwise, the GP-based grey-box models cannot map this relationship correctly.

The linear model (cf. Figure 10(d)) is, by definition, not able to model the non-linear dependency. Therefore, the $R^{2}$ score results in −0.05. Applying local multivariate linear regression is one option to overcome the limitations of the linear regression model. These methods rely on local clusters instead of training a single model on the whole data. However, since the models are formed for each cluster, these models represent another type of black-box method, whereby investigating these is not the scope of this study.

The findings in this section demonstrate the ability of the proposed models to map non-linear relationships and how they can be adjusted for this purpose. However, since there are no suitable damaged states for LUMO from December 2020 to March 2021, it is not appropriate to use the corresponding data for the subsequent investigations.

Damage detection with incomplete data

Three damage cases (cf. Table 1) are investigated to demonstrate the effectiveness of the proposed grey-box data normalisation approach for damage detection for incomplete training data regarding missing temperature ranges, representing insufficient EOV coverage due to climate change. However, as stated in the section ‘Grey-box enhanced data normalisation’, the findings for incomplete training data can be transferred to the case of sparse training data. The data normalisation is combined with the Mahalanobis distance (MD) as a conventional multivariate outlier analysis method in SHM to perform damage detection.⁶³ It is well known that the accuracy of the MD is limited by the variability of the EOVs contained in the training data. Consequently, if the reference state of the MD differs in terms of the EOVs from the current, healthy state, this can result in a false-positive detection. However, precisely for this reason, the MD demonstrates the benefits of using powerful data normalisation methods. The MD is defined as:

{MD}_{i} = \sqrt{{(x_{i} - \bar{x})}^{T} S^{- 1} (x_{i} - \bar{x})} .

(13)

Here, $x_{i}$ is the current data point, $\bar{x}$ is the mean vector of the training data and $S$ is the covariance matrix. In this study, the corresponding values of the four natural frequencies B1-x, B1-y, B2-x and B2-y are used to define the vector $x_{i}$ to enable multivariate outlier detection. To account for so-called outlier masking, the minimum covariance determinant estimator as a robust extension of the MD is used in this study.⁶⁴

For damage detection, the variations of the DSFs due to EOVs are removed by utilising the results of data normalisation via normal-condition alignment. Following Poole et al.,⁶⁵ normal-condition alignment can be employed in the case of so-called domain shifts, for example, due to repair.⁶⁶ Here, a repair state’s first two statistical moments are aligned with the initial healthy state. As a result, an affine transformation and scaling are performed. The procedure to achieve this is as follows:

1. Standardise the data from the initial healthy state using,

z_{s} = \frac{x_{s} - μ_{s, n}}{σ_{s, n}}

(14)

where $μ_{s, n}$ and $σ_{s, n}$ are the mean and standard deviation of the structure under normal conditions, respectively.

2. Align the data from each repair state individually to the initial healthy state using,

z_{t} = (\frac{x_{t} - μ_{t, n}}{σ_{t, n}}) σ_{z_{s}} + μ_{z_{s}},

(15)

where $μ_{t, n}$ and $σ_{t, n}$ are the mean and standard deviation of the repaired structure under normal conditions, respectively.

Rather than estimating the statistical moments from the initially healthy state and repair state, the corresponding values $μ_{s, n}$ , $μ_{s, t}$ , $σ_{s, n}$ and $σ_{s, t}$ obtained from the data normalisation method are incorporated into Equations (14) and (15) in order to account for variations due to EOVs more accurately.

For damage detection, the MD is first calculated using the training set, and a threshold is set by determining the 99% CI based on the Monte Carlo method using 3000 samples.⁶³ During monitoring, the MD of each new data point is calculated. If the MD crosses the threshold, this point is presumed to be novel compared to the data used during training.

State H and DAM6 from Table 1 are used to investigate how the proposed data normalisation approach affects damage detection based on the above-mentioned procedure. The true positive rate (TPR) and false-positive rate (FPR) are used to judge the accuracy. The TPR is calculated as

TPR = \frac{TP}{(TP + FN)},

(16)

where TP is the number of damaged data points classified as such, and FN is the number of incorrectly classified damaged state data points. The FPR is calculated as

FPR = \frac{FP}{(FP + TN)},

(17)

where FP is the number of misclassified normal-condition data points, and TN is the number of correctly classified normal-condition data points.

As already depicted in Figure 4, this damage can be detected solely based on B2-y without further examination. Therefore, the TPR for all considered models is 100%. As stated in the section ‘Grey-box enhanced data normalisation’, grey-box models can be beneficial in the case of incomplete data. Therefore, this section builds upon the previous investigation by examining the impact of incomplete data on the FPR. The results are presented in Figure 11 for the states H and DAM6.

Figure 11.

Evolution of the FPR for states H and DAM6 with incomplete data (a). (b) Shows the results when the moving average of the MD over the past 12 h is used. It should be noted that each data normalisation method is combined with the MD to perform multivariate outlier detection. The resulting FPRs are referred to as GP (pure), GP (kernel), GP (prior mean + kernel) and linear. The MD without data normalisation (referred to as MD) is also used here for comparison.

Figure 11(a) and (b) demonstrate that data normalisation with a purely data-based GP and MD exhibits an unacceptably high FPR when the degree of incompleteness within the training data is high, while the TPR remains at 100%. Therefore, the purely data-based model is not robust to unknown temperatures. Since a change in temperature due to climate change is to be assumed, a traditional machine learning method seems unsuitable for dealing with the effects of climate change. This finding aligns with the results of Figueiredo et al.⁵

Figure 11(a) demonstrates that the MD can achieve a better FPR with sufficiently good EOV coverage compared to other methods. This is because the regression models involve modelling errors when removing the EOVs from the DSF according to Equations (14) and (15), whereas the pure MD does not include additional modelling errors. Conversely, the FPR of the linear model is equivalent to that of the grey-box models. This suggests that the primary influence of the EOVs on LUMO stems from the linear correlation between temperature and natural frequency. However, in the case of good EOV coverage, this model also shows a slightly lower FPR than the two grey-box approaches (cf. Figure 11(a)).

Figure 11(b) shows the result of applying a moving average of the MD over the past 12 h for robustness against identification errors and avoidance of false-positive detections. Therefore, a decision on whether damage has occurred is not made based on a single data point. As shown in Figure 11(b), it becomes evident that applying a moving average to the normalised MD positively affects the results compared to Figure 11(a). This observation suggests that they are more resilient to the effects of climate change. However, with improved data coverage, the purely data-based GP and the MD can match the FPRs of the grey-box approaches. However, without relevant experience, it is challenging to determine when this sufficiently large amount of data is available for a particular use case. In the context of SHM, encountering situations where only limited data on the normal state of the system are available is expected.

Table 4 demonstrates the impact of incomplete training data on the accuracy of damage detection using the proposed models for data normalisation, achieved by restricting the maximum material temperature occurring in the training data to 11°C. Notably, this temperature limit closely corresponds to the mean material temperature of 11.63°C for state H. Moreover, the mean material temperature for state DAM6 is 13.65°C, indicating a significant overlap between the training and test data.

Table 4.

Evaluation of classification performance in the case of using training data with a maximum material temperature of 11°C. $\bar{FPR}$ and $\bar{TPR}$ indicate that the moving average of the Mahalanobis distance over the past 12 h is used.

Model	FPR (%)	TPR (%)	$\bar{FPR}$ (%)	$\bar{TPR}$ (%)
$G P (0, k_{ML})$	15.21	100.00	3.95	100.00
$G P (0, g (k_{P}, k_{ML}))$	14.30	100.00	2.35	100.00
$G P (m_{P}, g (k_{P}, k_{ML}))$	13.08	100.00	1.17	100.00
Linear	14.67	100.00	7.98	100.00
MD	41.74	100.00	40.93	100.00

TPR: true positive rate; FPR: false-positive rate.

From Table 4, it is evident that both grey-box models demonstrate a significant improvement in the FPR when considering the moving average of the MD. The utilisation of grey-box models, combining their white-box and black-box parts, outperforms the models where only a black-box or white-box part is employed. Grey-box models prove particularly valuable when the limited training data still encompass a significant portion of the subsequent test data. In scenarios with unknown EOV regimes, these models predominantly fall back on their white-box part. Therefore, grey-box models generally enhance the performance of the white-box parts using additional data information while maintaining the white-box parts’ accuracy as a lower limit. This observation supports the adoption of grey-box models for data normalisation to better address climate change’s impact.

Damage detection and the repair problem under incomplete training data

Following the results presented in section ‘Damage detection with incomplete data’, where the effectiveness of the grey-box enhanced data normalisation approach in reducing the FPR when utilising the MD in the case of incomplete training data was demonstrated, the applicability of this approach as a long-term SHM method will now be investigated. For this purpose, the period from mid-April to July 2021, including the states listed in Table 1, will be studied, whereby all models are exclusively trained on data from state H. As shown in Table 5, the test data go beyond the observations included in the training data. The incompleteness within the training data thus provides a practical application case for grey-box models. Moreover, the test data’s temperatures are higher, representing climate change’s influence on SHM in this study.

Table 5.

EOVs for state H and without state H.

Enviromental parameter	H	Without H
Temperature (air) [°C]	8.34	16.17
Temperature (material) [°C]	11.63	20.03
Relative humidity [%]	59.65	73.57
Global radiation [W/m²]	206.67	214.82
Max. wind speed [m/s]	5.33	4.91
Direction of wind [°]	200.29	208.37

EOV: environmental and operational variation.

Before employing these, however, the importance of data normalisation methods is demonstrated by using MD without prior data normalisation for damage detection in Figure 12(a).

Figure 12.

Application of the moving average of the pure MD for long-term SHM (a). The FPR and TPR for healthy and damaged states were found to be 12.33% and 66.29%, respectively. Utilisation of a linear model for data normalisation without addressing the repair problem in combination with the moving average of the MD (b). For all healthy and damaged states, the overall FPR is 35.06%, and the TPR is 97.67%.

When damage detection is solely based on the MD, Figure 12(a) shows that damage DAM3 is completely masked and cannot be detected. Furthermore, Figure 12(b) depicts the challenges that arise when repairs are conducted after each instance of damage, as the healthy data after the repair of DAM4 are misclassified as a novelty. While the damage mechanisms of LUMO were designed to be completely reversible, slight variations may occur during repair, leading to changes in the structural dynamics. This study assumes that repairs primarily impact the stiffness properties of LUMO, resulting in shifts in the natural frequencies as a constant offset. As shown in Equation (15), the normal-condition alignment utilises $μ_{s, t}$ and $σ_{s, t}$ obtained from the data normalisation method. Since these models were trained using data from the initial healthy state, their predictions will result in an offset compared to the data from the post-repair state. Therefore, adjusting the models used for data normalisation is crucial to prevent false positives arising from the repair process.

The following procedure is implemented to tackle the repair problem while also minimising the training effort required for data acquisition and computational resources for data normalisation: As explained in Wickramarachchi et al.,⁶⁷ it is presumed that LUMO returns to normal conditions immediately after each repair and for the subsequent 7 days. The daily average offset ${\bar{δ}}_{j}$ between the prediction from the regression model $f_{i}$ used for data normalisation and the identified natural frequency $y_{i}$ is then calculated as follows

{\bar{δ}}_{j} = \frac{1}{n} \sum_{i = 1}^{n} y_{i} - f_{i}, j = 1, \dots, 7 .

(18)

To account for the offset resulting from the repair, a prediction from a regression-based data normalisation method is aligned to the post-repair state by

\hat{f_{i}} = f_{i} + {\bar{δ}}_{j} .

(19)

Figure 13(a) illustrates the impact of correction via the offset ${\bar{δ}}_{j}$ on the prediction of the linear regression model.

Figure 13.

(a) Depicts the alignment of the predictions of the linear regression model for state R2 using the optimal offset ${\bar{δ}}_{1}$ and (b) shows the FPR resulting from the data normalisation approaches combined with the MD for each of the seven offsets ${\bar{δ}}_{j}$ for state R2.

To obtain the optimal offset, the FPR for the first 7 days after the repair is calculated using each offset ${\bar{δ}}_{j}$ and the one for which the FPR is the lowest is chosen as the optimal offset. It is important to emphasise that only a constant offset is determined during this process. As a result, the risk of overfitting is minimal. As demonstrated in the section ‘Grey-box enhanced data normalisation’, applying a moving average positively affects the FPR. Consequently, the FPR resulting from the moving average applied to the MD is considered when choosing the optimal offset. Figure 13(b) exemplifies this procedure for the state R2.

As depicted in Figure 13(b), the optimal offset for obtaining the minimal FPR for state R2 is achieved using 1 day for the data normalisation models except for the purely data-based GP. In the case of the purely data-based GP, using an offset always leads to an increase in the FPR compared to when no offset is applied. This is due to the model’s inability to extrapolate well, as illustrated in Figure 9(a). This makes the correction with an offset ineffective. However, as illustrated in Figure 13(b), a slight increase in the FPR after 1 day becomes apparent for the other models under consideration. This phenomenon may be attributed to unusual EOV states affecting LUMO during this extended period, leading to either an overestimation or underestimation of the offset. The procedure for addressing the repair problem appears to be applicable. However, adapting the regression-based data normalisation models to the post-repair state leaves room for improvement for future research, although it is not the focus of this study.

Using the methodology mentioned above to tackle the repair problem, the results of the subsequent damage detection are presented in Figure 14.

Figure 14.

Results of the damage detection using different regression models for the data normalisation and the MD after applying a moving average of the past 12 h. (a) Shows the results for the purely data-based GP: $G P (0, k_{ML})$ , (b) illustrates the damage detectability using a kernel-based GP: $G P (0, g (k_{P}, k_{ML}))$ , (c) shows the MD in combination with a GP using a prior mean and a combined kernel: $G P (m_{P}, g (k_{P}, k_{ML}))$ and (d) presents the results when applying the linear model for data normalisation and the subsequent damage detection.

Figure 14 illustrates that DAM6 and DAM4 can be reliably detected with all proposed models, and the repair state R1 can be appropriately considered. However, in terms of damage detection, it can be observed that the influence of the damage on the MD decreases with increasing height of the damage position (see Figure 3(b)). This is because the first two bending mode pairs of the lattice tower structure are employed as DSFs. It should be noted that only one mode of each bending mode pair reacts significantly to damage, as the damage is asymmetric due to the activation of one damage mechanism. Regarding the relationship between damage location and its impact on DSFs, the first mode shape (see Ref. 52) indicates that the natural frequencies of the first bending mode pair are less affected by damage as height above the tower base increases. Furthermore, the damage mechanisms are located on the diagonal struts of the structure. With regard to DAM3, these struts are barely stressed for both bending mode pairs, which reduces their influence on the structure’s natural frequencies.

Concerning the data normalisation approaches employed for subsequent damage detection, it can be seen that when utilising the purely data-based GP in Figure 14(a), the threshold is exceeded before DAM3 is activated. This observation appears to be caused by the model’s poor ability to extrapolate to unknown EOV states. Therefore, whether the DAM3 damage was genuinely detected or was coincidentally identified due to insufficient data normalisation using this approach is questionable. This illustrates that rising temperatures, for example, due to climate change, can lead to false positives when using a black-box approach. In contrast, both grey-box models appear to be more capable of capturing the influences of these EOVs. Nevertheless, Figure 14(b) indicates that the grey-box model $G P (0, g (k_{P}, k_{ML}))$ also experiences problems concerning false-positive detections before DAM3 is activated but to a lesser extent. In contrast, the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ depicted in Figure 14(c) as well as the simple linear regression shown in Figure 14(d) can compensate well for such EOV influences. It can reasonably be assumed that the latter two models can detect DAM3 and are therefore able to make more reliable statements in the case of unknown variations in the EOVs caused by climate change than a black-box model. Here, it should be emphasised that the linear model performs very well in comparison. This is to be expected as it is a simple white-box model that represents model that represents the relationships of LUMO exclusively based on prior knowledge. However, as was shown in the section ‘Applicability in case of non-linear dependencies’, such a model immediately reaches its limits if other dependencies occur. A more flexible grey-box model is superior to the pure white-box model as it can also extract relationships from the data.

When looking at the results presented in Figure 14, it is noticeable that the TPRs of $G P (m_{P}, g (k_{P}, k_{ML}))$ and the linear model are not expected to be significantly high. As shown in Figure 14, the damage instance DAM3, in particular, significantly influences the performance of all the models considered. A possible cause for the somewhat limited damage dependability of the damage instance DAM3 is, besides the aforementioned reasons, the increased uncertainty in identifying natural frequencies due to the 2-min intervals, as discussed in the section ‘DSFs and environmental variations’. For this reason, the above analysis is re-examined for natural frequencies identified based on 10-min intervals. Here, the procedure is the same as the procedure for the 2-min intervals. As previously stated, the states R2 and DAM3 were identified as the primary sources of damage detection issues. Consequently, only these states are depicted in Figure 15(d).

Figure 15.

Detectability of DAM3 using 10-min intervals. Again, the different data normalisation approaches and the moving average MD (past 12 h) are considered. The results using the purely data-based GP are shown in (a) $G P (0, k_{ML})$ , (b) illustrates the results when a kernel-based grey-box model is used: $G P (0, g (k_{P}, k_{ML}))$ , (c) depicts the MD combined with a GP that includes a prior mean and a combined kernel as the white-box part: $G P (m_{P}, g (k_{P}, k_{ML}))$ and (d) shows the results utilising the linear regression model.

Figure 15(a) shows that data normalisation using a purely data-based GP again does not work reliably, as the threshold is exceeded before DAM3 has been activated. The two grey-box approaches (cf. Figure 15(b) and (c)) demonstrate the capacity to distinguish between the healthy and damaged states with considerable efficacy. In the case of the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ depicted in Figure 15(c), there is a near-perfect separation between R2 and DAM3. To a somewhat lesser extent, this also holds true for the grey-box model $G P (0, g (k_{P}, k_{ML}))$ shown in Figure 15(b). Compared to the 2-min intervals, the more reliable identification using 10-min intervals positively affects the kernel-based model, as it represents the darker end of the spectrum of grey-box models and is, therefore, more dependent on the available data. From the results of the linear regression model depicted in Figure 15(d), it is also evident that R2 and DAM3 are well separated. The FPRs and TPRs of the considered models using 10-min intervals are presented in Table 6.

Table 6.

Evaluation of classification performance encompassing all healthy and damaged states. $\bar{FPR}$ and $\bar{TPR}$ indicate that the moving average of the MD over the past 12 h is used.

Model	FPR (%)	TPR (%)	$\bar{FPR}$ (%)	$\bar{TPR}$ (%)
$G P (0, k_{ML})$	17.05	92.56	11.13	94.58
$G P (0, g (k_{P}, k_{ML}))$	10.16	90.78	0.50	93.88
$G P (m_{P}, g (k_{P}, k_{ML}))$	7.79	93.81	0.00	97.56
Linear	9.39	91.38	0.00	95.99

TPR: true positive rate; FPR: false-positive rate.

The results from Table 6 indicate that the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ and the linear model now also exhibits high TPRs. It is also worth noting that the application of the moving average to the MD results in an FPR of 0% for both models and the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ achieved a TPR of 97.56%. The kernel-based GP also benefits from the improved data quality due to the reduced uncertainty. However, compared to the models that explicitly utilise the linear relationship between natural frequency and temperature, this approach exhibits somewhat inferior performance regarding the TPR and FPR.

To also judge the accuracy of models individually, Table 7 shows the FPRs and TPRs for each of the three healthy respectively repair states and the three damaged states.

Table 7.

Classification of the correct state, considering the three healthy and the three damaged states separately. $\bar{FPR}$ and $\bar{TPR}$ indicate the moving average of the MD of the past 12 h.

Model	H & DAM6		R1 & DAM4		R2 & DAM3
	$\bar{FPR}$ (%)	$\bar{TPR}$ (%)	$\bar{FPR}$ (%)	$\bar{TPR}$ (%)	$\bar{FPR}$ (%)	$\bar{TPR}$ (%)
$G P (0, k_{ML})$	0.00	100.00	1.13	100.00	21.72	84.22
$G P (0, g (k_{P}, k_{ML}))$	0.00	100.00	1.13	100.00	0.00	82.19
$G P (m_{P}, g (k_{P}, k_{ML}))$	0.00	100.00	0.00	99.92	0.00	92.99
Linear	0.00	100.00	0.00	100.00	0.00	88.34

TPR: true positive rate; FPR: false-positive rate.

Table 7 and Figure 15 illustrate that the FPRs und TPRs of all considered models improve for the damage instance DAM3. Therefore, it can be noted that in the case of the 10-min intervals, the quality of the data, especially the precise identification of natural frequencies, significantly impacted the model’s accuracy. However, it is crucial to emphasise that a trade-off is constantly being made. While longer time intervals result in lower uncertainties, this is accompanied by an increasing violation of the underlying system identification assumption of linear time invariance. Consequently, enhancing the data quality by employing longer time intervals is not straightforward and is structure dependent. While higher evaluation times only slightly violate the assumptions for a lattice tower, the assumptions can be much more severely violated for wind turbines due to changes in the rotor speed, for example.

As noted within the previous sections, temperature variations most likely cause the main variations in the natural frequencies. Therefore, the linear model can explain these variations well based on the more accurate identification results. However, the slightly poorer performance of the linear model compared to the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ indicates that the effects of the maximum wind speed also influence damage detection. As this correlation is not clearly linear, capturing the effects using a grey-box model seems beneficial. Nevertheless, especially when using longer time intervals for the identification, such a model can also result in failing to capture the short-term dynamics due to the maximum wind speed, as discussed by Wernitz et al.⁵²

With all models, it is also noticeable that the FPRs are relatively high if no moving average of the MD is used. One reason is the notable disparity in the EOVs contained in the training set with reference to the states R2 and DAM3, as depicted in Table 8.

Table 8.

Average of the EOVs for state H and state R2 plus DAM3.

Enviromental parameter	H	R2 plus DAM3
Temperature (material) [°C]	11.63	31.95
Max. wind speed [m/s]	5.33	4.36

EOV: environmental and operational variation.

Table 8 highlights a significant discrepancy in the EOV coverage between state H and state R2 with DAM3, particularly concerning material temperature. The findings above show that the purely data-based model is not robust enough to deal with the temperature changes expected from climate change. It is important to note that the grey-box models show an increasing dependency on their white-box components if the EOV coverage in the training data is incomplete. Given that this white-box model describes a linear relationship, the accuracy of the grey-box model converges towards that of the linear model. The extent to which grey-box models depend on their white-box components varies depending on the type of grey-box model utilised.

In terms of the overall performance throughout this study, the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ and the linear regression model can be considered effective data normalisation approach for LUMO, whereby the grey-box model demonstrates superior performance as it strikes a balance between the white-box and black-box components. The white-box component employs a prior mean to account for the linear correlation due to temperature variations, while the black-box component employs a structured kernel, providing flexibility to account for the influence of wind speed. Concerning the question of generalisation to more complex structures (e.g. bridges or wind turbines), a grey-box model from the class $G P (m_{P}, g (k_{P}, k_{ML}))$ , therefore, appears to offer a promising direction for further investigation. The challenge here is to specify the nonlinearities in the grey-box model and to ensure that the assumed prior mean is correct for unseen conditions. For wind turbines, this includes incorporating periodic behaviour due to the position of the nacelle or boundary conditions while simultaneously enabling the extraction of correlations from data to account, for example, for interactions with aerodynamic forces that can lead to structural nonlinearities. Equally significant is the robustness of the model against rising temperatures due to climate change, a factor that must be analysed for more complex structures.

Conclusion and outlook

This work compares two novel data normalisation approaches that utilise grey-box methods to account for variations in natural frequencies resulting from changing EOVs that were not included in the training data. Considering such influences in the context of SHM is crucial, as they can cause significant deviations in the dynamic response of structures and mask damage-induced deviations. This is particularly important in the context of climate change, where temperatures are expected to rise. This study addressed this issue by enhancing regression-based data normalisation with machine learning methods. However, machine learning approaches are limited in making meaningful predictions when the new data contains EOVs that were not included in the training set. With this issue in mind, this study’s unique contribution is the application of two grey-box enhanced data normalisation approaches using GP to guarantee plausible mean predictions in the case of unknown EOV regimes due to climate change, their utilisation for subsequent damage detection, and a comparison with a purely data-based GP. Two approaches were employed to integrate the physically plausible linear dependency between the EOVs and natural frequencies into the GP. First, a linear correlation involving the enviromental parameter wind speed and material temperature was encoded through the kernel of the GP. Second, the linear correlation specific to material temperature was incorporated via the GP’s prior mean, and the wind speed correlation was modelled using a combined kernel.

In the case of incomplete training data in terms of poor EOV coverage, it has been shown that the grey-box models capture the underlying trend better than the black-box counterpart, whereby the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ consistently reverts to the prediction of the white-box part. Compared to the linear regression model, higher $R^{2}$ scores were found when mapping data from known EOV states. Therefore, it can be concluded that using a grey-box model enhances the performance of the white-box part when sufficiently good training data are available; otherwise, it performs similarly to the white-box model. Interestingly, the grey-box models perform worse than a purely data-based GP in the case of sparse training data with good EOV coverage.

Grey-box models are a promising approach for a resilient SHM system that can cope with the impact of climate change. However, this requires good modelling of the temperature effects in the case of unknown EOVs.

The damage detection was conducted using the MD. Here, using a moving average of the normalised MD proved advantageous for damage detection, as opposed to using a single data point. Additionally, the predictions were corrected to address the repair problem using an offset $\bar{δ}$ for each repair state. The grey-box models and the linear regression model were capable of separating the damaged states from the undamaged states. Although the underlying correlations for this study are mainly linear, the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ proved slightly superior compared to the other approaches. Furthermore, it became evident that the accuracy when identifying the natural frequencies significantly impacts the quality of damage detection.

In summary, a grey-box approach, where some dependencies are incorporated as prior knowledge while others are extracted from the data, is beneficial for data normalisation. However, this is only true if the assumed physical mean is correct for unseen conditions. Combining this procedure with the MD for long-term SHM makes it possible to detect the complete removal of one bracing at different locations from the LUMO structure. Here, the grey-box model $G P (m_{P}, g (k_{P}, k_{ML}))$ was more successful than the kernel-based grey-box model. Using grey-box models, therefore, represents an essential step in mapping the variations of DSF due to unknown EOVs. It was demonstrated that enhancing a black-box model with prior knowledge improves the extrapolation capability regarding new EOV regimes of a GP as a typical machine learning tool. This study thus laid the foundation for applying grey-box enhanced data normalisation to existing SHM systems, thereby enabling the reliable handling of unknown EOV due to climate change.

Regarding LUMO, the proposed approaches to grey-box modelling showed varying degrees of success, opening up several opportunities for improvement and further development in future work. As previously stated, the correlation to be captured in this study was mainly linear. Therefore, the potential benefits of grey-box modelling approaches to deal with effects related to climate change have only been partially demonstrated. This opens up the possibility of further investigating the robustness of grey-box models to the effects of climate change in the long-term SHM of more complex structures such as bridges or wind turbines. As non-linear effects are expected for these structures, a key challenge is how prior knowledge about these relationships can be incorporated into the grey-box model. In the case of unknown EOV regimes due to climate change, it should also be investigated how frequently new combinations of EOVs arise and how to account for these. As new EOV clusters are formed that differ significantly from the EOV clusters of the training data, handling them in data normalisation is particularly challenging. This is because homogeneous regions in otherwise heterogeneous data are formed. In this case, local cluster-based methods (e.g., local PCA or local multivariate linear regression) could be preferable to global methods. Of particular research interest is how many points from the neighbourhood of a query point have to be included in the prediction in the case of sparse and incomplete training data. Introducing additional forms of prior knowledge could also be beneficial. A case in point is ensuring that the prediction fulfils certain boundary conditions. Moreover, this study did not consider the potential limitations of the predictive variance of GP. The application of heteroscedastic GPs could be particularly suited to mapping these variables with greater precision, whereby the assumption of an underlying normal distribution of variances could also be relaxed. Applying a damage metric that incorporates both the uncertainties from the prediction of the GP and the uncertainties associated with the identification to obtain a probability-based metric appears attractive in this context. Finally, further investigation is required to ascertain the extent of damage that can be reliably detected using different damage detection methods under the effects of climate change.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors gratefully acknowledge the financial support provided by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Subproject C02, ID 434502799, SFB 1463.

ORCID iD

Sören Möller

References

Worden

Structural health monitoring using pattern recognition. Vienna: Springer Vienna, 2010, pp. 183–246.

Farrar

Doebling

Nix

. Vibration–based structural damage identification. Philos Transact A Math Phys Eng Sci 2001; 359(1778): 131–149.

Farrar

Worden

An introduction to structural health monitoring. Phil Trans R Soc A Math Phys Eng Sci 2007; 365(1851): 303–315.

Farrar

Worden

Structural health monitoring: a machine learning perspective. Chichester: John Wiley & Sons, Ltd, 2012.

Figueiredo

Peres

Moldovan

, et al Impact of climate change on long-term damage detection for structural health monitoring of bridges. Struct Health Monit 2024: 1–19.

Figueiredo

Santos

Moldovan

, et al A roadmap for an integrated assessment approach to the adaptation of concrete bridges to climate change. J Bridge Eng 2023; 28(6): 03123002.

Liberatore

Carman

GP.

Power spectral density analysis for damage identification and location. J Sound Vibr 2004; 274(3): 761–776.

Jonscher

Möller

Liesecke

, et al Heteroscedastic Gaussian processes for data normalisation in probabilistic novelty detection of a wind turbine. Eng Struct 2024; 305: 117786.

Zhou

Figueiredo

Maia

, et al Damage detection in structures using a transmissibility-based Mahalanobis distance. Struct Control Health Monit 2015; 22(10): 1209–1222.

10.

Datteo

Lucà

Busca

, et al Long-time monitoring of the G. Meazza stadium in a pattern recognition prospective. Procedia Eng 2017; 199: 2040–2046.

11.

Rosso

Aloisio

Cirrincione

, et al Subspace features and statistical indicators for neural network-based damage detection. Structures 2023; 56: 104792.

12.

Farrar

Cornwell

Doebling

, et al Structural health monitoring studies of the Alamosa Canyon and I-40 bridges. Los Alamos National Lab. (LANL), Los Alamos, NM (United States), 2000.

13.

Peeters

De Roeck

One-year monitoring of the Z24-bridge: environmental effects versus damage events. Earthquake Eng Struct Dyn 2001; 30(2): 149–171.

14.

Ubertini

Comanducci

Cavalagli

, et al Environmental effects on natural frequencies of the San Pietro bell tower in Perugia, Italy, and their removal for structural performance assessment. Mech Syst Signal Process 2017; 82: 307–322.

15.

Jonscher

Möller

Liesecke

, et al Identification uncertainties of bending modes of an onshore wind turbine for vibration-based monitoring. Struct Control Health Monit 2024; 2024: 3280697.

16.

Weijtjens

Verbelen

Sitter

, et al Foundation structural health monitoring of an offshore wind turbine—a full-scale case study. Struct Health Monit 2016; 15(4): 389–402.

17.

Athanasopoulou

Sousa

Dimova

, et al Thermal design of structures and the changing climate. Luxembourg: Publications Office of the European Union, 2020.

18.

Intergovernmental Panel on Climate Change (IPCC). Climate Change 2021 – The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge: Cambridge University Press, 2021.

19.

Nasr

Björnsson

Honfi

, et al. A review of the potential impacts of climate change on the safety and performance of bridges. Sustainable Resilient Infrastruct 2021; 6(3–4): 192–212.

20.

Mishra

Sadhu

Towards the effect of climate change in structural loads of urban infrastructure: a review. Sustainable Cities Soc 2023; 89: 104352.

21.

Orcesi

O’Connor

Diamantidis

, et al. Investigating the effects of climate change on structural actions. Struct Eng Int 2022; 32(4): 563–576.

22.

Farrar

Sohn

Worden

Data normalization: a key for structural health monitoring. In: Proceedings of the Third International Structural Health Monitoring Workshop, Stanford, California, 2001, pp. 1229–1238. DEStech Publications, Inc., Lancaster, Palo Alto.

23.

Dervilis

Worden

Cross

EJ.

On robust regression analysis as a means of exploring environmental and operational conditions for SHM data. J Sound Vibr 2015; 347: 279–296.

24.

Roberts

Cava

Avendaño-Valencia

LD.

Addressing practicalities in multivariate nonlinear regression for mitigating environmental and operational variations. Struct Health Monit 2023; 22(2): 1237–1255.

25.

Zhou

JM.

Generalization capability of neural network models for temperature-frequency correlation using monitoring data. J Struct Eng 2009; 135(10): 1290–1300.

26.

Rasmussen

Williams

CKI

. Gaussian processes for machine learning. Massachusetts, London, England: The MIT Press, 2005.

27.

Worden

Cross

EJ.

On switching response surface models, with applications to the structural health monitoring of bridges. Mech Syst Signal Process 2018; 98: 139–156.

28.

Lucà

Manzoni

Cigada

, et al A vibration-based approach for health monitoring of tie-rods under uncertain environmental conditions. Mech Syst Signal Process 2022; 167: 108547.

29.

Yan

Kerschen

De Boe

, et al. Structural damage diagnosis under varying environmental conditions—part II: local PCA for non-linear cases. Mech Syst Signal Process 2005; 19(4): 865–880.

30.

Cross

Worden

Chen

Cointegration: a novel approach for the removal of environmental trends in structural health monitoring data. Proc R Soc A Math Phys Eng Sci 2011; 467(2133): 2712–2732.

31.

Figueiredo

Park

Farrar

, et al. Machine learning algorithms for damage detection under operational and environmental variability. Struct Health Monit 2011; 10(6): 559–572.

32.

Houghton

Global warming: the complete briefing. 5th ed. Cambridge: Cambridge University Press, 2015.

33.

Vagnoli

Remenyte-Prescott

Andrews

Railway bridge structural health monitoring and fault detection: state-of-the-art methods and future challenges. Struct Health Monit 2018; 17(4): 971–1007.

34.

Cross

Gibson

Jones

, et al Physics-informed machine learning for structural health monitoring. Cham: Springer International Publishing, 2022, pp. 347–367.

35.

Cross

Rogers

TJ.

Physics-derived covariance functions for machine learning in structural dynamics. IFAC-PapersOnLine 2021; 54(7): 168–173.

36.

Swiler

Gulian

Frankel

, et al A survey of constrained gaussian process regression: approaches and implementation challenges. J Mach Learn Model Comput 2020; 1(2): 119–156.

37.

Cross

Rogers

Pitchforth

, et al A spectrum of physics-informed Gaussian processes for regression in engineering. Data-Centric Eng 2024; 5: e8.

38.

Jones

Rogers

Cross

EJ.

Constraining Gaussian processes for physics-informed acoustic emission mapping. Mech Syst Signal Process 2023; 188: 109984.

39.

Haywood-Alexander

Dervilis

Worden

, et al Structured machine learning tools for modelling characteristics of guided waves. Mech Syst Signal Process 2021; 156: 107628.

40.

Pitchforth

Rogers

Tygesen

, et al Grey-box models for wave loading prediction. Mech Syst Signal Process 2021; 159: 107741.

41.

Möller

Jonscher

Grießmann

, et al Investigations towards physics-informed Gaussian Process regression for the estimation of modal parameters of a lattice tower under environmental conditions. In: Limongelli

Giordano

Quqa

, et al (eds) Experimental vibration analysis for civil engineering structures. Cham: Springer Nature Switzerland, 2023. pp. 401–410.

42.

Zhang

Cross

EJ.

Grey-box modelling via Gaussian process mean functions for mechanical systems. In: Madarshahian

Hemez

(eds). Data science in engineering, vol. 9. Cham: Springer International Publishing, 2022. pp. 161–168.

43.

Mclean

Jones

O’Connell

, et al. Physically meaningful uncertainty quantification in probabilistic wind turbine power curve models as a damage-sensitive feature. Struct Health Monit 2023; 22(6): 3623–3636.

44.

Wernitz

Hofmeister

Jonscher

, et al LUMO - Leibniz universtity test structure for monitoring. Hannover: LUIS, 2021.

45.

Stein

ML.

Interpolation of spatial data. New York: Springer, 1999.

46.

Hensman

Fusi

Lawrence

ND.

Gaussian processes for big data. In: Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, Bellevue, WA, 2013, pp. 282–290. Arlington, Virginia: AUAI Press.

47.

Snelson

Ghahramani

Sparse Gaussian processes using pseudo-inputs. In: Weiss

Schölkopf

Platt

(eds). Advances in neural information processing systems, vol. 18. MIT Press, 2005.

48.

Quinonero-Candela

Rasmussen

CE.

A unifying view of sparse approximate Gaussian process regression. J Mach Learn Res 2005; 6(65): 1939–1959.

49.

Titsias

Variational learning of inducing variables in sparse Gaussian processes. In: van Dyk

Welling

(ed). Proceedings of the Twelth International Conference on Artificial Intelligence and Statistics. vol. 5 of Proceedings of Machine Learning Research. Hilton Clearwater Beach Resort, Clearwater Beach, FL, USA: PMLR; 2009, pp. 567–574.

50.

Matthews

AGdG

van der Wilk

Nickson

, et al GPflow: a Gaussian process library using TensorFlow. J Mach Learn Res 2017; 18(40): 1–6.

51.

Duvenaud

Lloyd

Grosse

, et al Structure discovery in nonparametric regression through compositional kernel search. In: Dasgupta

McAllester

(ed) Proceedings of the 30th International Conference on Machine Learning. Proceedings of Machine Learning Research. Atlanta, Georgia, USA: PMLR; 2013, pp. 1166–1174.

52.

Wernitz

Hofmeister

Jonscher

, et al A new open-database benchmark structure for vibration-based structural health monitoring. Struct Control Health Monit 2022; 29(11): e3077.

53.

Zhang

YC.

Bayesian operational modal analysis: theory, computation, practice. Comput Struct 2013; 126: 3–14.

54.

Jonscher

Hofmeister

Grießmann

, et al Influence of environmental conditions and damage on closely spaced modes. In: Rizzo

Milazzo

(ed.) European workshop on structural health monitoring. Cham: Springer International Publishing, 2023, pp. 902–911.

55.

Wickramarachchi

Gardner

Poole

, et al Damage localisation using disparate damage states via domain adaptation. Data-Centric Eng 2024; 5: e3.

56.

SK.

Uncertainty law in ambient modal identification—Part II: implication and field verification. Mech Syst Signal Process 2014; 48(1): 34–48.

57.

Brownjohn

JMW

Raby

, et al Bayesian operational modal analysis of offshore rock lighthouses: close modes, alignment, symmetry and uncertainty. Mech Syst Signal Process 2019; 133: 106306.

58.

Devriendt

Magalhaes

Weijtjens

, et al Structural health monitoring of offshore wind turbines using automated operational modal analysis. Struct Health Monit 2014; 13(6): 644–659.

59.

Kahrger

Möller

Jonscher

, et al Detectability of structural damage to a lattice tower using eigenfrequencies and Gaussian process regression. Res Rev J Nondestr Test 2024; 2(2): 96–107.

60.

Ramos

Marques

Lourenço

, et al Monitoring historical masonry structures with operational modal analysis: two case studies. Mech Syst Signal Process 2010; 24(5): 1291–1305.

61.

Peeters

Maeck

Roeck

GD.

Vibration-based damage detection in civil engineering: excitation sources and temperature effects. Smart Mater Struct 2001; 10(3): 518.

62.

Pitchforth

Jones

Cross

EJ.

Physically-informed change-point kernels for variable levels of physical knowledge inclusion in Gaussian processes. e-J Nondestructive Test 2024; 29(7): 1–9.

63.

Worden

Sohn

Farrar

CR.

Novelty detection in a changing environment: regression and interpolation approaches. J Sound Vibr 2002; 258(4): 741–761.

64.

Rousseeuw

PJ.

Multivariate estimation with high breakdown point. Math Stat Appl 1985; 8(283–297): 37.

65.

Poole

Gardner

Dervilis

, et al On statistic alignment for domain adaptation in structural health monitoring. Struct Health Monit 2023; 22(3): 1581–1600.

66.

Gardner

Bull

Dervilis

, et al Overcoming the problem of repair in structural health monitoring: metric-informed transfer learning. J Sound Vibr 2021; 510: 116245.

67.

Wickramarachchi

Poole

Hübler

, et al Statistical alignment in transfer learning to address the repair problem: an experimental case study. Hannover: Institutionelles Repositorium der Leibniz Universität Hannover, 2023.

Comparing Gaussian process enhanced grey-box approaches to detect damage in unknown environmental conditions due to climate change

Abstract

Keywords

Introduction

GP regression

Incorporating prior knowledge

Case study

DSFs and environmental variations

Grey-box enhanced data normalisation

Applicability in case of non-linear dependencies

Damage detection with incomplete data

Damage detection and the repair problem under incomplete training data

Conclusion and outlook

Footnotes

Declaration of conflicting interests

Funding

ORCID iD

References