Sage Journals: Discover world-class research

Abstract

Data-driven approaches have been applied to structural health assessment due to their capability to address complex challenges and achieve high-precision and rapid defect detection. This article presents a deep learning-based generalization framework for guided wave (GW)-based damage identification in diverse isotropic beam-like structures with various cross-sectional geometries and material properties. A multiscale convolution and squeeze-and-excitation attention-assisted bidirectional hybrid (MC-SE-BiH) network is proposed to generalize the damage identification. A higher-order beam model is employed to enhance computational efficiency and simulation accuracy of modelling GW propagation in various damage scenarios. A series of numerical case studies is carried out to demonstrate the generalization ability and robustness of the proposed MC-SE-BiH framework in damage identification using GW. The effectiveness and practicality are further validated using experimental measurements obtained from isotropic beams with different material properties and cross-sectional configurations. The results demonstrate that the proposed MC-SE-BiH framework can identify damage across diverse beam-like structures.

Keywords

Damage identification data-driven guided waves multiscale convolution squeeze-and-excitation attention bidirectional hybrid network

Introduction

Damage detection and identification is one of essential components in structural health monitoring (SHM). Reliable damage detection and identification techniques play a critical role in ensuring structural integrity, reliability, and serviceability. Exploring robust early-stage damage detection techniques has garnered significant attention, especially for high-value assets. Over the past decades, numerous damage detection techniques have been developed to ensure structural integrity, detecting potential defects and/or identifying risk caused by structural failure. These techniques available, generally include simple visual inspection, eddy current, conventional A- or C-scan ultrasonic testing, acoustic emission, vibration-based approaches,¹ and ultrasonic-guided wave (GW)-based approaches.² Compared with conventional damage detection techniques, those based on vibration and GW are the predominant techniques. Vibration-based techniques are effective for global structural monitoring by leveraging modal parameters or dynamic responses, but they may be insensitive minor local defects that are crucial for structural safety. On the contrary, GW-based damage detection technique has been proven to be one of the most promising techniques for early-state damage detection due to their high sensitivity to local and minor defects, ability to travel relatively long distances and capability of inspecting hard-to-reach regions of the structures.^3–5

Non-model-based and model-based approaches for damage detection using GWs

GW-based damage detection can be categorized into non-model-based and model-based approaches. Non-model-based GW approaches detect damage by analysing changes in specific features of acquired signals compared to a reference GW signal measured in health or prior state. However, obtaining an accurate baseline signal remains a significant challenge in practical situations.⁶ Additionally, non-model-based GW approaches often require selecting an appropriate wave mode based on the correct dispersion characteristics of each target structure for effective evaluation. Model-based GW approaches utilize a model to characterize or establish the relationship between GW responses and structural damage. These model-based approaches have become increasingly prevalent in damage detection due to their ability to achieve baseline-free characteristics and provide quantitative damage information.

Based on the nature of numerical models, model-based GW approaches can be categorized into physics-based models and data-driven models.⁷ Specifically, damage identification of physics-based models is achieved using physical-meaning model such as analytical solutions and finite element (FE) models. The model’s ability to accurately simulate GW propagation and its interaction with damage is crucial in these physics-based GW approaches. While physics-based models offer valuable insights and a high degree of interpretability, they are often constrained by numerous assumptions, high computational costs, and reduced accuracy in complex damage scenarios. Analytical solutions require simplified assumptions before application, which are effective only when conditions are straightforward and the damage can be represented numerically or analytically.^8,9 Although FE simulation-based optimization methods can iterate damage properties by taking advantage of the traditional FE method’s ability to model precise GW propagation with appropriately small element sizes, the computational cost is significant and often impractical, especially for more complex or large-scale structures.¹⁰ To address the computational burden, a variety of modelling approaches have been developed.¹¹ For example, frequency-domain spectral FE (SFE) method achieves computationally efficient compared with the conventional FE method, as it directly employs spectral analysis techniques to investigate structural behaviors at discrete frequencies by solving the partial differential governing equations in the frequency domain.¹² Alternatively, time-domain SFE method utilizes Gauss–Lobatto–Legendre (GLL) nodes together with high-order approximation polynomials to reduce the number of elements in FE simulation to enhance computational efficiency, while maintaining accuracy.¹³ These breakthroughs in FE simulation have made significant progress for model-based approaches. However, a considerable computational cost is still required due to the necessity of iterative simulation, which may limit its practicality for complex SHM applications. Furthermore, the adaptability of this method is limited, as its applicability is confined to a well-defined structural configuration, which also highlights the necessity of obtaining the structure-related parameters prior to implementation.¹⁴

To summarize, these limitations make it more challenging to quantitatively characterize the physical behaviors of GW, let alone generalizing damage detection to different structures. In contrast, data-driven models can directly capture the intricate patterns and nonlinear relationships inherent in the data itself, without the need for explicit physical modelling.¹⁵ Due to the ability to bypass the need for complex prior knowledge (e.g., frequency dispersion and multimode characteristics in GW) and their flexibility in addressing complex problems, data-driven approaches have attracted considerable interest from researchers.^16,17

Data-driven approaches in damage detectionusing GWs

Data-driven approaches utilize machine learning (ML) or deep learning (DL) to obtain mapping relationships between data collected from structures and object variables (e.g., damage characteristics or performance indicators), enabling structural health diagnosis and prediction. These approaches provide an intelligent platform to implement online damage detection in structures. Some ML model-assisted frameworks for GW-based damage identification have been successfully developed and applied. To address a complex challenge of modal analysis-based approaches, Mahajan and Banerjee¹⁸ extracted damage features from GW responses to train a ML-based framework for identifying damage type, location, and severity on railheads. Lyu et al.¹⁹ proposed a data-informed framework for GW-based damage detection using a support vector machine (SVM), where wave energy distribution vectors derived from captured GW signals served as input features for the SVM classifier. Khan et al.²⁰ investigated the role of ML in intelligent structural assessment, emphasizing that the quality and effectiveness of features fed into ML model is crucial. ML has demonstrated its capability in assisting wave analysis and improving damage detection efficiency. However, its effectiveness is somewhat limited by the need for manual feature selection.

In contrary, DL techniques have recently gained significant attention due to their ability to perform feature extraction without requiring operator intervention or extensive domain expertise to design an effective feature extraction.²¹ Specifically, DL models can process raw data and automatically extract hierarchical feature representations. In GW-based damage detection applications, convolutional neural networks (CNNs) have been extensively and successfully used for damage detection. Liu and Zhang²² framed the damage detection problem of notch-type crack detection as an image classification task utilizing a CNN framework. Gonzalez-Jimenez et al.²³ proposed a CNN-based regression algorithm for damage localization in composite structures, where GW signals, generated through a pseudo-damage methodology, were transformed into image representations. Beyond image-based input, the direct utilization of entire GW signal presents an alternative avenue for training DL models.²⁴ Most existing CNN-based GW methods utilize convolutional operations within the CNN to extract features from GW responses. Notably, the receptive field size in these convolutional operation significantly impacts the quality of feature extraction.²⁵ A limited receptive field may yield features with detailed information but restricts them to local characteristics, potentially leading to overfitting. Conversely, an excessively large receptive field can introduce redundant of irrelevant information, resulting in underfitting.

While improved CNNs are capable of learning higher-order damage-related features from GW responses, they may not fully capture the inherent temporal information within the time-series data.²⁶ For time-series data, recurrent neural networks (RNNs), such as long short-term memory (LSTM) networks, have been widely employed across various fields due to their ability to model temporal dependencies in sequential data. Rautela and Gopalakrishnan²⁷ developed a DL-assisted framework utilizing GW responses, generated by a reduced-order SFE model, as input to train a LSTM for damage identification. Rezazadeh et al.²⁸ employed a conditional generative adversarial network to generate GW signals of distinct fault categories, subsequently training a LSTM model for damage classification. However, GW responses encompass both temporal and spatial information. The comprehensive extraction and learning of relevant spatiotemporal features related to damage is crucial for data-driven methodologies. Consequently, the development and integration of DL algorithms has gained increasing prominence in damage detection.^29–31 Specifically, RNNs and CNNs are frequently integrated to leverage their respective strengths, enabling the simultaneously consideration and learning of spatial and temporal information from input data, thus enhancing the prediction performance of the developed algorithm.^32,33 While such integrated algorithms can capture spatiotemporal information from GW responses, there are still limitations such as the former algorithm’s output containing insufficient damage-related information and/or the latter algorithm has limited feature extraction capabilities. Notably, recent advances in DL have progressively improved the capacity of data-driven methodologies to extract and interpret meaningful information from data.³⁴

Generalization in GW-based damage detection

Although the numerous methodologies have been proposed in the literature, GW-based data-driven approaches for damage identification often yield configuration-specific results, with many studies focusing on single case scenarios. Specifically, a well-trained model is typically optimized for a structure with fixed material properties and dimensions, thus limiting its applicability to structures with different attributes.⁷ Furthermore, the majority of researches on the generalization problem of data-driven approaches focuses on robustness investigations related to modeling errors or environment and measurement uncertainties,²⁷ while a significant gap remains in exploring the application of a universal or generalized data-driven approach for damage detection across different structures. GWs are highly sensitive not only to structural damage but also to the properties of material and the shape of the structure itself. This means that even the same type of damage can produce different reflected GW signals in different structures, depending on their shapes, material, and environmental conditions. Additionally, because structures vary in geometry and material, the relationship between damage and GW signals is complex and nonlinear. As a result, data-driven models need advanced nonlinear modelling capabilities to interpret these differences accurately.

Conventional DL techniques often fail to create a single model that works well for many different structures at once. Instead, specialized models may be needed for different types of structures. Most DL-based methods in damage detection also require large datasets, particularly for regression tasks. However, a major challenge in damage identification is the lack of sufficient training data for DL models. When limited datasets are available for a specific structure, DL-based models struggle to learn the accurate underlying relationship between input and corresponding output, for example, GW responses and damage parameters. Consequently, these models are susceptible to generating in inaccurate damage diagnosis results.

Transfer learning has recently demonstrated its effectiveness in reducing the reliance on extensive datasets from target structures,³⁵ leveraging knowledge from a source domain to address related target domain problems.^36,37 However, pretraining a foundational model for the original structure still typically necessitates substantial datasets. Furthermore, adapting the model to multiple diverse structures requires fine-tuning for each target, increasing application complexity when dealing with a range of structures. Moreover, the fine-tuned model may exhibit diminished performance on the original pretraining structure and potentially demonstrate instability when confronted with new structures significantly differing in attributes, or even face domain mismatch risk.³⁸ In summary, a more direct and pragmatic data-driven approach is needed to develop a generalization framework for damage identification across diverse structures, which constitute the primary objective of this study. Leveraging advancements in DL, the effective integration and development of various DL techniques can enhance the capacity to extract and learn complex mapping relationships between high-dimensional nonlinear data and objective variables.³⁹ The deeper insights offered by developed DL techniques hold significant potential for addressing the challenge of generalizing damage detection in a series of structures. Generally, data-driven models are inherently black-box models due to the lack of physical knowledge embedding. Consequently, to improve model’s credibility, a comprehensive evaluation of its performance is particularly important.

Beam-type elements play critical roles in real-world applications, such as engineering truss structures, aircraft wing or aerospace/automation frames, and steel girders. The signal acquisition paths in beam-like structures may be extremely limited, unlike plate structures, which can be collected signals containing sufficient damage information by arranging sensors along different paths. Therefore, this article proposes a noveldata-driven approach, multiscale convolution, and squeeze-and-excitation attention-assisted bidirectional hybrid (MC-SE-BiH) network, for generalized damage detection across different beam-like structures. To support the construction of damage detection scenarios from different structures, a validated higher-order beam model (HOBM) that achieves satisfactory accuracy while maintaining computational efficiency,⁴⁰ enables reliable simulation of GW responses across various beam-like structures. Concurrently, the sample size for each HOBM with a specific configuration is intentionally limited and relatively small, rendering DL-based methods unable to learn an accurate mapping relationship between GW responses and defects. As an alternative, this study investigates whether GW responses from similar or different structures can contribute or augment the learning process for the proposed generalization framework when training datasets from a configuration-specific structure are insufficient or limited. To assess the applicability of the developed framework for detecting damage in a broader range of beam-like structures, material uncertainties are incorporated into case studies by modulating material properties of HOBM. Furthermore, data augmentation techniques are employed to enhance data diversity for robust analysis. The effectiveness, practicality, and accuracy of the proposed generalization framework are evaluated using experimental data from various beam-like structures. In the present work, the generalization ability of the proposed approach is carefully validated across various damage scenarios, where the outputs have been thoroughly validated against numerical and experimental data, showing stable and reliable performance within the scope of this study. Therefore, this also brings confidence that the proposed data-driven model can make reliable predictions. After completing the primary research objectives, this work also explores and discusses damage detection for unseen structures, attempting to further examine its extreme performance.

The article is organized as follows: the second section presents the methodology. The third section details dataset generation and preparation for training. The fourth section presents case studies of diverse beam-like structures under varying uncertainties to comprehensively demonstrate the generalization ability and robustness of the proposed framework. Subsequently, the fifth section presents an experiment to evaluate the effectiveness and practicality of the proposed generalization framework. Finally, conclusions are provided in the sixth section.

Proposed data-driven approach for damage identification

This research proposes a data-driven approach, employing a MC-SE-BiH network, to achieve generalization of damage identification across diverse structures. Figure 1 shows the architecture of the proposed MC-SE-BiH. The general description of each key module in Figure 1 is provided as follows:

(1) The MC module employs three convolution kernels of different sizes and designs to extract feature information from raw GW time domain signals at various scales and then fuses these multiscale features. This module enables capturing local subtle variations of GW responses and global patterns across various perception domains, thereby enriching the features.

(2) The SE attention block is integrated to adaptively recalibrate channelwise feature responses. By emphasizing informative features and suppressing interfering ones, this mechanism directs the model’s focus toward critical features.

(3) The BiH network module processes the sequential data by capturing temporal dependencies in both forward and backward directions, enabling richer contextual feature extraction for damage identification. In the final regression stage, a multilayer perceptron (MLP) is used to learn complex relationships within the high-dimensional feature space through fully connected (FC) layers, while rectified linear unit (ReLU) layer introduces essential nonlinearities for continuous value prediction. The use of a Huber loss function further optimizes the performance.

Figure 1.

Schematic overview of the proposed MC-SE-BiH network architecture. MC-SE-BiH: Multiscale convolution and squeeze-and-excitation attention-assisted bidirectional hybrid.

MC module

Inspiring by the MC neural network,⁴¹ this study proposes a MC module that employs multiple convolutional filters with varying kernel sizes to capture multiscale features from input data. By processing information across different receptive fields, the MC module effectively integrates the extracted information from each receptive field to obtain higher-level global information. By merging the features across multiple scales, MC module has the potential to get more comprehensive features at various scales, improving the availability of input characteristics.

The detailed architecture of the MC module is illustrated in Figure 2. The GW time domain signals are input into convolution layers of different scales across three branches. To achieve an optimal balance between feature extraction capability and computational efficiency,⁴² we employ convolutional kernels of varying sizes (3 × 1, 7 × 1 and 11 × 1) to extract features across different receptive field scales. Specifically, random search within empirically bounded ranges were performed across various candidate combinations (kernel sizes and filter numbers) to efficiently determine the configuration that balanced accuracy and computational cost.⁴³ The search range for kernel size and filter number in the MC module is shown in Table 1. The small kernels (e.g., 3 × 1) are effective for fine-grained local patterns, while larger kernels (e.g., 11 × 1) help in modelling broader information. The intermediate kernels (e.g., 7 × 1) serve as a bridge between both receptive fields. The final configuration reflects the most appropriate trade-off observed in the trial experiments, while enabling the model to capture multiscale features effectively. Following each convolutional operation, a batch normalization (BN) layer,⁴⁴ is used to accelerate the convergence of the operation by reducing internal covariates. To enhance the model’s ability to capture the details and complex patterns in the input data, nonlinearity is introduced through the Gaussian error linear unit (GELU) activation function.⁴⁵ The GELU activation enables smooth integration of subsequent features from different branches. The mathematical formulation of GELU is given by

GELU (x) = xP (X \leq x) = x Φ (x) = \frac{x}{2} [1 + \erf (x / \sqrt{2})]

(1)

where x represents the pre-activation value, $Φ (x)$ denotes the cumulative distribution function and $\erf (\cdot)$ is the Gaussian error function, which offers a probabilistic interpretation.

Figure 2.

The proposed multiscale convolutional structure.

Table 1.

The search range for kernel size and filter number in the MC module.

Branch	Kernel sizerange	Filter numberrange	Samplingdistribution	Example candidates(randomly selected)
Small scale	3, 5	[32–128]	Uniform discrete/log-uniformwith base 2	3, 64
Medium scale	7, 9	[16–64]		9, 32
Large scale	11, 13, 15	[8–32]		11, 16

MC: Multiscale Convolution.

The features obtained from different scales are concatenated, and a 1 × 1 convolution⁴⁶ is applied to aggregate and re-extract these features. Following the multiscale convolution operation, a multilayer convolution process is used to further distill, aggregate, and enhance the fused features across various scales, facilitating deeper integration.⁴⁷ The subsequent multilayer convolution blocks consist of basic CNN modules, including convolutional blocks (ConvBlock) and pooling layers. Each ConvBlock includes a one-dimensional (1D) convolutional layer, a BN layer and a ReLU activation layer.⁴⁸

SE attention

The SE attention mechanism is a channel attention mechanism designed to enhances network representation by explicitly modelling interdependencies between feature channels. It recalibrates channelwise feature responses by assessing the importance of each channel within the feature maps.⁴⁹ The squeeze operation aggregates feature response from the input feature maps to capture global information, such as GW signal amplitude or reflection wave location. The aggregation is achieved through global average pooling operation to generate channel-wise statistics. The output of the multiscale convolution module is defined as $X \in R^{C \times W}$ , where C represents the number of channels and L represents the length of the feature map. Formally, the output $z_{c}$ of the squeeze operation for the c th channel is calculated by

z_{c} = F_{sq} (X_{c}) = \frac{1}{L} \sum_{i = 1}^{L} X_{c} (i)

(2)

where $X_{c}$ denotes the feature map of the c th channel.

After that, the excitation operation utilizes the channelwise statistics from the squeeze operation to fully capture the channel dependencies. The squeezed output $z \in R^{C}$ is passed through a gating mechanism consisting of two FC layers with a ReLU non-linearity in between and a final sigmoid activation to generate the channel weights $s \in R^{C}$ :

s = σ (W_{2} (δ (W_{1} z)))

(3)

where $W_{1} \in R^{\frac{C}{r} \times C}$ and $W_{2} \in R^{C \times \frac{C}{r}}$ represent weights of the two FC layers. One FC layer with $W_{1}$ reduces the dimensionality using a reduction ratio r, and the other FC layer with $W_{2}$ restores the dimensionality. δ denotes the ReLU activation function. The sigmoid activation σ scales the channel weights between 0 and 1.

In the final step, the recalibrated feature map $\tilde{X}$ is obtained by rescaling the original feature map X using the learned channel weights s:

{\tilde{X}}_{c} = s_{c} \cdot X_{c}

(4)

where $s_{c}$ is the weight for the c th channel obtained from the excitation operation. This recalibration enables the model to selectively enhance informative features and suppress less relevant ones, thereby improving the overall feature representation. The diagram and detailed structural configuration of the SE attention module in this study are shown in Figure 3.

Figure 3.

The diagram and detailed layer configuration of the SE attention module in this study. SE: Squeeze-and-Excitation.

The channel attention module is computationally efficient and can be easily integrated into the proposed architecture. By enabling adaptive channel recalibration, it delivers substantial performance improvements. The module processes multichannel feature maps generated from GW signals through convolution operations. Subsequently, the SE attention module applies “squeeze” and “excitation” operations to dynamically weight channelwise feature contributions. This adaptive weighting mechanism enhances the model’s ability to prioritize more important features relevant to the target task, such as damage identification in this study.

Hybrid network of Bi-LSTM and Bi-GRU network

In time domain signal analysis, such as GW time domain responses in this study, sequential data often exhibits complex temporal patterns. Accurate interpretation and extraction of damage-related information from GW responses are crucial for achieving generalized damage identification across different structures. To address this, the BiH network can simultaneously analyse both preceding and succeeding time steps, enhancing its ability to capture intricate temporal relationships that may be overlooked in single-directional models. A parallel model combining bidirectional LSTM (Bi-LSTM) and bidirectional-gated recurrent unit (Bi-GRU) provides a robust solution for time-series data analysis. Specifically, Bi-LSTM has strong ability in capturing long-term dependencies within time series, while Bi-GRU offers computational efficiency and effectively captures short-term dependence features.²⁶ This hybrid architecture leverages both the long-term memory capabilities of Bi-LSTM and the sensitivity of Bi-GRU to instantaneous changes. By effectively integrating the strengths of both Bi-LSTM and Bi-GRU, the BiH network achieves a more comprehensive representation of GW characteristics. This hybrid model leverages their complementary capabilities to enhance the learning of signal patterns. Furthermore, this network excels at extracting and representing multilevel feature information, improves adaptability to signals generated from diverse structures during GW-based damage detection, thereby enhancing the generalization capability of the proposed framework.

Bidirectional long short-term memory

LSTM unit was specifically designed to capture long-range dependencies in sequence data. Through the gating mechanism architecture, LSTM effectively address the vanishing and exploding gradient problems inherent in traditional RNNs.⁵⁰ The bidirectional long-term memory network⁵¹ is a variant of the LSTM model that learns complex temporal dependencies in sequential data by processing it in both forward (past to future) and backward (future to past) directions. This architecture enables the model to capture both past and future information, which is crucial for tasks requiring deep interpretation of time sequences. In a Bi-LSTM architecture, the input sequence is processed by two distinct LSTM units: one in the forward direction and another in the backward direction. The detailed structure of each LSTM is illustrated in Figure 4(a). Each LSTM unit comprises three main gates: the forget gate, the input gate, and the output gate.⁵²

Figure 4.

(a) Detailed structure of an LSTM unit and (b) illustration of Bi-LSTM model, where $f_{t}$ is the forget gate, $i_{t}$ is the input gate, $o_{t}$ is the output gate, ${\tilde{C}}_{t}$ is the candidate cell state, $C_{t}$ is the cell state, $C_{t - 1}$ is the previous cell state, $h_{t - 1}$ is the previous hidden state, $h_{t}$ is the final hidden state, σ is the sigmoid activation function, $\tanh$ represents the hyperbolic tangent activation, along with $x_{t}$ and $x_{t - 1}$ denote the input at the current and previous time step, respectively. LSTM: long short-term memory; Bi-LSTM: bidirectional long short-term memory.

Bi-LSTM extends the fundamental LSTM architecture by combining two independent LSTMs, thus overcoming the limitation of predicting future outputs solely based on the time information of the previous moment. Specifically, for an input sequence $X = {x_{1}, x_{2}, \dots, x_{T}}$ , where $x_{t}$ represents the input at the time step t ( $t = 1, 2, \dots, T$ ), and T denotes the total length of the sequence. The forward LSTM unit processes the input in the inherent order of the sequence from the time step $t = 1$ to $t = T$ , generating a set of forward hidden states ${h_{1}^{lf}, h_{2}^{lf}, \dots, h_{T}^{lf}}$ . Simultaneously, the backward LSTM unit processes the input in reverse order of the sequence from $t = T$ to $t = 1$ , generating backward hidden states ${h_{T}^{lb}, h_{T - 1}^{lb}, \dots, h_{1}^{lb}}$ . The final output of Bi-LSTM at each time step t is a concatenation of the forward and backward hidden states:

h_{t}^{bilstm} = F (h_{t}^{lf}, h_{t}^{lb})

(5)

where $h_{t}^{lf}$ and $h_{t}^{lb}$ represent the forward and backward hidden states, respectively, F represents the combining function for the outputs of the two directions (e.g., summation function), and $h_{t}^{bilstm}$ is the concatenated hidden state. This combined output enables Bi-LSTM to leverage bidirectional information from input sequence, enhancing its ability to capture significant features for complex tasks. The architecture of the Bi-LSTM model is shown in Figure 4(b).

Bidirectional gated recurrent unit

The Bi-GRU is a variant of the RNN architecture, specifically designed to capture both past and future dependencies in sequential data.⁵³ It achieves this by introducing bidirectional processing to the GRU,⁵⁴ where the input sequence is traversed in both forward and backward directions. The bidirectional processing enables a more complete representation of the temporal dependencies within the data, particularly when information (e.g., damage-related information) from both earlier and later time steps is crucial for accurate predictions. Specifically, the GRU unit operates based on two main gating mechanisms: the reset gate and the update gate.⁵⁵ Compared with LSTM, GRU is more computationally efficient due to its simpler architecture and fewer parameters. Furthermore, the gating mechanisms of GRU enhance its ability to capture short-term dependencies in sequential data.²⁶ The reset gate and update gate regulate the flow of information through the network, controlling the retention of past information and the integration of new information. Consequently, the final hidden state maintains an effective balance between the two aspects. The detailed structure of the GRU unit is shown in the Figure 5(a).

Figure 5.

(a) Detailed structure of a GRU unit and (b) illustration of a Bi-GRU model, where $r_{t}$ is the reset gate, $z_{t}$ is the update gate and ${\tilde{h}}_{t}$ is the candidate hidden state. GRU: gated recurrent unit; Bi-GRU: bidirectional gated recurrent unit.

Similar to Bi-LSTM, Bi-GRU processes an input from forward and backward directions using two GRU units. Formally, the forward GRU and the backward GRU process the input simultaneously based on the inherent order and the reversed order of the sequence. At each time step t, the forward and backward hidden states are concatenated to form the final hidden state:

h_{t}^{bigru} = F (h_{t}^{gf}, h_{t}^{gb})

(6)

where $h_{t}^{gf}$ and $h_{t}^{gb}$ represent the forward and backward hidden states, respectively, F represents the combining method of the outputs from the two directions (e.g., summation function) and $h_{t}^{bigru}$ is the concatenated hidden state. The architecture of Bi-GRU model is shown in Figure 5(b).

Loss function and evaluation metrics for damage identification

Huber loss function

Huber loss function,⁵⁶ is used as the loss function in the proposed approach for damage detection regression task, which is motivated by the need to balance sensitivity to small errors with robustness against outliers. This function integrates two common error assessment methods: mean square error (MSE) and mean absolute error (MAE).⁵⁷ When the residual is small, it optimizes the model like MSE (quadratic loss), ensuring that minor errors are effectively processed. On the contrary, when the residual is large, it reduces sensitivity to outliers like MAE (linear loss), thus preventing these abnormal data from excessively affecting the model training. The Huber loss function is calculated by the following equation:

L_{δ} (y, \hat{y}) = {\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} \frac{1}{2} {(y_{i} - {\hat{y}}_{i})}^{2} if | y_{i} - {\hat{y}}_{i} | \leq δ \\ \frac{1}{n} \sum_{i = 1}^{n} δ (| y_{i} - {\hat{y}}_{i} | - \frac{1}{2} δ) if | y_{i} - {\hat{y}}_{i} | > δ \end{matrix}

(7)

where δ is the threshold, which determines the transition from quadratic to linear error, and its value can be selected by cross-validation.⁵⁸ $y_{i} - {\hat{y}}_{i}$ represents the residual between the actual and predicted value.

Evaluation metrics

For data-driven approaches, it is particularly important to thoroughly and effectively investigate the model performance. To comprehensively evaluate the generalization ability and robustness of the proposed framework, the root mean squared error (RMSE), the coefficient of determination (R²),⁵⁹ and the mean absolute percentage error (MAPE)²⁷ are used as evaluation metrics. These matrices are calculated using the following equations:

RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(8)

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - {\bar{y}}_{i})}^{2}}

(9)

MAPE = \frac{1}{n} \sum_{i = 1}^{n} | \frac{y_{i} - {\hat{y}}_{i}}{y_{i}} | \times 100 %

(10)

where ${\hat{y}}_{i}$ is the predicted value, $y_{i}$ is the true value, and ${\bar{y}}_{i}$ is the arithmetic mean of $y_{i}$ and n is the total number of samples. RMSE is the arithmetic square root of MSE, which evaluates the degree of deviation between the predicted and true damage parameters. The smaller the value of RMSE, the higher the prediction accuracy of the model. R² is used to evaluate the fitting effect of the regression model. The closer R² is to 1, the better the model fits the data, which provide a universal measure of model fit that is not affected by the magnitude or units of the data. MAPE represents the mean relative error percentage of the sample, reflecting the relative difference between the predicted and the actual value. The smaller the MAPE, the higher the model prediction accuracy.

Training and structure details of the proposed approach

The proposed MC-SE-BiH is trained using adaptive moment estimation (Adam),⁶⁰ a variant of the stochastic gradient descent algorithm. Adam is well-suited for large datasets and high-dimensional space of ML problems. It adaptively adjusts learning rates for each parameter to enhance convergence and stability. The initial learning rate was set to 0.0001 and reduced by a factor of 0.1 (10% of its previous value) every 50 training epochs. This strategy of piecewise learning rate facilitates rapid convergence during initial training phase while maintaining high validation accuracy in the final stage. The model batch size was set to 16 and the maximum training iterations was set to 200 epochs. Prior to training, the entire dataset was randomly shuffled and arranged into training and evaluation sets to prevent training bias caused by uneven sample distribution or sequence correlation. In addition, dynamic shuffling was applied at the beginning of each epoch to introduce stochasticity in sample presentation of dataset during training, thereby improving model generalization and reducing overfitting.

To implement the feasibility investigation of the methodology, this article adopted both numerical and experimental studies to fully examine the performance of the framework for generalizing damage identification. A series of isotropic beam models with diverse cross-section geometries and material properties were used for comprehensive assessment. The objective of the proposed framework is to accurately identify the location and severity of damage in diverse beam-like structures. Initially, a number of damage cases are generated using numerical model descripted in next Section 3. To assess robustness of the proposed model, these datasets were employed for training, validating, and testing, with an 80%: 10%: 10% split, across various scenarios, which is a cross-validation method widely used in DL-based SHM investigations. After that, experiments were conducted, and the experimentally measured data were used to further evaluate the ability and accuracy of the trained framework. The input fed into MC-SE-BiH is the raw GW time-domain signals, which contain 4000 amplitude points. The regression outputs, damage location and depth percentage, are simultaneously characterized by this framework. The network configuration details of the proposed MC-SE-BiH are presented in Table 2. The model was trained using an Nvidia T1000 graphics processing unit (GPU) (Santa Clara, CA, USA). Notably, computational demand may become issues when scaling to larger datasets. Although training may need more time, the trained model can be used for instantaneous damage identification. Therefore, this diagnostic model that can instantly exert detection capabilities will provide real-time health status in actual monitoring. In addition, computational burden in training can be reduced by using high performance computing, parallel computation, parallel GPU, which is common in computer science field. In the future, more powerful hardware platforms can be adapted to alleviate computing cost issues during training.

Table 2.

The network configuration details of the proposed MC-SE-BiH.

Module	Layer	Main parameter	Value
MC module	Convolution branch 1	Kernel size, filters	3, 64
	BN	–
	GELU
	Convolution branch 2	Kernel size, filters	7, 32
	BN	–
	GELU
	Convolution branch 3	Kernel size, filters	11, 16
	BN	–
	GELU
	Concatenate
	1 × 1 convolution	Kernel size, filters	1, 32
	BN	–
	GELU
	ConvBlock 1	Kernel size, filters	3, 64
	Max pooling 1	Pool size, stride	2, 2
	ConvBlock 2	Kernel size, filters	3, 128
	Max pooling 2	Pool size, stride	2, 2
	ConvBlock 3	Kernel size, filters	3, 256
	Max pooling 3	Pool size, stride	2, 2
SE attention module	Global average pooling	–
	FC layer	Hidden units	64
	ReLU	–
	FC layer	Hidden units	256
	Sigmoid	–
	Multiplication
BiH module	Flatten	–
	Bi-LSTM 1 and Bi-GRU 1	Hidden units	256, 256
	Concatenate	–
	Bi-LSTM 2 and Bi-GRU 2	Hidden units	128, 128
	Concatenate	–
	Bi-LSTM 3 and Bi-GRU 3	Hidden units	64, 64
	Concatenate	–
MLP module	Dropout layer	Dropout probability	0.3
	FC layer	Hidden units	256
	ReLU	–
	FC layer	Hidden units	2
	Huber loss	δ	1.0

MC-SE-BiH: multiscale convolution and squeeze-and-excitation attention-assisted bidirectional hybrid; MC: multiscale convolution; GELU: Gaussian error linear unit; BN: batch normalization; SE: Squeeze-and-Excitation; BiH: Bidirectional Hybrid; FC: fully connected; ReLU: rectified linear unit; Bi-LSTM: bidirectional long short-term memory; Bi-GRU: bidirectional gated recurrent unit; MLP: multilayer perceptron.

Dataset generation for generalization framework

Numerical simulation using HOBM

For ML regression purpose, a substantial amount of data is required to achieve satisfactory accuracy. Given the difficulty of obtaining real-life and experimental datasets, numerical simulation is employed in this work to generate synthetic data for training the generalization framework. To enhance computational efficiency while maintaining simulation accuracy, a HOBM utilizing SFE method is employed.⁴⁰ HOBM has demonstrated strong performance in simulating GW prorogation in laminated beam-like structures.^61,62 It can simulate the entire beam across multiple physical layers in the transverse (z) direction (see Figure 6(a)).

Figure 6.

(a) A SFE beam model with length, L and thickness, d simulated by three physical sublayers and (b) higher-order displacement orientations in physical layer 2. SFE: spectral finite element.

Higher-order displacement kinematics of through thickness direction is employed in HOBM to model each sublayer⁴⁰ as shown in Figure 6(b). Each sublayer is assumed to be under plane-stress condition in the deformation plane. Both in-plane deformation, u, and transverse deformation, v, can be represented by incorporating displacements at the top surface $z = \frac{h}{2}$ , bottom surface $z = \frac{- h}{2}$ and mid plane $z = 0$ of the sublayer:

v = A_{v} v_{0} + B_{v} v_{T} + C_{v} v_{B}

(11)

u = A_{u} u_{0} + B_{u} θ + C_{u} u_{T} + D_{u} u_{B}

(12)

where h is the thickness of sublayer, $A_{v} = \frac{h^{2} - 4 z^{2}}{h^{2}}$ , $B_{v} = \frac{hz + 2 z^{2}}{h^{2}}$ , $C_{v} = \frac{2 z^{2} - hz}{h^{2}}$ , $A_{u} = \frac{h^{2} - 4 z^{2}}{h^{2}}$ , $B_{u} = \frac{4 z^{3} - h^{2} z}{h^{2}}$ , $C_{u} = \frac{2 h z^{2} + 4 z^{3}}{h^{3}}$ and $D_{u} = \frac{2 h z^{2} - 4 z^{3}}{h^{3}}$ , respectively. $u_{0}$ and $v_{0}$ represent the displacements at the mid-plane, while $u_{T}$ and $v_{T}$ represent the displacements at the top surface. $u_{B}$ and $v_{B}$ represent the displacements at the bottom surface, while θ represents the rotation of the sectional plane perpendicular to the x-axis.

The proposed HOBM leverages a multilayer capability to enable the incorporation of varying material properties at across sublayers, significantly simplifying composite beam simulations.^61,62 Additionally, HOBM permits division of an isotropic beam into discrete subsections, which allow simulating localized damage at specific positions. Also, through higher-order displacement kinematics, HOBM maintains superior accuracy even for thick beam scenarios.^40,62 To enhance computational efficiency, SFEs with GLL nodes are incorporated so that the total required number of FE can be significantly reduced. The GW propagation simulation is then performed by assembling element matrices from each SFE in layers and solving the equation of motion using an explicit central difference method.^40,62

A common and representative type of damage in pipes, rods, and railroad tracks – slot cut, was considered as the type of damage for simulation. Slot cuts provide controllable and repeatable damage morphology, aligning experimental specimens with their numerical counterparts. Unlike naturally occurring cracks or complex damage patterns, slot cuts allow for precise creation of damage of specific size and location. This consistency facilitates a reliable validation of the proposed model’s practicality and effectiveness. Furthermore, in terms of other damage types, detailed descriptions of this damage may require more parameters. In addition, to enhance the systematic and efficient generation of damaged GW data, employing the slot cut (i.e., crack-like defect) as the damage shape permits rigorous and pointwise assessment of model’s predictive accuracy while avoiding the many additional parameters required to fully describe other irregular or complex defects. In this study, particularly, the damage type is modeled as a surface slot cut extending across the entire beam cross-sectional width. The surface cracks with varying depths and locations are introduced into different beam models to generate diverse damage cases. An example of surface crack in an isotropic beam is illustrated in Figure 7(a), where Loc, Wid and Dep is the crack location from excitation, crack width along beam length direction and crack depth from the top beam surface, respectively. Crack simulation is achieved using element deletion method, a straightforward damage generation approach that requires no complex modifications.⁶³ This method involves applying a damage coefficient to the corresponding element mass matrix and/or stiffness matrix, effectively reducing the element stress close to zero, resulting in strong discontinuities⁶³:

M_{crack} = β \times M_{intact}

(13)

K_{crack} = β \times K_{intact}

(14)

where $M_{crack}$ and $K_{crack}$ are the penalized element mass matrix and element stiffness matrix. $M_{intact}$ and $K_{intact}$ are the intact element mass matrix and element stiffness matrix. β is the penalization coefficient equal to $0.0001$ .

Figure 7.

(a) Surface crack simulation using HOBM with SFEs and GLL nodes, where the beam is separated into two sublayers of varying thicknesses, and (b) cross-sectional view of the isotropic beam model at damage location, with cross sectional width w. HOBM: higher-order beam model; SFE: spectral finite element; GLL: Gauss–Lobatto–Legendre.

As shown in Figure 7(a) and (b), the isotropic beam is simulated by dividing it into two sub-layers with varying thicknesses, based on the crack depth. In the HOBM simulation, n SFEs are employed, with the surface crack represented by a single SFE of crack width, Wid (the red rectangle with a cross in Figure 7). The remaining intact SFEs have a constant width of $W_SFE$ . The cracked element is then processed by the element deletion method prior to solving the equation of motion to simulate GW propagation.

Damage cases

The proposed HOBM was used to simulate different damage cases. In this study, all considered beams were assumed to be sufficiently long so that no boundary reflection from both ends, while the potential crack location was constrained to an 800-mm region representing an inaccessible inspection zone. The potential crack depth was assumed to be between 10 and 50% of the total beam thickness. The width of crack was assumed to be 2 mm in all damage cases, and hence, two damage parameters need to be identified. The excitation signal was at 100 mm away from the possible damage location range, which is from 100 to 900 mm as shown in Figure 8, and measurement point was selected 50 mm from excitation point. The asymmetrical mode (A₀) GW is excited by using a 60-kHz central frequency Hann-windowed sinusoidal tone burst pulse with six cycles. Both crack width and excitation frequency were determined based on actual experiment condition (see the fifth section). To optimize computational efficiency, 100 mm absorbing regions with an exponentially increasing damping profile were implemented at both beam ends.^64,65 This configuration results in a the total length of 1200 mm (see Figure 8). The damping coefficient within the absorbing regions is defined by^64,65:

C_{absorb, i} = C_{Mmax} \times {(\frac{i}{n})}^{3}, i \in [1, n]

(15)

where $C_{absorb, i}$ is the damping matrix with exponentially increased damping coefficient at the element i, and n is the selected exponential order and total number of elements in absorbing area. $C_{Mmax}$ is the calculated maximum damping matrix, which is 400,000, defined by trial and error methodology until the model meets the acceptability criterion of reflection for all modes. A systematic convergence study established the optimal parameters, employing 120 SFEs with eight GLL nodes per element. The simulation used a time-step of $3.0 \times 10^{- 8} s$ over a total duration of $1.2 \times 10^{- 3} s$ , which was verified to fully capture all significant GW reflections from the damage.

Figure 8.

Schematical diagram of the beam model with absorbing areas.

Two damage datasets were generated from different case studies (see the fourth section). In terms of beam configuration, three different materials, steel, brass and aluminium, were considered in the feasibility study of the generalized framework. Additionally, four different beam thicknesses – 4.00, 6.00, 8.00 and 10.00 mm – were considered to assess the applicability and generalizability of the developed framework. This resulted in a total of twelve unique beam configurations. The first dataset was generated using standard material properties to create 1000 random damage cases for each beam configuration, yielding a total of 12,000 damage scenarios. The second dataset incorporated material uncertainties to account for real-world variability. Due to variations in material grades, manufacturing inconsistencies (material) and temperature effects, material properties, especially density ρ and Young’s modulus E, can vary even for the same type/grade of material. Apart from material uncertainties, fabrication processes can also introduce geometric uncertainties (e.g., local width/thickness tapers, mild out-of-straightness or surface waviness). Such geometry-induced deviations may change GW characteristics and may, in turn, influence data-driven damage detection. However, it may be more difficult or impractical to account for all forms of uncertainties. In addition, adapting the beam model for each geometric variation is more complex and challenging. Therefore, the uncertainties of dataset 2 in this study focus on material property variations, which provide representative and comprehensive damage scenarios for evaluating the generalization ability of the proposed framework. Both inputs of ρ and E thereby were uniformly sampled from predefined ranges for each material to consider these uncertainties. Poisson’s ratio remained fixed for each material type due to its relatively stable nature. The material properties and beam configuration in both datasets are summarized in Table 3.

Table 3.

Summary of damage cases.

Damagedataset	Type	Beam thickness (mm)	Density range( $kg / m^{3}$ )	Young’s modulusrange (GPa)	Poisson’s ratiorange	Cases perthickness	Cases perdataset
Dataset 1	Aluminium	4.00	2700.00	70.00	0.33	1000	12,000
		6.00
		8.00
		10.00
	Steel	4.00	7950.00	195.00	0.30	1000
		6.00
		8.00
		10.00
	Brass	4.00	8575.00	110.00	0.35	1000
		6.00
		8.00
		10.00
Dataset 2	Aluminium	4.00	[2650.00 2750.00]	[65.00 75.00]	0.33	1000	12,000
		6.00
		8.00
		10.00
	Steel	4.00	[7850.00 8050.00]	[185.00 205.00]	0.30	1000
		6.00
		8.00
		10.00
	Brass	4.00	[8400.00 8750.00]	[90.00 120.00]	0.35	1000
		6.00
		8.00
		10.00

To validate the accuracy and efficiency of the proposed HOBM, the commercial FE software ABAQUS was used to simulate GW propagation. An isotropic beam was modeled with a length of 1000 mm, a depth of 8.00 mm and a width of 12.00 mm using three-dimensional (3D) solid elements. A 3.00-mm deep crack (37.50% of beam thickness) was located 400 mm from the excitation location. Steel material properties in dataset 1 of Table 2 were used. The mesh dimension was $0.20 \times 0.20 \times 0.20$ mm³ to ensure sufficient convergence, and the time step was automatically determined by the dynamic explicit solver.

A screenshot of the ABAQUS 3D finite element method (FEM) simulation is shown in Figure 9(a) and the GW reflection signals at the same measurement point from both HBOM and ABAQUS simulation were displayed in Figure 9(b). It should be noted that no absorbing area was considered in the simulation. Despite the complexity of the damage reflections, the results show a good agreement between the HBOM and ABAQUS, indicating the accuracy of HOBM. Moreover, under the same system processer (Intel(R) Core (TM) i7-10700 CPU 2.90 GHz with 16.0 GB RAM), the HOBM required only 31.90 s to generate field outputs (i.e., displacements at all considered nodes). In contrast, the ABAQUS 3D FEM simulation took over 9345.60 CPU seconds to generate history outputs (i.e., only at the selected measurement node). The computational efficiency of HOBM can be further improved by incorporating MATLAB parallel computation.

Figure 9.

(a) A damage scenario simulated using ABAQUS 3D FEM, (b) GW reflection signals comparison between HOBM and ABAQUS 3D FEM without absorbing areas, (c) GW reflection signals at the same measurement point for beam with and without absorbing areas and (d) GW reflection signals of beams with the same damage scenario but different materials. GW: guided wave; HOBM: higher-order beam model.

Figure 9(c) demonstrates the effectiveness of the absorbing area implemented in HOBM, showing significantly reduced reflection complexity through successful elimination of boundary reflections. This approach provides notable computation advantages by removing the requirement for impractically modeling long beam to migrate the boundary reflections. Figure 9(d) shows the signal comparisons between different standard materials under the identical damage condition. Despite they have identical beam geometry and damage properties, wave characteristics vary between different materials. It is observed that both excitation pulses and damage reflection pulses exhibit diverse trends, presenting a significant challenge in identifying damage parameters across different beam models when using a single neural network.

Generalization approach implementation for different beam-like structures

Several numerical cases studies were conducted to investigate the performance of the developed MC-SE-BiH algorithm for generalization damage identification. Initially, the generalization ability of the proposed approach was tested in three scenarios with increasing detection difficulty on case study 1. In this case study, a comparison was made with several conventional representative models: three are based on individual CNN, LSTM, and GRU, and the other two are using integrating algorithms, CNN-LSTM and CNN-GRU, which utilize a grid search strategy,⁶⁶ to optimize their hyperparameters, rather than relying on artificial empirical selection. Furthermore, case study 2 examined the broader generalization performance of developed network for material uncertainties caused by variations in material grades, manufacturing inconsistencies and temperature effects. Finally, case study 3 investigated the robustness of the proposed framework to randomly varying noise and measurement error. Beyond the core objectives of this article (i.e., accessing generalization concept and feasibility), Case study 4 was designed as an exploratory extension to probe the framework’s behavioral boundaries (i.e., performance ceiling) under unseen structural configurations that were intentionally outside the training/validation distribution. Therefore, the case study 4 is only an exploratory probe rather than a confirmatory evaluation.

Case study 1: comparative analysis of DL frameworks for generalization performance

This section evaluates the generalization capability of various DL-based models for damage detection across diverse beam-like structures using dataset 1 (without material uncertainties). We examine three progressively challenging scenarios to systematically assess diagnostic performance involving differences in dimensions and materials, that is, (i) different materials with identical thickness, (ii) different thicknesses with identical material, and (iii) different materials and different thicknesses. This article mainly focuses on training a model that can be applied to a variety of different beam-like structures, so the datasets in the first two scenarios can be used as subtraining datasets in the third scenario of case study 1. Table 4 summarizes all scenarios in case study 1. To ensure a fair comparison, all approaches used in this case study have the same configurations (network backbone) and training strategies except for their own optimal hyperparameters. Since the datasets used for training vary in different scenarios, the hyperparameters for other considered models are determined case by case.

Table 4.

Summary of all scenarios in case study 1.

Case study 1	Description	Material	Thickness (mm)	Total samples
Scenario 1	Different materials but same thickness	Aluminium	6.00	3000
		Brass
		Steel
Scenario 2	Different thicknesses but same material	Aluminium	4.00	4000
			6.00
			8.00
			10.00
Scenario 3	Different materials and different thicknesses	Identical to dataset 1		12,000

The evaluation results of the proposed framework, along with other considered models in case study 1, are summarized in Table 5. As observed, the proposed MC-SE-BiH exhibits the lowest RMSE and MAPE, as well as best goodness of fit (i.e., highest R²), for each scenario of case study 1. This demonstrates the superior predictive performance of MC-SE-BiH in estimating both damage location and depth percentage compared to other considered models.

Table 5.

Summary of the evaluation results of the proposed MC-SE-BiH along with other considered models in case study 1.

Methods	Scenario	Evaluation metrics
		RMSE		R ²		MAPE (%)
		Location	Depth	Location	Depth	Location	Depth
MC-SE-BiH	1	6.02	0.47	0.999	0.998	1.13	1.27
	2	5.81	0.30	0.999	0.999	1.30	0.93
	3	3.49	0.36	0.999	0.999	0.72	0.88
CNN-LSTM	1	11.69	0.55	0.997	0.997	2.22	1.87
	2	14.64	0.73	0.995	0.995	2.69	2.21
	3	13.02	0.60	0.997	0.997	2.61	1.82
CNN-GRU	1	11.33	0.74	0.997	0.996	2.27	1.80
	2	13.75	0.56	0.996	0.997	2.80	1.71
	3	16.63	0.75	0.995	0.996	3.75	2.34
CNN	1	25.72	1.29	0.988	0.987	3.72	3.39
	2	9.55	0.69	0.998	0.996	1.79	1.81
	3	25.74	0.68	0.988	0.997	5.13	1.95
LSTM	1	201.09	10.12	0.312	0.236	38.13	26.78
	2	88.63	5.61	0.859	0.770	19.77	17.80
	3	44.69	5.28	0.962	0.791	8.66	12.79
GRU	1	196.59	9.49	0.254	0.254	33.91	26.17
	2	84.88	4.56	0.856	0.839	18.05	12.43
	3	70.73	5.35	0.909	0.792	15.12	12.61

MC-SE-BiH: Multiscale convolution and squeeze-and-excitation attention-assisted bidirectional hybrid; CNN: convolutional neural network; LSTM: long short-term memory; GRU: gated recurrent unit; RMSE: root mean squared error; R²: coefficient of determination; MAPE: mean absolute percentage error.

In contrast, individual LSTM and GRU struggle to adapt to the complexity of the damage detection scenarios investigated in this article. This is particularly evident in scenario 1, where both models failed to detect damage, posing significant challenges for achieving generalization. Specifically, the R² values for both models remain below 0.400 in this scenario, accompanied by substantial RMSE and MAPE values. Although RNN model such as LSTM or GRU excel at extracting features from temporal sequences, GW signals inherently contain both temporal and spatial characteristics. The considerable variations in GW signals across different structures, often manifesting as nonlinear relationships, and their susceptibility to be confused with each other under certain damage conditions, further affect the distinguishability of information from GW signals. Additionally, the input data is a raw GW signal without any prior signal processing, resulting in high-dimensional input features. As a result, individual LSTM and GRU demonstrate limited effectiveness in isolating and leveraging key features from the input data, resulting in suboptimal performance.

While CNN showed relatively better performance in the first two scenarios, its performance in the third scenario is inadequate. Notably, the relatively good performance of the integrated algorithms, CNN-LSTM and CNN-GRU, is expected and reasonable, as the developed methodology represents an improvement and enhancement built upon the foundational principles of these two approaches by integrating multiscale convolution, channel attention mechanisms, and bidirectional RNNs. Despite the fact that both integrated algorithms achieved good prediction results, their training loss exhibits local marked fluctuations after convergence due to instability in the network during feature extraction, as evidenced by the training loss curves in Figure 10. The results demonstrate that the proposed MC-SE-BiH algorithm more effectively interpret, distil, and learn the information from GW signals, thereby establishing an effective mapping between the GW time-domain signals and damage parameters. The predicted results of the proposed MC-SE-BiH for beam-like structures with different materials and thicknesses are visualized using scattering plots in Figure 12(a). The predicted values closely align with the perfect agreement line, further demonstrating that the strong generalization ability of the proposed approach. The Appendix A (Figures A1 to A6) in this article presents an additional detailed evaluation of several different DL approaches for damage identification in case study 1, featuring predicted-versus-ground-truth scatter plots to facilitate direct and clear comparability. Furthermore, to enhance the interpretation of the physical significance, representative visualizations are provided, aiming to the discoverability and presentation of different models for prediction. In particular, a representative example is presented to examine the identification performance of each model under the scenario 3. The visualization of prediction results adopts a physical representation, as illustrated in Figure 11, where positions and dimensions are all expressed as relative relationships. For damage location prediction, models achieving superior performance exhibit predictions that are spatially closer to the actual damage site. Similarly, for damage depth estimation, the accuracy of a model improves as its predicted depth percentage approaches the true damage depth. It is noteworthy that, the GW time-domain signal corresponding to the worst positional estimation of the proposed approach within the testing dataset is employed to evaluate all other considered models for comparison from a physical perspective. It can be observed that even for the testing instance exhibiting the weakest performance, the proposed model still outperforms alternative models in predictive accuracy.

Figure 10.

Training loss curves of the proposed MC-SE-BiH, CNN-LSTM, and CNN-GRU for damage detection in scenario 3. MC-SE-BiH: multiscale convolution and squeeze-and-excitation attention-assisted bidirectional hybrid; CNN: convolutional neural network; LSTM: long short-term memory; GRU: gated recurrent unit.

Figure 11.

Identification results of different models on the worst estimation from the proposed approach for testing dataset in the scenario 3: (a) MC-SE-BiH, (b) CNN-LSTM, (c) CNN-GRU, (d) CNN, (e) LSTM and (f) GRU. Note: APD is the absolute percentage difference for crack location $L_{APD}$ and its depth percentage $D_{APD}$ . MC-SE-BiH: multiscale convolution and squeeze-and-excitation attention assisted bidirectional hybrid; CNN: convolutional neural network; LSTM: long short-term memory; GRU: gated recurrent unit.

In summary, the case study 1 focuses on evaluating the generalization performance of the proposed MC-SE-BiH and other considered models in detecting damage under three distinct scenarios, enabling a comprehensive comparative analysis. The proposed framework demonstrates a remarkable ability in addressing the challenge of generalizing damage identification in GW-based damage detection using DL techniques. Unlike the approaches used for comparison in this section, which show varying performance limitations, the proposed MC-SE-BiH algorithm achieves superior generalization performance for damage identification across different beam-like structures.

Case study 2: performance of the proposed framework under varying material uncertainties

This section investigates the performance of the MC-SE-BiH framework across various beam-like structures subjected to uncertainties such as material grade variations (which encompass all material properties of this alloy series), manufacturing inconsistencies and temperature effects. These uncertainties primarily influence the Young’s modulus and density of the materials (see the third section for details). Even among alloy variants with the same material composition, the reflected waves from crack damage exhibit noticeable differences. This underscores the importance of further exploring generalization ability of the proposed method for damage identification across a broader range of beam-like structures.

Dataset 2, incorporating material uncertainty, was used to construct the dataset for this case, totalling 12,000 samples sets. Notably, due to the material properties being uniformly and randomly generated within a specific range (as detailed in the third section), no two samples in dataset 2 share identical material properties. This poses a significant challenge for damage detection tasks, as GW characteristics may exhibit only slight variations within a narrow range of material properties, while the data representing such subtle variations are very limited. Furthermore, the feature distribution of GW signals could be highly uneven, with certain ranges exhibiting a higher concentration of data while others have a comparatively lower density. Despite these challenges, the proposed approach demonstrates superior generalization performance, with exceptional goodness of fit for both damage location (R² = 0.999) and depth percentage (R² = 0.997). Additionally, the RMSE and MAPE values for the developed algorithm are only 6.53 and 1.34% for damage location, and 0.61 and 1.73% for damage depth percentage, respectively. The results of the evaluation metrics can be found in Table 6. Figure 12(b) shows the predicted results of the proposed method for identifying damage across different beam-like structures under various material uncertainties. As shown in the scatter plots, nearly all predicted values aligned with or are in close proximity to the perfect agreement line, with only a few outliers.

Table 6.

Results of the proposed MC-SE-BiH for different beam-like structures in case study 2 (from dataset 2) and case study 3 (from dataset 2 after adding noise and signal translation).

Case	Damage parameter	RMSE	R ²	MAPE (%)
Case study 2	Location	6.53	0.999	1.34
	Depth percentage	0.61	0.997	1.73
Case study 3	Location	6.82	0.999	1.54
	Depth percentage	0.69	0.996	1.77

MC-SE-BiH: multiscale convolution and squeeze-and-excitation attention-assisted bidirectional hybrid; RMSE: root mean squared error;R²: coefficient of determination; MAPE: mean absolute percentage error.

Figure 12.

Prediction results of the proposed MC-SE-BiH for damage identification of different beam-like structures under (a) scenario 3 of case study 1, (b) various material uncertainties and (c) various material uncertainties and measurement conditions. MC-SE-BiH: multiscale convolution and squeeze-and-excitation attention assisted bidirectional hybrid.

Case study 3: performance of the proposed framework under different unavoidable noise and measurement errors

This section evaluates the performance of the proposed generalized approach under various measurement conditions, including material errors. Specifically, Gaussian noise is added to simulate the effect of measurement noise. Different signal-to-noise ratios (SNRs) are applied to the GW time-domain signals. To examine the generalization capability of the proposed method across diverse measurement environment scenarios, noise levels, including random and uniform SNR within the range of 30–70 dB, and noise-free conditions, are applied to the all datasets from dataset 2 to simulate GW responses from real beam-like structures.⁶⁷

In addition, the inevitable presence of human measurement errors can result in slight positional offsets at the measurement point. To account for this, signal translation, a data augmentation technique,⁶⁸ is employed to enhance the GW signals by simulating human measurement errors. The signal translation range extends 5.00 mm forward (positive) or backward (negative) from the original measurement point. To address gaps introduced during the translation process, zero-padding is applied. The dataset for signal translation is derived from the original dataset with added noise. Hence, the total dataset size increases to 24,000. The detailed results of the proposed approach for case study 3 are presented in Table 6. As shown, the proposed approach achieves remarkably low RMSE and MAPE values, along with a high goodness of fit. This further demonstrates the strong robustness of the proposed approach.

Figure 12(c) presents the regression results of the proposed approach in the form of scatter plots. The approach demonstrates superior robustness in localizing and sizing damage (depth percentage). Specifically, for damage localization, nearly all the predicted values closely aligned with the perfect agreement line in the test datasets. Compared with the prediction of damage location, the identification results of damage depth percentage are relatively weak. This is because the considered uncertainties in the case studies have more effect in size detection than location detection. Generally, damage location is based on reflection pulse time of flight (ToF). However, damage sizing is more relying on reflection signal amplitude. In realistic, the GW amplitude can be affected easier than ToF. Moreover, the determination of damage size is largely reflected in the amplitude of GW time-domain signals. The introduction of noise further affects the signal amplitude, making it more challenging to distinguish between similar GW signals and accurately identify the severity of damage. Despite the inclusion of random noise levels and the signal translation operation in dataset 2, which contains samples with various material uncertainties, the proposed generalization approach still delivers strong performance in determining damage depth percentage.

Case study 4: generalization capability of the proposed framework on unseen structural configurations

The proposed framework demonstrated exceptional prediction results and robustness in the previous case studies. To further explore the application potential, this section investigates and discusses the performance of the framework when one or more beam models (used to generate training dataset) are removed as unseen beam configurations for testing. The dataset for this generalization capability investigation was derived from dataset 2. A total of twelve HOBM were utilized to generate dataset, with each model incorporating material properties uncertainty. Notably, each HOBM generates 1000 sets of GW signals. To comprehensively evaluate the generalization capability of the proposed approach, a hierarchical evaluation strategy was designed. This section aims to investigate how well the proposed algorithm adapts to different unseen scenarios. The detailed descriptions of these unseen scenarios are as follows: (1) unseen scenario 1: the well-trained MC-SE-BiH did not encounter the data generated by a single HOBM during training; (2) unseen scenario 2: the well-trained MC-SE-BiH did not encounter the data generated by all HOBM with a specific cross-sectional dimension during training; (3) unseen scenario 3: the well-trained MC-SE-BiH did not encounter data generated by all HOBM composed of a specific material during training; (4) unseen scenario 4: the well-trained MC-SE-BiH did not encounter data generated by all HOBM composed of a specific material and those with a specific cross-sectional dimension during training. The training and test dataset for each unseen scenario are summarized in Table 7.

Table 7.

Summary of training and test dataset for each unseen scenario in case study 4.

Unseen scenario	Training dataset			Test dataset
	Material	Thickness (mm)	Total sample	Material	Thickness (mm)	Total sample
1	Aluminium	4.00	11,000	Steel	8.00	1000
		6.00
		8.00
		10.00
	Brass	4.00
		6.00
		8.00
		10.00
	Steel	4.00
		6.00
		10.00
2	Aluminium	4.00	9000	Aluminium	8.00	3000
		6.00
		10.00
	Brass	4.00		Brass	8.00
		6.00
		10.00
	Steel	4.00		Steel	8.00
		6.00
		10.00
3	Aluminium	4.00	8000	Steel	4.00	4000
		6.00
		8.00			6.00
		10.00
	Brass	4.00			8.00
		6.00
		8.00			10.00
		10.00
4	Aluminium	4.00	6000	Steel	8.00	1000
		6.00
		10.00
	Brass	4.00
		6.00
		10.00

The same network architecture and training details of the proposed MC-SE-BiH were employed in this case study. To dynamically evaluate proposed MC-SE-BiH performance and guide its optimization during training, the training datasets in Table 6 were split into a training set (90%) and a validation set (10%). The evaluation results of the proposed algorithm on unseen beam data are presented in Table 8. As shown, especially for unseen scenario 1, the identification results of the proposed approach on the unseen beam data, where only one beam configuration is excluded, are notable. In this scenario, the proposed approach achieves the lowest RMSE, MAPE, and the highest goodness of fit, both in localizing damage and sizing its depth percentage. For other unseen scenarios, despite an increase in the number of unseen beam configurations corresponding to a reduction in the training dataset, the proposed approach still performs well in identifying the damage location.

Table 8.

The results of the proposed MC-SE-BiH for damage identification on unseen beam data.

Unseen scenario	Damage parameter	RMSE	R ²	MAPE (%)
Scenario 1	Location	6.24	0.999	1.49
	Depth percentage	1.10	0.991	3.03
Scenario 2	Location	13.62	0.997	2.64
	Depth percentage	1.93	0.972	4.25
Scenario 3	Location	7.09	0.999	1.41
	Depth percentage	1.83	0.976	4.10
Scenario 4	Location	16.19	0.995	3.18
	Depth percentage	2.27	0.962	5.59

Figure 13 presents the detailed prediction results of the proposed MC-SE-BiH for identifying damage parameters on unseen beam data across different unseen scenarios. As indicated by the evaluation metrics, the proposed approach demonstrates good prediction results for localizing damage on unseen beam data across all scenarios. However, the prediction results for damage size are suboptimal in certain scenarios. Particularly, as observed from the identification results in Figure 13(b) to (d), there is a significant deviation between the predicted and actual values for damage depth percentage within a certain range, particularly for unseen scenario 4. In the literature,²⁷ along with the numerical case studies presented in this study, suggests that GW-based data-driven approaches using DL generally offer more accurate prediction results for damage location parameters than that for size parameters. Furthermore, as designed in the dataset preparation section, no two samples share identical material properties in dataset 2. When extracting unseen beam data for testing the proposed framework in this section, a limited number or lack of critical GW signal samples during training, which is necessary for accurately identifying damage size parameters (i.e., damage depth percentage) on unseen beam data, may occur. Another possible reason for poor prediction performance could be significant differences in feature distributions between training data and unseen test data. Distributional shift between the training data and unseen test data, especially in scenario 4, may have introduced feature patterns not well represented during training. If the feature range or distribution in the training dataset significantly differs from that of the unseen data, the model may fail to effectively learn the relationship. The case study for unseen configuration focuses more on exploring more robust performance of the proposed approach. Generally, in data-driven based damage detection using GW, it is desirable to leverage data from existing or well-characterized structures for model training and detecting damage on these structures. Nevertheless, in these unseen scenarios, the beam models used to generate test data were intentionally removed from the training dataset. For an alternative perspective or potential solution in the future, before applying the proposed approach, additional data from the structures inspected can be used for training (e.g. generating augmentation data for these structures), whereas the specific details require further study.

Figure 13.

Prediction results of the proposed MC-SE-BiH for unseen beam data under (a) unseen scenario 1, (b) unseen scenario 2, (c) unseen scenario 3 and (d) unseen scenario 4. MC-SE-BiH: multiscale convolution and squeeze-and-excitation attention assisted bidirectional hybrid.

After fulfilling conceptual feasibility of generalizing damage detection in this article, the performance upper limit of the proposed method is preliminarily explored and discussed when the knowledge domain gap is large. In summary, the identification results demonstrate that the proposed approach can generalize damage identification for different beam-like structures and exhibits considerable potential for application on unseen structural configurations. In the application of an automatic GW-based damage detection system, most of the scenarios encountered during damage detection are considered and incorporated into the data preparation within the proposed generalization framework. Consequently, the proposed framework has the potential to generalize damage detection for most situations.

Experimental study

Experimental setup

Beams made of three different materials, with a length of 1000 mm and a width of 12.00 mm, were utilized in the experimental study to further demonstrate and investigate the effectiveness and practicability of the proposed generalized framework. The material properties are summarized in Table 9. These values are idealized, but actual properties may differ. Additionally, it is also noteworthy that crack-like flaws were generated by artificial cutting, which may introduce manufacturing errors.

Table 9.

Material properties of each type of beam-like structure in experimental study.

Beam material	Grade	Density ( $kg / m^{3}$ )	Young’s modulus (GPa)
Aluminium	6060 T5	2700.00	69.00
Brass	385	8470.00	96.00
Steel	MS (Mild Steel) 250	7860.00	200.00

To facilitate measurement, the experimental beam was placed on rigid supports. A rectangular piezoceramic transducer (PZT) with size $12 \times 6 \times 1 {mm}^{3}$ , was bonded to the left end surface of each beam using double-sided conductive adhesive tape, was applied to excite $A_{0}$ GW signals. Modelling clay was tightly wrapped around beam ends to minimize the boundary reflections from the left and right beam ends. A brass backing mass with dimension $12 \times 6 \times 3 m m^{3}$ was attached on the top of the PZT, thereby enhancing the excitability of $A_{0}$ GW. To achieve the best SNR in the experiment, the excitation signal was chosen as 60 kHz narrow-band, six-cycle Hann-windowed sinusoidal tone burst pulse. This signal matched the excitation used in the numerical case studies.

A signal generator, National Instrument PXIE-5412 (Austin, TX, USA), was used to generate the excitation signal, which was synthetized and controlled by a computer (PC). The excitation signal was then amplified by a CIPRIAN HVA-400 amplifier (Saint-Ismier, Isère, France) before being applied to the PZT. Transverse (out-of-plane) displacements at the measurement point were recorded using a 1D Polytec PSV-400 laser Doppler vibrometer (Waldbronn, Baden-Württemberg, Germany), with a controlling system OFV5000. The data were subsequently transmitted to the computer for analysis through the data acquisition unit. The measurement point was positioned 50 mm away from the PZT to maintain the same configuration as in the numerical simulation. Signal averaging (500 times) and a band-pass filter were utilized to further reduce the influence of environmental noise on the GW signals. A detailed flowchart of experimental setup is shown in Figure 14. Details on the crack damage for the experimental beam cases can be found in Figure 15 and Table 10.

Figure 14.

Schematic diagram of the experimental setup for GW excitation using PZT and measurement using 1D laser vibrometer. GW: guided wave; 1D: one dimensional; PZT: piezoceramic transducer.

Figure 15.

Details of the (a) three intact beams composed of different materials (aluminium, brass, and steel), (b) PZT installation and measurement point and (c) zoom-in of the crack damage in one of steel beams. PZT: piezoceramic transducer.

Table 10.

Summary of the damage identification results using the proposed generalization framework in experimental case studies.

Experimental beam case	Material	Thickness (mm)	$L_{a}$ (mm)	$L_{p}$ (mm)	$L_{APD}$ (%)	$D_{a}$ (%)	$D_{P}$ (%)	$D_{APD}$ (%)
1	Aluminium	6.00	350.00	359.93	2.80	49.00	53.00	7.84
2	Aluminium	6.00	400.00	373.76	6.78	49.00	50.40	1.80
3	Aluminium	6.00	550.00	555.41	0.98	47.00	46.68	0.68
4	Brass	4.00	150.00	159.30	6.01	36.00	38.30	6.19
5	Brass	4.00	500.00	489.88	2.04	31.60	34.76	9.52
6	Brass	6.35	650.00	640.74	1.43	36.22	35.17	2.94
7	Brass	6.35	650.00	636.11	2.16	26.20	28.69	9.07
8	Brass	6.35	650.00	637.18	1.99	20.50	22.52	9.39
9	Steel	4.00	350.00	351.27	0.36	50.00	47.19	5.78
10	Steel	4.00	350.00	356.27	1.78	42.50	43.34	1.96
11	Steel	4.00	350.00	347.05	0.85	31.70	34.47	8.37
12	Steel	4.00	350.00	336.58	3.91	25.50	28.04	9.49
13	Steel	4.00	350.00	323.66	7.82	15.00	16.17	7.51
14	Steel	4.00	800.00	757.40	5.47	37.50	34.36	8.74
15	Steel	4.00	800.00	754.66	5.83	21.00	19.95	5.13
16	Steel	8.00	200.00	213.05	6.32	47.00	49.02	4.21
17	Steel	8.00	200.00	199.17	0.42	41.00	47.85	15.42
18	Steel	8.00	700.00	647.57	7.78	18.75	19.42	3.51
19	Steel	8.00	700.00	675.74	3.53	31.25	30.35	2.92
20	Steel	8.00	700.00	697.58	0.35	37.50	36.38	3.03
21	Steel	8.00	700.00	698.33	0.24	43.75	42.71	2.41
22	Steel	8.00	700.00	710.26	1.46	50.00	49.91	0.18
23	Steel	10.00	750.00	732.02	2.43	15.00	16.76	11.08
24	Steel	10.00	750.00	743.56	0.86	27.00	25.34	6.34
25	Steel	10.00	750.00	752.19	0.29	30.00	29.60	1.34
26	Steel	10.00	750.00	746.53	0.46	40.00	41.79	4.38
27	Steel	10.00	750.00	754.25	0.57	45.00	48.10	6.66
28	Steel	10.00	750.00	769.70	2.59	50.00	50.40	0.80

$L_{a}$ : actual value of crack location; $L_{p}$ : predicted value of crack location; $D_{a}$ : actual value of crack depth percentage; $D_{P}$ : predicted value of crack depth percentage; APD: absolute percentage difference for crack location ( $L_{APD}$ ) and its depth percentage ( $D_{APD}$ ).

Results and discussion

The experimental data underwent a preprocessing flow identical to that used for the simulated data to maintain consistency with the numerical cases. Specifically, the obtained GW signals were truncated to the same time window as the numerical case study. Normalization was then used to scale each signal to a [−1, 1] range. Finally, the normalized GW signals were interpolated to match feature input requirement (i.e., 4000 points) of the model. Damage identification was automatically implemented using the trained MC-SE-BiH framework from case study 3 (see the fourth section), without further fine-tuning or retraining. Both damage parameters (i.e., crack location and depth percentage) were accurately identified in most experimental beam cases. Table 10 summarizes the prediction results for the experimental beam cases using the proposed generalization framework. The absolute percentage difference (APD) for each experimental beam was calculated using the following equation, which quantifies the relative difference between the actual and predicted value²⁷:

APD = \frac{| A - P |}{(A + P) / 2} \times 100 %

(16)

where A represents the actual damage parameter, P represents the predicted damage parameter, and $(A + P) / 2$ is the mean of two values.

The identification results of crack damage parameters are clearly illustrated in Figure 16 using bar plots. As shown, most of the crack damage parameters were accurately identified in experimental beam cases. Although an additional set of unseen brass specimens with a thickness of 6.35 mm was used for damage identification in this experiment study, the proposed generalization framework is also capable of identifying damage parameters in these beam cases. The results not only validate the practicality and effectiveness of the proposed generalization framework for real-world damage detection but also highlight its significant potential for identifying damage in unseen structural configurations. During the training process, the proposed MC-SE-BiH framework encountered data generated from the HOBM composed of brass components with a 6.00-mm thickness, which is similar in cross-sectional dimension to the experimental brass beams. Given the minor difference in thickness between beam of 6.00 mm thickness and that of 6.35 mm thickness, the characteristics of GW responses may fall within the range of training data, and there is more data with similar features. Consequently, the proposed framework performs well on this dataset, which aligns with the expected research objectives of this study. However, in the case of unseen scenario 2 in case study 4 (see the fourth section), the HOBM used for testing has an 8.00-mm thickness, which presents a relatively large difference in cross-sectional dimensions compared the 6.00 or 10.00 mm thicknesses used to generate the training dataset for this unseen scenario. As mentioned, variations in the cross-sectional dimensions of structures significantly affect GW responses. This relatively significant disparity might result in the model failing to establish a more accurate mapping relationship between GW signals and damage parameters for unseen beam data in unseen scenario 2, thereby explaining the relatively poor performance observed.

Figure 16.

Bar charts of damage identification results for the experimental beam cases (predicted value vs actual value) (a) predicted crack damage location (mm) versus actual crack damage location and (b) predicted crack damage depth percentage versus actual crack damage depth percentage.

During the experiment, artificially introduced cracks (i.e., slot cuts) often deviate from idealized geometries, they typically exhibit surface roughness, slight waviness or minor misalignment, and lack perfect linearity. Consequently, a discrepancy arises between the real cracks presented in experimental beam cases and idealized slot-type cracks assumed in simulation models. In addition, certain factors inherent to the experimental setup and measurement, such as the transducer installation, imperfect coupling between the PZT and beams, the energy attenuation of GW during propagation, the limited absorption capacity of absorbing clay at both ends of the beam, and various measurement errors – introduce unavoidable discrepancies between the experimentally measured GW signals and their simulated counterparts. Therefore, while the prediction performance of the proposed generalization framework for the experimental study exhibits relatively larger APD in a few beam instances compared to its performance on numerical studies, these values remain within an acceptable error range.

Conclusion

This article has proposed a generalization framework for localizing and characterizing damage across various beam-like structures using GW. The proposed MC-SE-BiH algorithm integrates multiscale convolution, channel attention, and bidirectional RNNs, providing powerful feature distillation and interpretation capabilities for entire GW time-domain signal, without the need for traditional GW signal processing techniques. An enhanced HOBM has been employed to efficiently and accurately simulate GW propagation in distinct isotropic beam models. The numerical studies have comprehensively demonstrated the proposed framework significantly outperforms other considered DL-based methods. Furthermore, additional material uncertainties and measurement conditions have been considered in the subsequent case studies. The results indicate that the proposed approach exhibits strong generalization capability and robustness for damage detection across a broader range of beam-like structures. Further research on the performance of the proposed approach with unseen beam configurations has also been presented. The identification results from unseen beam data show that the proposed approach excels at localizing damage and also exhibits potential for characterizing damage. Multiple sets of experimental measurements have further validated the effectiveness and practicality of the proposed generalization framework in the experiment.

While the current approach has shown great potential for generalizing damage detection in different structures, there are still areas for improvement to enhance the practicality of the framework. Specifically, current generalization framework has only been tested on beam-like structures (1D waveguide), although it has the potential to been applied to plates or shells (two-dimensional (2D) waveguide), where the careful consideration of an appropriate measurement design may be necessitated to effectively capture the GW signals. Further research is needed to explore. The current research focuses on a slot cut as the form of damage, whereas different types of damage with more complex profiles can always exist in real-world scenarios, especially in 2D waveguide. The proposed framework can be extended in future work to explore how variations in damage width and spatial profile (e.g., rectangular, Gaussian, or irregular) influence the model’s ability to generalize, which could provide deeper insights into the robustness and applicability of the approach across a wider variety of damage scenarios. Furthermore, during training process, apart from the cross-validation method (i.e., splitting the dataset into training, validating, and testing sets) used in this article, other cross-validation techniques (e.g., k-fold cross-validation) could be potentially considered to further assess model robustness. As DL-based data-driven models capture patterns and nonlinear relationships directly from data without the need for physical modelling, this approach inherently raises potentially concerning issues regarding the lack of insufficient transparency in decision-making (i.e., the black-box nature of such models). Therefore, we will consider interpretable method to find the links between DL model and physical sense in the future, such as local interpretable model-agnostic explanations, SHapley Additive exPlanations, and class activation mapping. In addition, some approaches designed to address such domain-shift issues, such as maximum mean discrepancy or variational domain regularization, which may be a solution to improve the unseen cases which worth further study.

Footnotes

Appendix A

This appendix provides the regression prediction results of different DL frameworks for identifying damage parameters in different beam-like structures (case study 1 in the fourth section) using scatter plots.

Authors’ note

Given his role as an Associate Editor of this journal, Ching Tai Ng had no involvement in the peer-review of articles for which he was an author and had no access to information regarding the peer-review. Full responsibility for the peer-review process for this article was delegated to another Editor of this journal.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The first author would like to acknowledge the scholarship supported by China Scholarship Council.

ORCID iDs

Bo Yang

Zijie Zeng

Ching Tai Ng

References

Das

Saha

Patro

. Vibration-based damage detection techniques used for health monitoring of structures: a review. J Civ Struct Health Monit 2016; 6: 477–507.

Ling

Abdul Rahim

. A review on ultrasonic guided wave technology. Australian Journal of Mechanical Engineering 2020; 18: 32-44.

Y-K

Sohn

. Integrated impedance and guided wave based damage detection. Mech Syst Signal Process 2012; 28: 50–62.

Jiang

Zhou

Zhang

, et al. Guided wave-based crack detection in U-shaped flexural plate butt welds. Eng Struct 2024; 303: 117558.

Mitra

Gopalakrishnan

. Guided wave based structural health monitoring: a review. Smart Mater Struct 2016; 25: 053001.

Nerlikar

Mesnil

Miorelli

, et al. Damage detection with ultrasonic guided waves using machine learning and aggregated baselines. Struct Health Monit 2024; 23: 443–462.

Yang

Tian

, et al. A review on guided-ultrasonic-wave-based structural health monitoring: From fundamental theory to machine learning techniques. Ultrasonics 2023; 133: 107014.

Sanderson

Catton

. An analytical model for guided wave inspection optimization for prismatic structures of any cross section. IEEE Trans Ultrason Ferroelectr Freq Control 2011; 58: 1016–1026.

Deng

Yang

. Scattering of S0 Lamb mode in plate with multiple damage. Appl Math Model 2011; 35: 550–562.

10.

Ereiz

Duvnjak

Jiménez-Alonso

. Review of finite element model updating methods for structural applications. Structures 2022; 41: 684–723.

11.

Willberg

Duczek

Vivar-Perez

, et al. Simulation methods for guided wave-based structural health monitoring: a review. Appl Mech Rev 2015; 67: 010803.

12.

Krawczuk

. Application of spectral beam finite element with a crack and iterative search technique for damage detection. Finite Elem Anal Des 2002; 38: 537–548.

13.

Ostachowicz

Kudela

Krawczuk

, et al. Guided waves in structures for SHM: the time-domain spectral element method. Hoboken, NJ: John Wiley & Sons, 2011.

14.

Lee

Kim

Min

, et al. Structural damage detection using deep learning and FE model updating techniques. Sci Rep 2023; 13: 18694.

15.

Solomatine

See

Abrahart

. Data-driven modelling: concepts, approaches and experiences. In: Abrahart

See

Solomatine

(eds) Practical hydroinformatics: Computational intelligence and technological developments in water applications. Berlin, Heidelberg: Springer, 2008, pp. 17–30.

16.

Borate

Wang

. Data-driven structural health monitoring approach using guided Lamb wave responses. J Aerosp Eng 2020; 33: 04020033.

17.

Melville

Alguri

Deemer

, et al. Structural damage detection using deep learning of ultrasonic guided waves. In: AIP conference proceedings, 2018. Melville, NY: AIP Publishing.

18.

Mahajan

Banerjee

. A machine learning framework for guided wave-based damage detection of rail head using surface-bonded piezo-electric wafer transducers. Mach Learn Appl 2022; 7: 100216.

19.

Lyu

Yang

, et al. Wavelet packet energy-based damage detection using guided ultrasonic waves and support vector machine. Meas Sci Technol 2023; 34: 075107.

20.

Khan

Kim

Shin

, et al. Damage assessment of smart composite structures via machine learning: a review. JMST Adv 2019; 1: 107–124.

21.

LeCun

Bengio

Hinton

. Deep learning. Nature 2015; 521: 436–444.

22.

Liu

Zhang

. Deep learning based crack damage detection technique for thin plate structures using guided lamb wave signals. Smart Mater Struct 2019; 29: 015032.

23.

Gonzalez-Jimenez

Lomazzi

Junges

, et al. Enhancing Lamb wave-based damage diagnosis in composite materials using a pseudo-damage boosted convolutional neural network approach. Struct Health Monit 2024; 23: 1514–1529.

24.

Rai

Mitra

. Lamb wave based damage detection in metallic plates using multi-headed 1-dimensional convolutional neural network. Smart Mater Struct 2021; 30: 035010.

25.

Agrawal

Mittal

. Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. Vis Comput 2020; 36: 405–412.

26.

Shiri

F-M

Perumal

Mustapha

, et al. A comprehensive overview and comparative analysis on deep learning models. J Artif Intell 2024; 6: 301–360.

27.

Rautela

Gopalakrishnan

. Ultrasonic guided wave based structural damage detection and localization using model assisted convolutional and recurrent neural networks. Expert Syst Appl 2021; 167: 114189.

28.

Rezazadeh

Perfetto

Polverino

, et al. Guided wave-driven machine learning for damage classification with limited dataset in aluminum panel. Struct Health Monit 2025; 24: 3138–3161.

29.

Ahmadzadeh

Zahrai

Bitaraf

. An integrated deep neural network model combining 1D CNN and LSTM for structural health monitoring utilizing multisensor time-series data. Struct Health Monit 2025; 24: 447–465.

30.

Huang

Hong

Yang

, et al. CNN-LSTM network-based damage detection approach for copper pipeline using laser ultrasonic scanning. Ultrasonics 2022; 121: 106685.

31.

Fathnejat

Ahmadi-Nedushan

Hosseininejad

, et al. A data-driven structural damage identification approach using deep convolutional-attention-recurrent neural architecture under temperature variations. Eng Struct 2023; 276: 115311.

32.

Truong

Lee

Nguyen-Thoi

. An effective framework for real-time structural damage detection using one-dimensional convolutional gated recurrent unit neural network and high performance computing. Ocean Eng 2022; 253: 111202.

33.

Chen

Jia

Yang

, et al. A vibration-based 1DCNN-BiLSTM model for structural state recognition of RC beams. Mech Syst Signal Process 2023; 203: 110715.

34.

Niu

Zhong

. A review on the attention mechanism of deep learning. Neurocomputing 2021; 452: 48–62.

35.

Weiss

Khoshgoftaar

Wang

. A survey of transfer learning. J Big Data 2016; 3: 1–40.

36.

Torrey

Shavlik

. Transfer learning. In: Olivas

Martín-Guerrero

Martínez-Sober

, et al. (eds) Handbook of research on machine learning applications and trends: algorithms, methods, and techniques. Hershey, PA: IGI Global, 2010, pp. 242–264.

37.

Postorino

Monteiro

Rébillat

, et al. Cross-structures deep transfer learning through Kantorovich potentials for lamb waves based structural health monitoring. J Struct Dyn 2023; 2: 24–50.

38.

Zhao

Alzubaidi

Zhang

, et al. A comparison review of transfer learning and self-supervised learning: definitions, applications, advantages and limitations. Expert Syst Appl 2024; 242: 122807.

39.

Sarker

. Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Comput Sci 2021; 2: 420.

40.

Feng

Reddy

RMR

Sheikh

, et al. A sub-laminate based higher-order model for bending of laminated beams containing multiple delaminations. Compos Struct 2022; 294: 115729.

41.

Cui

Chen

. Multi-scale convolutional neural networks for time series classification. arXiv 2016; abs/1603.06995, 2016.

42.

Tan

. Efficientnet: rethinking model scaling for convolutional neural networks. In: Proceedings of the 36th International Conference on Machine Learning (PMLR), Melville, NY, USA, 2019, pp. 6105–6114.

43.

Chiang

C-K

Sha

. Hyper-parameter tuning under a budget constraint. In: Proceedings of the 28th international joint conference on artificial intelligence, Macao, China, 10–16 August 2019, pp. 5744–5750.

44.

Ioffe

Szegedy

. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning (PMLR), Lille, France, 2015, pp. 448–456.

45.

Hendrycks

Gimpel

. Gaussian error linear units (GELUs). arXiv: Learning, 2016.

46.

Szegedy

Liu

Jia

, et al. Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Boston, MA, USA, 2015, pp. 1–9.

47.

Wang

Kuen

, et al. Recent advances in convolutional neural networks. Pattern Recogn 2018; 77: 354–377.

48.

Agarap

. Deep learning using rectified linear units (ReLU). arXiv 2018; abs/1803.08375, 2018.

49.

Shen

Sun

. Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Salt Lake City, UT, USA, 18–22 June 2018, pp. 7132–7141.

50.

Hochreiter

. Long short-term memory. Neural Comput 1997; 9: 1735–1780.

51.

Tien

Quang

Ngoc

, et al. Time series data recovery in SHM of large-scale bridges: leveraging GAN and Bi-LSTM networks. Structures 2024; 63: 106368.

52.

Hochreiter

Schmidhuber

. Long short-term memory. Neural Comput 1997; 9: 1735–1780.

53.

Yang

Zhou

, et al. A data-driven structural damage detection framework based on parallel convolutional neural network and bidirectional gated recurrent unit. Inform Sci 2021; 566: 103–117.

54.

Cho

Merrienboer

Bahdanau

, et al. On the properties of neural machine translation: encoder–decoder approaches. In: Eighth workshop on syntax, Semantics and Structure in Statistical Translation (SSST-8), Doha, Qatar, 25 October, 2014.

55.

Chung

Gülçehre

Cho

, et al. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014; abs/1412.3555, 2014.

56.

Meyer

. An alternative probabilistic interpretation of the Huber loss. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Virtual conference, 19–25 June 2021, pp. 5261–5269.

57.

Ciampiconi

Elwood

Leonardi

, et al. A survey and taxonomy of loss functions in machine learning. arXiv 2023; abs/2301.05579, 2023.

58.

Hastie

Tibshirani

Friedman

, et al. The elements of statistical learning: data mining, inference, and prediction. New York, NY: Springer, 2009.

59.

Nagelkerke

. A note on a general definition of the coefficient of determination. Biometrika 1991; 78: 691–692.

60.

Kingma

. Adam: a method for stochastic optimization. In: Proceeding of 3rd international conference for learning representations, San Diego, CA, USA, 2015.

61.

Zeng

Feng

, et al. A likelihood-free Bayesian approach for characterisation of multiple delaminations in laminated composite beams. Struct Health Monit 2024.

62.

Feng

Sheikh

C-T

, et al. An efficient modelling technique for simulation of guided waves in delaminated composite and sandwich structures. J Sound Vib 2023; 566: 117929.

63.

Song

J-H

Wang

Belytschko

. A comparative study on finite element methods for dynamic fracture. Comput Mech 2008; 42: 239–250.

64.

Rajagopal

Drozdz

Skelton

, et al. On the use of absorbing layers to simulate the propagation of elastic waves in unbounded isotropic media using commercially available finite element packages. NDT & E Int 2012; 51: 30–40.

65.

Pettit

Walker

Cawley

, et al. A stiffness reduction method for efficient absorption of waves at boundaries for use in commercial finite element codes. Ultrasonics 2014; 54: 1868–1879.

66.

Liashchynskyi

. Grid search, random search, genetic algorithm: a big comparison for NAS. arXiv 2019; abs/1912.06059, 2019.

67.

Pedram

Fateri

Gan

, et al. Split-spectrum processing technique for SNR enhancement of ultrasonic guided wave. Ultrasonics 2018; 83: 48–59.

68.

Zhang

Ding

, et al. Intelligent rotating machinery fault diagnosis based on deep learning using data augmentation. J Intell Manuf 2020; 31: 433–452.

Data-driven generalization for guided wave damage identification across beam-like structures

Abstract

Keywords

Introduction

Non-model-based and model-based approaches for damage detection using GWs

Data-driven approaches in damage detectionusing GWs

Generalization in GW-based damage detection

Proposed data-driven approach for damage identification

MC module

SE attention

Hybrid network of Bi-LSTM and Bi-GRU network

Bidirectional long short-term memory

Bidirectional gated recurrent unit

Loss function and evaluation metrics for damage identification

Huber loss function

Evaluation metrics

Training and structure details of the proposed approach

Dataset generation for generalization framework

Numerical simulation using HOBM

Damage cases

Generalization approach implementation for different beam-like structures

Case study 1: comparative analysis of DL frameworks for generalization performance

Case study 2: performance of the proposed framework under varying material uncertainties

Case study 3: performance of the proposed framework under different unavoidable noise and measurement errors

Case study 4: generalization capability of the proposed framework on unseen structural configurations

Experimental study

Experimental setup

Results and discussion

Conclusion

Footnotes

Appendix A

Authors’ note

Declaration of conflicting interests

Funding

ORCID iDs

References