Abstract
The acoustic emission (AE) technique is commonly utilized for identifying source mechanisms and material damage. In applications requiring numerous sensors and limited detection areas, achieving significant cost savings, weight reduction, and miniaturization of AE sensors is crucial. This prevents excessive weight burdens on structures while minimizing interference with structural integrity. Thin Piezoelectric Wafer Active Sensors (PWAS), compared to conventional commercially available sensors, offer a miniature, lightweight, and affordable alternative. The low signal-to-noise ratio (SNR) of PWAS sensors and their limited effectiveness in monitoring thick structures result in the decreased reliability of a single classical PWAS sensor for damage detection. This research aims to enhance the functionality of PWAS in AE applications by employing multiple thin PWAS and performing a data-level fusion of their outputs. To achieve this, as a first step, the selection of the optimal PWAS is performed and a configuration is designed for multiple sensors. Pencil break lead (PBL) tests were performed to investigate the compatibility between selected PWAS and traditional WSα and R15α sensors. The responses of all sensors from different AE sources were compared in both the time and frequency domains. After that, convolutional neural networks (CNNs) combined with principal component analysis (PCA) are proposed for signal processing and data fusion. The signals generated by the PBL tests were used for network training and evaluation. This approach, developed by the authors, fuses the signals from multiple PWAS and reconstructs the signals obtained from conventional bulky AE sensors for damage detection. Three CNNs with different architectures were built and tested to optimize the network. It is found that the proposed methodology can effectively reconstruct and identify the PBL signals with high precision. The results demonstrate the feasibility of using a deep-learning-based method for AE monitoring using PWAS for real engineering structures.
Keywords
1. Introduction
An in-situ monitoring system should be able to undertake a proper assessment of residual life and evaluate the degradation of structural components (Nair and Cai, 2010) as well as to change the maintenance paradigm from scheduled maintenance to condition-based maintenance (CBM) (Nokhbatolfoghahai et al., 2022). The rapid development of infrastructure and technological advances has led to a growing interest in Structural health monitoring (SHM). Acoustic emission (AE) is one of the most promising non-destructive testing (NDT) techniques in the field of SHM (Barile et al., 2020; Calabrese and Proverbio, 2020; Carrasco et al., 2021; He et al., 2021; Nair and Cai, 2010; Saeedifar and Zarouchas, 2020; Verstrynge et al., 2021). AE technology relies on the radiation of acoustic (elastic) waves in a medium. The elastic waves are produced by the energy released due to suddenly internal deformation in a material, such as plastic deformation, crack formation/expansion, fiber breakage, delamination in composites, impact etc. An AE system is normally made up of sensors, preamplifiers, and data acquisition devices. The AE sensors are mounted on the surface of the material, as shown in Figure 1 (Cheng et al., 2022). Surface motions f(t) produced by the elastic waves are transformed to electronic signals a(t) by transfer functions, which can be mathematically represented as
Where w(t), wf(t), and wa(t) are the time transfer functions of AE sensors, filters, and preamplifiers. These collected signals can yield useful information for SHM at different levels, including signal identification, damage localization, and life prediction. The frequency responses of the filters (wf(t)) and amplifiers (wa(t)) are commonly characterized by flat or constant responses (Grosse and Ohtsu, 2008). This implies that the recorded AE signal a(t) is highly sensitive to w(t) corresponding to different AE sensors. Thus, the sensitivity of the AE sensors is critical for AE monitoring.

Acoustic emission (AE) principle (Cheng et al., 2022) where w(t) is the transfer function of the AE sensor, wf(t) is the transfer function of the filter, and wa(t) is the transfer function of the amplifier.
AE-emitted signals are often recorded using high-sensitive piezoelectric sensors with the main frequency response within the 20 kHz to 1 MHz range. In conventional AE sensors, a PZT (Lead Zirconate Titanate) element is often enclosed in a stainless-steel housing to enhance sensitivity response (see Figure 2) (Nakamura, 2016). They also have a detection face with wear plate to improve the electrical isolation from the tested object. These conventional sensors have been successfully used in a variety of structures (Chen et al., 2020; Cheng et al., 2021; Joseph et al., 2021; Liu et al., 2018; Wotzka and Cichon, 2020; Yu et al., 2013). However, detecting AE signals with conventional sensors has inherent disadvantages, such as their bulky size and high cost. It is infeasible to apply large quantities of conventional AE sensors to the structure due to the high costs of the monitoring. Meanwhile, the non-negligible mass/volume of the sensors hinders their practical applications, especially for in-situ SHM (Theses et al., 2017). Bulky AE sensors can be intrusive and potentially disturb the normal operation and functionality of the monitored structure, particularly in aerospace applications (Giurgiutiu, 2020). Furthermore, a real-world example illustrating the potential application of AE in restricted access areas is shown in Figure 3. This is an innovative C1 wedge connection used in offshore wind turbine (OWT) structures (Creusen et al., 2022). To ensure the reliability of this connection and prevent catastrophic failure, it is of paramount importance to have an effective method for early fatigue crack detection in this connection. However, the limited available space, with a height of less than 1 cm, makes it impossible to install conventional bulky AE sensors in the desired areas.

Typical structure of a conventional PZT bulky AE sensor (Nakamura, 2016).

The C1 wedge connection: (a) covered by corrosion protection plates in the OWT structure (Creusen et al., 2022), (b) lay-out of the connection, (c) principal stress distribution of the lower segment (the critical component).
PWAS (Piezoelectric wafer active sensors) are miniature, low-cost, and lightweight, with several millimeters in height and a broadband response frequency. PWAS work well for both exciting and detecting ultrasonic guided waves in SHM (Giurgiutiu, 2007; Giurgiutiu and Zagrai, 2000). The installation process of PWAS can be complex and time-consuming, requiring expert knowledge and skills for proper placement and wiring. Over the past decade, numerous studies have demonstrated the feasibility of monitoring acoustic emissions with PWAS (Bhuiyan et al., 2018; Jiang et al., 2018; Yu et al., 2012; Zhang et al., 2020). Compared to commercial sensors, PWAS can deliver equivalent detection findings regarding fatigue cracks in thin plate-like materials. However, PWAS sensors are less effective for monitoring the thick steel plate where they have only captured 5% of the AE events of recorded by R15I sensors (Yu et al., 2011). Additionally, the analysis and comparison between conventional and PWAS are typically performed without any prior selection study for PWAS in passive sensing applications (Trujillo et al., 2014; Yu et al., 2012). Due to a lower signal-to-noise ratio (SNR), noise significantly impacts the efficiency of PWAS, hindering its performance. Furthermore, relying solely on measurements recorded by a single type of small PZT sensor may result in incomplete information regarding the underlying damage mechanism (Gall et al., 2018; Godin et al., 2018; Hamam et al., 2019). Using a single type of PWAS for AE monitoring has limited effectiveness, but it remains a viable alternative for monitoring critical areas with restricted access. To enhance the accuracy of passive AE monitoring using PWAS, it is essential to implement an optimized configuration of multiple PWAS along with advanced signal processing methods.
Data fusion can extract information from the combination of different datasets instead of analyzing each dataset separately (Wu and Jahanshahi, 2018). Previous works demonstrate the efficiency of combining data from multiple sensors to increase the accuracy and dependability of collected information. It is recommended to implement feature-based data fusion if the original data contain clearly distinguishable and adequate detectable features. In the case of an AE application with PWAS, data-level fusion is preferable since the original raw data from one type of PWAS cannot represent the essential characteristics of the damage mechanisms. However, the majority of research about multi-sensor data fusion has been performed at the feature level (Broer et al., 2021; Dehghan Niri et al., 2013; Guel et al., 2020; Liu et al., 2020; Saboonchi et al., 2016). Considering the higher SNR ratio of conventional bulky AE sensors, this study proposes a data-level fusion approach to reconstruct the signals from traditional AE sensors via the signals from PWAS (see Figure 4). Compared to other available approaches, deep-learning-based approaches have been applied in a range of academic domains for image processing. Although 1D-CNNs can apply to time-series data, Wu et al. (2018) have reported that 2D-CNNs perform better than the 1D-CNNs method and achieve higher accuracy and robustness. To enhance the efficiency of data-level fusion using CNN, PCA is used to minimize the dimensionality of the obtained waveforms without losing critical signal information. Dimensionality-reduced signals extracted from selected PWAS are saved as image files, which are then used as input data for the 2D-CNN. The output consists of waveforms derived from PCA-processed signals of conventional bulky sensors.

A flowchart illustrating the proposed data-level fusion approach.
This work explores the implementation of PWAS of different sizes for AE monitoring. The selection of PWAS is performed according to the proposed criteria, aiming to enhance the completeness of the collected data. A detailed experimental analysis using Pencil break lead (PBL) tests on a compact-tension (CT) specimen is conducted to test the interchangeability between different types of sensors for AE monitoring. After that, we utilize a combination of multivariate analysis based on principal component analysis (PCA) and convolutional neural network (CNN) to fuse the signals from different PWAS. Finally, the performance of the established CNN is evaluated with extracted features.
This paper is organized as follows: In Section 2, a brief methodology overview is presented. Sections 3 and 4 describe the experimental setup and discussion. Section 5 details the proposed methodology for data fusion of multiple PWAS and evaluates its performance. The conclusions are drawn in Section 6.
2. Overview of theoretical background
To explain the concept of the proposed methodology in more detail, Section 2 is divided into three parts. The most employed acoustic emission signal features are briefly explained in Section 2.1. An introduction to the CNN architecture is given in Section 2.2. Then Principal Component Analysis (PCA), which is utilized for dimensionality reduction of the input of the established CNN, is presented in Section 2.3.
2.1. Acoustic emission signal features
SHM with the AE technique is properly explained by AE signal parameters. Figure 5 shows the most widely used AE features in the time domain. Frequency-domain features also play a significant role in interpreting AE signals. Representative parameters are the peak frequency (PF) and the frequency centroid (CF). Besides, AE signals can be described more precisely by the partial power (PP) features which measure the ratio of power in a user-specified frequency range (f1 – f2) to the complete power of a signal:
Where a power spectral density formula

Conventional AE features in (a) time domain (ElBatanouny et al., 2014), (b) frequency domain (Grosse and Ohtsu, 2008).
2.2. Convolution neural network
The convolutional neural network (CNN) is a powerful tool for feature extraction and image analyses. In contrast to traditional Artificial neural networks (ANNs), CNN takes advantage of local connections, shared weights, and sub-sampling. These operations enable the CNN model to extract representations of the image automatically at a low computational cost. CNN is designed to process data from different arrays, such as 1D data like signals and sequences, 2D data for pictures or spectrograms, and 3D image data like computerized tomography scans.
A typical CNN model can be found in Figure 6. It consists of three main kinds of layers, that is, a convolutional (Conv) layer, a pooling layer and a fully connected (FC) layer (Xin et al., 2020). A convolutional layer contains multiple filter kernels to extract local features over the whole image by the sliding process. These obtained features are then triggered by a non-linear function such as Sigmoid, Tanh, and rectified linear unit (ReLU). ReLU is the most frequently used activation function due to its simple implementation and less susceptibility to vanishing gradients (Alom et al., 2019). Local response normalization (LRN) layers execute a mathematical operation on the n × n area to constrain the unbounded nature of ReLU. To address the issues of internal covariate shift, batch normalization (BN) is often used to stabilize and speed up network training.

Illustration of a typical conventional neural network.
The generated feature maps usually have a considerable number of spatial dimensions after the convolutional operation. The pooling layer is added to reduce parameters, control over-fitting, and retain valid feature information. Convolution layers and pooling layers are repeated several times before being connected to the fully connected layers. In the fully connected layers, dropout layers randomly set input neurons as zero with a specific rate to avoid overfitting (Wu and Gu, 2015). In this research, CNN is employed to fuse the signals from multiple PWAS to signals from the conventional AE sensors. The framework of the proposed CNN-based multi-sensor data fusion will be described in Section 5.
2.3. Principal component analysis
Considering computational inefficiency for data analysis, principal component analysis (PCA) is one of the most popular multivariate statistical analyses. This method can transform high-dimensional data into a low-dimensional subspace while retaining the most important features of the original data (Abdi and Williams, 2010). During the dimensional reduction process of PCA, new relevant variables defined as principal components (PC) are generated. The cumulative contribution rate (CCR) is used to select the proper number of PCs which is expressed as:
where Di is the singular values of matrix Σ in decreasing order, representing the directions of the variances. The goal is to select the value of k to be as small as possible but with a reasonably high CCR. In the context of SHM, PCA has been extensively applied with the extracted features from AE signals in the time domain (Baccar and Söffker, 2017).
3. Experimental set-up
3.1. Sensors applied for measurements
Shape, material, and geometry are key parameters for selecting PWAS. PWAS disks are commonly used due to their omnidirectional Lamb wave generation and sensing capabilities (Sohn and Lee, 2010). Regarding the material selection, two material parameters are significant when selecting suitable PZT materials (Giurgiutiu and Zagrai, 2000), namely the piezoelectric charge coefficient d31 and the piezoelectric voltage coefficient g31. The design methodology of PWAS for actuators and receivers has been addressed in depth by Ochôa et al. (2019). Ochôa proposed using this formulation for the sensor output voltage V0, which is directly proportional to the harmonic function f (ξS, ξA, ra). This function accounts for the influence of sensor radius and frequency:
where ra is the sensor radius, xS and xA are the wavenumbers of the symmetric and antisymmetric Lamb wave modes, respectively. The wavenumbers are dependent on frequency, as known as a dispersion relation. J1 is the Bessel function of Order 1. Equation (4) allows the evaluation of the frequency sensitivity of thin PZT sensors as receivers under different radiuses.
Due to the electro-mechanical (E/M) behavior, the piezo ceramic may experience two extreme states, resonance, and anti-resonance. Typically, an unstable transitory regime can be formulated from the points of resonance and anti-resonance. This regime should be avoided as it is detrimental to the steady and reliable application of PZT sensors. Giurgiutiu proposed equations to calculate the admittance Y(w) and impedance Z(w) of a circular PZT sensor constrained by a structure (Giurgiutiu, 2007):
where C is the electrical capacitance of the thin PZT sensors, kp is the complex planar electromechanical coupling coefficient of the PZT material, va is the Poisson ratio of the PZT material, J1 and J0 is the Bessel functions of Order 1 and Order 0, respectively. The complex phase angle in the PZT material is represented by
For the AE application, the criteria for employing PWAS only as receivers are proposed in (Cheng et al., 2022): (a) The selected PWAS must have a high piezoelectric voltage coefficient of g31 and sufficient sensitivity over the frequency range for the intended application. For instance, The frequency range for metallic structures is recommended as 100–900 kHz (Ozevin, 2020); (b) The electro-mechanical (E/M) resonance point of the PWAS should not coincide with or be close to any local peaks in the PWAS voltage output. Additionally, at least one of these local peaks should fall within the investigated frequency range; (c) PWAS of greater thickness are preferred since their output response amplitudes are directly proportional to the thickness (Inman et al., 2005). Following these criteria, PZT-1 (ra:10 mm × TH: 0.79 mm), PZT-2 (ra:5 mm × TH: 0.8 mm), and PZT-3 (ra:3 mm × TH: 0.5 mm) were selected ultimately. TH is the thickness of the PZT sensors.
The properties of the selected three types of PWAS are illustrated in Table 1 and Figure 7. Figure 8 displays the sensor output f (ξS, ξA, ra) and E/M admittance response |Y| of PZT-1/2/3, which satisfy criteria (b). Frequencies beyond 500 kHz are removed because they are useless for the majority of applications (Ozevin, 2020). In the frequency band of interest, PZT-1 has multiple resonant frequencies compared to other sensors. The sensor output considering thickness f (ξS, ξA, ra) × TH for PZT-1/2/3 is shown in Figure 8(d). It can be observed that these three PWAS together span a frequency range of 0–500 kHz efficiently, which is inaccessible to individual PWAS. For the R15α sensor and WSα sensor, their frequency sensitivity spectra are obtained from MISTRAS Group (Figure 9).
Characteristics of sensors.
ra and TH is the radius and thickness of selected sensors, respectively.

Illustration of the sensors used in the measurement.

Sensor output f (ξS, ξA, ra) and E/M admittance response of PZT-: (a) 1, (b) 2, (c) 3, and (d) frequency versus f (ξS, ξA, ra) × TH for PZT-1/2/3.

Frequency sensitivity spectrum of (a) R15α sensor, (b) WSα sensor.
3.2. Measurement setup
To assess the feasibility of PWAS of different sizes and to compare the results with those recorded using the conventional bulky AE sensors, compact tension (CT) specimens (85 mm length, 85 mm width, and 15 mm thickness), steel grade S355 were tested. The plate dimensions were selected such that the geometrical complexity and greater thickness are included in the measurements. Two sensor combinations were used in the tests (see Figure 10(a)): (1) R15α and PZT-1/2/3; and (2) WSα and PZT-1/2/3. To simulate the crack initiation, pencil lead breaking tests were performed at the notch tip (see Figure 10(b)). A total number of 120 pencil lead break tests were performed for each sensor layout while the parameters and settings remained the same for each test. To obtain enough data with randomness and uncertainty (Sause, 2011), the pencil leads were broken with various free lengths of 2–4 mm and contact angles of 20°–60°. The R15α sensor and WSα sensors were mounted on the specimen surface using a thin layer of silicone grease as an ultrasonic couplant. The PWAS were bonded to the specimen surface using cyanoacrylate glue.

Experimental set-up: (a) layout of sensors, (b) illustration of PBL tests, (c) mistras AE system, (d) signal amplifier.
An eight-channel MISTRAS AE system with a 40 dB pre-amplifier was used. A threshold value of 50 dB was set to filter noise. Each measurement was recorded at a sampling rate of 5 MSPS (one sample per 0.2 μs) and 800 μs long. A pre-trigger of 256 μs was defined to capture the signal sources. The peak defined time (PDT), hit definite time (HDT), hit lookout time (HLT) and Max duration was set as 400, 800, 800, and 99 ms, respectively. These four timing parameters play significant roles in data acquisition, including hits recording and signal feature extraction. The definition of these timing parameters was determined using PBL trial tests.
4. Experimental results discussion
AE signals experience multiple reflections, refractions and mode conversion as a result of the geometrical complexity and dynamic environment (Cheng et al., 2021; Haile et al., 2018). During the PBL tests, background noise can be eliminated effectively by recording signals without loading. However, the AE hits caused by side-edge reflection and electronic noise cannot be completely avoided after breaking pencil leads. Hence, the recorded signals are separated into two categories: (a) the directly arrived hits as PBL signals; (b) other hits as noise signals. This section compares the PBL signals and noise signals recorded by chosen sensors. PBL signals measured from multi-sensors (in each layout) are identified using data association (Guel et al., 2020). An association is considered if the signals appear within a temporal window of less than 20 μs. This value was determined experimentally to ensure the accurate identification of the correct number of PBL events.
4.1. Signal response from breaking pencil leads
The similarity of PBL signals recorded from five types of sensors is analyzed using a cross-correlation coefficient (CCC) (Mukherjee and Banerjee, 2020). The higher value of CCC corresponds to the higher similarity between the signals from different pairs of sensors. The average values of the maximum cross-correlation coefficient (MCCC) obtained during 120 PBL events is summarized in Table 2. A low similarity between PBL signals from conventional AE sensors and PWAS (PZT-1/2/3) is observed in Table 2.
Similarity of PBL signals.
Examples of the waveform in the time domain, corresponding Continuous Wavelet Transform (CWT) scalogram, and FFT spectrum of PBL signals are shown in Figures 11 and 12. The CWT scalograms are employed to accurately discriminate the distribution of signal energy in both the time and frequency domains. The waveforms show significant differences and only PZT-3 exhibits a burst-type response. Differences are also strong in the CWT scalograms based on the Gabor wavelet of the waveforms as well. The aperture effect significantly distorts signals received by sensors with larger sizes, potentially introducing additional distortion from scattering or reflections (Tsangouri and Aggelis, 2018). Hence, PZT-3 shows a distinctive response that is less influenced by these distortions. The WSα sensor is characterized by a flat frequency response over a wideband frequency range of 100–1000 kHz. This sensor enables more accurate identification of the natural frequency characteristics of AE sources. The average value of PP(80:500kHz) of 92% is calculated from the WSα sensor. Therefore, 80–500 kHz is selected as the primary frequency band of interest in the target objective.

Example of AE signals captured by: (a) R15α sensor, (b) WSα sensor.

Example of AE signals captured by PZT-: (a) 1, (b) 2, and (c) 3.
Table 3 describes the properties of the signals obtained from various sensors as shown in Figures 11 and 12. The first row in Table 3 presents the resonant frequency of each sensor, as identified from their sensitivity responsese curves. Pertaining to PZT-1/2/3, the peaks of their sensor output function (see Figure 8) are extracted as the resonant frequency. The frequency band where sensors exhibit high sensitivity can be determined from the typical boundaries of areas with high intensity in the CWT scalograms from Figures 11 and 12. Comparing the sensitivity curves in Figure 8(d) with the high-sensitivity frequency bands listed in Table 3, it is noteworthy that the bands for PZT-2 and PZT-3 align well with the theoretical sensitivity response. However, for PZT-1, the frequency band with high sensitivity shifts relative to the theoretical curve due to its comparatively lower sensitivity. This shift also explains why the amplitude of the signal captured by PZT-1 is smaller than that of the other two PWAS. The frequency peak observed in the low-frequency range (below 100 kHz) in the FFT response shown in Figure 12 can be attributed to electromagnetic interference during measurement (Giurgiutiu and Zagrai, 2000). It is important to note that the theoretical sensitivity response curve assumes wave propagation through an infinite isotropic plate, which is not achievable in practice. Boundary effects interfere with wave propagation and even affect the first pulse received by the sensor. Consequently, the actual frequency response of the PBL signal differs from the theoretical curves.
Time dependency of the frequency content of signals from different sensors.
The WSα sensors clarify that the primary frequency composition of detected signals is between 120 and 350 kHz, which is the original feature of the AE source. However, the R15α sensor drastically changes its frequency content to 80–200 kHz. Similarly, the frequency sensitivity of PWAS also affects the nature of the AE-emitted source (see the second row in Table 3). The significant attenuation of the signal captured by PZT-3 can be ascribed to the fact that the nature frequency of detected signals deviates from its high resonant frequency.
4.2. Signal distinction
Compared to other features, the amplitude of signals is independent of the acquisition threshold (Santo et al., 2019). The amplitude distribution of recorded signals during the PBL tests is shown in Figure 13 to quantify the uncertainty of recorded signals. As seen in Figure 13, the R15α sensor presents a much higher amplitude compared to the other sensors, representing a higher sensitivity. The median values of amplitude from the other sensors are approximately similar to each other. Noise signals can be distinguished effectively by considering the amplitude of the signals. The amplitude of PBL signals captured by R15α and WSα usually exceeds 88 dB. Signal association techniques [35] are employed to distinguish PBL signals from noise signals for PZT-1/2/3. The conventional bulky AE sensors show a more reliable performance than PWAS without overlap of amplitude distribution between PBL and noise signals. The fourth row in Table 3 presents the overlap in amplitude distribution of sensors.

Amplitude distribution of PBL signals and noise signals captured by the investigated sensors.
The distinction between signals from PBL and noise is necessary to guarantee the sensor performance for damage detection. As seen in Figure 14, the WSα sensor captures the smallest number of noise signals compared to other sensors. In contrast, it can be observed that the other sensors record a significant amount of noise signals that are not the first arrived signals. This indicates that the WSα sensor offers superior data quality compared to the other sensors, highlighting its enhanced performance in capturing the desired signals. Figure 15 shows the comparison between signals recorded by different types of sensors in the frequency domain. The PBL signals recorded by R15α, PZT-1, and PZT-3 clearly overlap with the noise signals in the frequency domain. On the contrary, the noise signals from WSα and PZT-2 can be efficiently filtered by high-pass and low-pass filtering with a narrow overlap band, respectively. The situation of overlap in the frequency domain between recorded PBL and noise signals is summarized in Table 3. Considering that the R15α sensor is more sensitive to noise than WSα, WSα shows a performance benefit in AE detection, especially in a noisy environment.

Number of received signals during PBL tests.

Comparison of signals from: (a) R15α, (b) WSα, (c) PZT-1, (d) PZT-2, and (e) PZT-3.
5. Data fusion of multiple PWAS
As described in Section 4, the WSα sensor is an attractive option for AE detection in a noisy situation. To extend the application of PWAS in fatigue monitoring of structures, this section proposes the methodology to fuse the data from multiple PWAS to WSα sensors.
5.1. Data preparation
The experimental set-up for the fatigue test is presented in Figure 16. PBL tests were manually performed at the notch tip of the installed CT specimens, using a measurement set-up similar to that described in Section 3.2. To account for variations in bonding layer and soldering quality, four CT specimens equipped with PWAS sensors were tested to collect data. In this set-up, pencil leads were broken from both sides of the specimens to represent the actual crack initiation. In total, 5484 pairs of PBL signals were captured using data association. The signals captured by PWAS are prepared as the input data, while the output is the signals received by WSα sensor.

Experimental set-up for data fusion.
To increase the computational efficiency, the signals used for data fusion are pre-processed in the following steps: (1) remove pre-trigger of 256 μs; (2) cut waveform to a certain wavelength of 200 μs to make sure a satisfactory frequency resolution of 5 kHz; (3) remove the electrical noise that contaminates the signals through the implementation of appropriate filters 80–500 kHz; (4) PCA is applied to reduce the number of features as seen in Figure 17, which makes the machine learning model simpler and less data hungry. It should be noted that PCA is employed for the signals both from the PWAS and the WSα. PCA is conducted with the open-source Python package scikitlearn.

Preparation of database by PCA.
The cumulative contribution rate (CCR) of the first 85 principal components (PCs) is described in Figure 18(a). CCR increases accordingly as the number of employed PCs increases and reaches around 95% with k = 85. This indicates that most information in the waveforms is represented by the first part of the PCs. Hence, it is determined that PCA is efficient to compress the feature number from 2048 to 85 with only a 5% loss of the total information in the original waveforms. Figure 18(b) shows an example to illustrate the comparison between original waveforms and PCA-inversed waveforms. After that, the reduced 3 × 85 matrix of 5484 events is saved as images as shown in Figure 17. These images are then passed as the input data for the CNN model in this research. The corresponding dimension of output data is also decreased from 2048 to a vector with 85 features.

(a) The contribution rate for the first 85 principal components, (b) comparison between original and PCA inversed waveforms.
5.2. Customized model evolution
The contribution rate of each principal component is represented by the PCA singular value Di. To signify the uneven contribution of features in the CNN model, a weighted Mean Squared Error (MSE) is utilized (Chang et al., 2022) as the loss function. PCs with higher contribution rates are weighted more heavily than those with low contribution rates:
The i-th sample’s actual and predicted values in the j-th feature are represented by
y
i,j
and
The cross-correlation function is commonly used to evaluate the similarity between time series (Cassisi et al., 2012). Hence, the averaged MCCC between actual and predicted value after PCA-inverse is calculated as the assessment indicator.
The i-th sample’s actual and predicted values after PCA-inverse transformation are represented by Yi and
5.3. CNN model construction
The developed model is named NAE-PZT, which means that it is a “Network trained to fuse the output from PWAS for AE monitoring.” NAE-PZT adopts the waveform information from PWAS as input values. Similar to a typical CNN model, the input is transformed into a more and more abstract and composite representation layer by layer. Finally, the signal from the conventional bulky AE sensor is reconstructed.
NAE-PZT is a fully convolutional neural network as shown in Figure 19. The first layer is the input layer of 3 × 85 × 3 pixels resolutions. The input layer receives images representing the signals from three types of PWAS, and then these images are fed into two Conv blocks. Each Conv block contains two Conv layers with ReLU as an activation function and a Batch Normalization (BN) layer. In the second Conv block, a Maximum Pooling (MaxPool) layer is added to reduce the number of features. Ultimately, the feature maps are flattened into a single 1 × 85 vector via a Global Average Pooling (GAPool) layer and a Fully Connected (FC) layer.

Architecture of the proposed model NAE-PZT.
Three kinds of CNNs with identical layers and different channels of feature maps are adopted which are named NAE-PZT,1, NAE-PZT,2, and NAE-PZT,3 respectively. The detailed network settings are given in Table 4. During the convolution operation, the kernel size is selected as 3 × 3 by considering the real input size and processing time.
The configuration of the CNN architecture.
5.4. CNN model training
The dataset as described in Section 5.1 is imported into the constructed CNN architecture. A comparison study of three kinds of CNNs was performed to investigate the influence of the complexity of the network. The CNN model adopts a split dataset in a ratio of 4:1 because of the relatively small dataset. Thus, there are 4935 and 549 samples in the training and testing datasets, respectively. Testing data is not used for training the network. A batch size equal to 32 was adopted with a shuffling approach to redistribute the data for better performance. This CNN model was built and trained with a learning rate of 0.001 in PyTorch.
5.5. Evaluation of the trained CNN
Figure 20 illustrates the training history of loss and learning curves of three models for the training and testing dataset. The loss and accuracy values are shown in every one epoch and the maximum number of epochs employed in this research is 500. The accuracy represents the difference between the actual and predicted waveforms from the associated WSα sensor. The increasing rate of training accuracy is very fast at the beginning of the training process. Afterwards, the value of loss and accuracy becomes steady. The final accuracy of three CNNs is above 95% for training data. The final testing accuracy of NAE_PZT1, NAE_PZT2, and NAE_PZT3 reaches 76%, 79%, and 82%, respectively. Under identical layers, NAE_PZT3 delivers the best prediction performance which was selected for further analysis.

(a) Loss and (b) learning curves of NAE_PZT1, (c) loss and (d) learning curves of NAE_PZT2, (e) loss and (f) learning curves of NAE_PZT3.
The relative frequency of MCCC between actual and predicted results is calculated to show the percentage of samples with high prediction accuracy. As shown in Figure 21 (bottom-left corner), the MCCC values for 99% of the training dataset and 86% of the testing dataset of NAE_PZT3 were found to be higher than 0.8. This indicates that 14% of the testing dataset fails to be reconstructed with high accuracy. To further evaluate the reliability of the NAE_PZT3, the AE features listed in Table 5 are extracted from the actual and predicted waveforms for comparison. These features are extracted as they are slightly threshold-dependent (Santo et al., 2019) and have been applied effectively for damage identification (Kharrat et al., 2016; Sause et al., 2012; Vetrone et al., 2021; Wisner et al., 2019). Figure 21 shows the correlation matrix of AE features from actual and predicted results. The average value of the correlation coefficients for the training data and testing data are 93% and 74%, respectively. It should be noted that the correlation value of the peak frequency is relatively lower than the other features. This can be attributed to the characteristics of the wide-band frequency response of the WSα sensor.

Feature correlation matrix and MCCC between actual and predicted results of (a) training dataset, (b) testing dataset.
Description of the selected AE features.
Mean absolute deviation (MAD) quantifies the spread in the dataset. It is computed as the average of the absolute differences between each data point and the mean value. A smaller MAD represents a dataset that is more clustered. After calculation, it was found that the peak frequency of the actual waveforms from the WSα sensor has the highest MAD of 0.13, while the MAD values of the other features are less than 0.09. This demonstrates that peak frequency is more widespread and less effective for identification and classification. Hence, Figure 22 presents the distributions of the parameters with high feature correlation values except for the peak frequency. By comparison, the distribution of these parameters predicted by NAE_PZT3 closely coincides with the distribution of actual results in the testing dataset. These results further justify that the proposed network model NAE_PZT3 can effectively reconstruct the waveforms from the reference sensor.

Distributions of AE parameters extracted from actual and predicted results.
5.6. Further discussion
It can be concluded that the proposed methodology achieves high precision and reliability for data-level fusion using multi-PWAS. As the feature indication of damage-related signals of WSα is well reconstructed and identified, the damage detection can be performed after successful data fusion of multiple PWAS. However, it should be underlined that this proposed methodology is validated using PBL tests on CT specimens, which are less challenging compared to real practical situations.
It should be noticed that this data fusion methodology is validated using a dataset where sensors are placed at a uniform distance from the source. In future works, the uncertainties caused by different sensor layouts, measuring cables, and soldiering qualities should be considered to train the CNN. For example, this study does not consider variations in distance between the source and each PWAS. Moreover, fatigue tests are imperative to the development and validation of the proposed approach. This method could be particularly useful for identifying the formation of crack tips. Crack propagation can alter wave propagation characteristics and potentially limit the effectiveness of this method. A comparative study of deep learning methods and architectures is also recommended to improve the accuracy of damage detection using PWAS. Last but not least, future studies should explore the field applications of the proposed method to different types of structures with restricted-access areas, especially thicker/complex structures. Such studies are expected to shed new light on possible improvements in the performance of PWAS in AE monitoring. In the future, we will continue to work in this direction, and we hope that other scientists interested in SHM will share data sets to encourage progress in the field.
6. Conclusion
To enhance the functionality of PWAS-based AE application, this paper presents a methodology for performing a data-level fusion of outputs from various PWAS. The performance of selected PWAS is compared to bulky AE sensors, which are widely used for SHM in several fields. After identifying the advantages of the WSα sensor, the reconstruction of the signal measured via WSα was successfully achieved by fusing the signals from multiple PWAS using a developed CNN model NAE-PZT. The key findings are summarized as follows:
(1) The feasibility of the selected PWAS is identified using PBL tests. The chosen three PWAS are found to be more effective in covering the frequency range than an individual PWAS. According to the comparative analysis performed for sensors response captured from breaking pencil leads test on the CT specimens, the R15α sensor and PWAS alter the original features of the AE-emitted source dramatically, while WSα sensor represents the nature of the AE source with higher fidelity due to its flat and wideband frequency response function.
(2) Compared to other types of sensors, the WSα sensor captures the smallest number of noise signals during PBL tests. Besides, the signal response of WSα sensor shows a slight overlap in amplitude and frequency content distribution between PBL and noise signals. Thus, WSα sensor is an attractive option concerning the clear distinction between signals from PBL and noise.
(3) 5484 PBL events were used to train and evaluate the performance of the three developed CNNs with different architectures. All the CNNs can deliver convincing prediction performance in reconstructing the waveforms from the reference sensor, with accuracy all above 95% for training data. NAE-PZT,3 reaches satisfying accuracy for testing data than the other two models.
(4) Feature correlation analysis and similarity analysis are carried out for all datasets. The distribution patterns of seven AE parameters with high feature correlation values predicted by NAE-PZT,3 is found to be consistent with the actual results. As a result, the proposed method shows the promising application potential to enable damage detection using multi-PWAS combing with a waveform-based data-fusion method.
Abbreviations
The following abbreviations are used in this manuscript:
SHM Structural health monitoring
NDT Non-destructive testing
AE Acoustic emission
PWAS Piezoelectric wafer active sensors
SNR Signal-to-noise ratio
PBL Pencil break lead
CNNs Convolutional neural networks
PCA Principal component analysis
OWT Offshore wind turbine
CT Compact tension
ANNs Artificial neural networks
ReLU Rectified linear unit
LRN Local response normalization
BN Batch normalization
PDT Peak defined time
HDT Hit definite time
HLT Hit lookout time
Footnotes
Acknowledgements
The first author wishes to express her gratitude for the financial support of the China Scholarship Council (CSC) under grant number 201806060122. Dr. Ali Nokhbatolfoghahai is supported by the AIRTuB project (Automatic Inspection and Repair of Turbine Blades), funded with a Top Sector Energy subsidy from the Ministry of Economic Affairs of the Netherlands.
Credit author statement
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Data availability statement
The data supporting the findings of this study are available from the corresponding author upon reasonable request.
