Abstract
Impact echo (IE) is capable of locating subsurface defects in concrete slabs from the vibrational response of the slab to a mechanical impact. For an intact slab (“good” condition), the frequency spectrum of the IE is dominated by a single peak corresponding to the slab’s “thickness resonance frequency,” whereas the presence of subsurface defects (“fair” or “poor” conditions) could manifest in various ways such as multiple distinct peaks at frequencies higher, or lower, than the thickness resonance. In previous research, the authors have proposed a frequency partitioning of the spectrum for IE signal classification. Firstly, the thickness resonance frequency band is identified using a data-driven approach and then the IE signals are represented by their energy distribution in three bands—frequencies less than, within, and greater than the thickness resonance. Following this feature extraction, an unsupervised clustering approach is used to identify the centroids for each signal class—good, fair, and poor—which are further used to classify any test signal into one of the three aforementioned classes. The classification is developed by training on unlabeled IE signals from real bridge deck data (the Federal Highway Administration’s [FHWA’s] InfoBridge dataset) without making use of any labeled data. This study aims to validate the proposed methodology on a labeled dataset of eight reinforced concrete specimens constructed at the FHWA Advanced Sensing Technology Nondestructive Evaluation laboratory having known artificial defects. Our findings indicate that the physics-based feature definition and the method developed on real bridge data are robust and can classify IE signals in the labeled data with moderate accuracy.
Condition monitoring and evaluation of bridge decks are imperative to maintaining structural reliability during their service life. Concrete bridge decks are traditionally evaluated using visual inspection methods. Nondestructive evaluation (NDE) techniques have been increasingly popular for the condition evaluation of bridge decks to indicate volumetric damage and deterioration that might not be visible from the surface (
IE excites and records the modal frequencies corresponding to the thickness resonance of the slab, geometric boundaries, and crack-like defects running parallel to the surface (delamination) under a mechanical impact. IE responses (or signals) are typically analyzed in the frequency domain (using the fast Fourier transform algorithm) and the spectral responses are interpreted to assess the condition of the test slab and identify potential defects. See Figure 1 for an illustration of the IE signal–spectrum pair corresponding to each condition—good, fair, and poor. The IE spectrum for a slab in “good” condition, that is, free from damage, exhibits a unimodal spectrum with a single-frequency peak corresponding to its thickness resonance frequency (

Impact echo (IE) signals and spectra corresponding to three different delamination conditions: good, fair, and poor.
Although effective in detecting damage and delamination in concrete slabs, manual IE data interpretation has limited utility in bridge condition evaluation practices. Moreover, there exists limited research on data-driven approaches to automatically classify IE signals. Prior research on automated IE signal analysis relies on machine learning (ML) algorithms such as the support vector machine (
Data Description
In this study, the results of IE tests performed on eight reinforced concrete slabs at the FAST NDE laboratory are used for validation purposes. Each concrete slab, 3.0 m long, 1.0 m wide, and 0.2 m thick, is constructed with “known” artificial defects—shallow delamination, deep delamination, honeycombing, voids, and a transverse vertical crack, as shown in Figure 2. These defects extend 0.30 m along the length of the specimens and 0.20 m along the width of the specimens. The concrete slabs are constructed by a normal-weight concrete mix with a water-to-cement ratio of 0.37 and have a 28-day compressive strength of 27.6 MPa, with two mats of uncoated steel reinforcements of 15.8 mm diameter at 203 mm spacing in both the lateral and transverse directions. Several NDE techniques including IE are used to understand their efficacy and feasibility in detecting subsurface defects. IE tests are performed at a regular grid on the concrete slabs with a grid spacing of 100 mm, both longitudinally and transversely. The test setup consists of an impact source (11 mm diameter steel spheres on spring rods) and a receiving transducer or sensor (e.g., an accelerometer). IE signals are recorded at a sampling frequency of 200 kHz for a duration of 10 ms at every point of the grid. There are 252 recorded IE signals per laboratory concrete slab. Further details on the construction of concrete slabs and the testing can be found in Lin et al. (

Schematic diagram of the concrete slabs with artificial defects.
Methodology
This section provides a brief summary of the IE signal classification scheme as proposed in previous research (
Estimating the Thickness Resonance Bandwidth
To characterize IE signals, it is necessary to establish the spectral features for “good” signals (recorded at slab locations free from damages), where most of the impact energy is reflected back from the bottom of the deck and the frequency spectrum is dominated by one frequency peak corresponding to its thickness resonance. This allows us to automatically identify the thickness resonance frequency (
Therefore, a data-driven approach was developed to estimate the thickness resonance frequency. The IE spectra for all points in the test slab are analyzed to identify the peak frequency corresponding to the maximum energy content in each spectrum. The peak frequency values for all test points on the slab are aggregated to identify the most prevalent frequency peak, which is an indicator of the thickness resonance frequency for each bridge deck. The inherent assumption in this approach is that the thickness resonance is excited at most test points, except at extremely deteriorated points, where the flexural mode of vibration dominates. To estimate the thickness resonance bandwidth, the histogram of peak frequencies is parameterized with a Gaussian mixture model (GMM) assuming that all data points are generated from a mixture of a finite number of Gaussian distributions with unknown parameters. The parameters of the GMM are estimated using an expectation-maximization technique, while the number of Gaussian components in the mixture model is decided based on the Bayesian information criterion (BIC) score. The BIC score provides a weighted measure of the change in the log-likelihood function when different numbers of GMM components are considered. The lowest BIC score provides the model that would be closest to the model with the maximum log-likelihood value, and therefore is chosen as the final model. Once the GMM is fitted with an optimum number of mixture components, the Gaussian distribution with the maximum mixture proportion is chosen as a representation of the thickness resonance. The mode (
Partitioning the IE Power Spectrum
IE signals corresponding to different conditions (good, fair, and poor) have distinct spectral patterns, that is, the distribution of energy among frequencies differs based on the type and extent of damage. Therefore, the energy content in each frequency band might be an indicator of the damage state represented by the IE signal. Once the thickness resonance band of the deck is estimated, the IE spectrum for each signal is analyzed to calculate the energy in the three non-overlapping bands—frequencies less than (Band 1), within (Band 2), and greater than (Band 3) the thickness resonance frequency. Therefore, each IE signal is represented by a three-dimensional vector of the proportion of energy in each band in order, that is,
Classification of IE Signals
Each IE signal is represented by a three-dimensional feature vector of energy proportions in the three bands: Bands 1–3. The feature vectors are input to an unsupervised clustering algorithm, that is,
Results and Discussion
This section details the data analysis and results on the eight concrete slabs. As stated in the

Histograms of peak frequencies with the fitted Gaussian mixture models for test slabs 1–8 (color online only).
Estimates of Thickness Resonance Bandwidth for Each Test Slab
Having the thickness resonance bandwidth for each slab, we can now partition the IE spectra into three bands corresponding to lower than, within, and above the thickness resonance band. Next, the proportion of energy in each band is calculated and is used as a

Feature space representations of impact echo signals in test slabs 1–8.
As we observe from Figure 4, except for shallow delamination, IE signals obtained from different damaged portions of the slab are not localized; rather, they show significant scatter in the feature space. This could be because of the limitation of the proposed spectral approach for delamination characterization to distinguish between different damage types. Next, a
Comparison of Centroids Obtained using Laboratory Data and Long Term Bridge Performance (LTBP) Data
Using the centroids of each cluster, each signal in the dataset can be classified into one of three classes—“good,”“fair,” or “poor”—using a

Clustering of Impact Echo Signals from Different Defects
The IE spectra corresponding to voids have significant energy in the frequencies lower than the thickness resonance, which is similar to that of shallow delamination. On the other hand, spectra corresponding to deep delamination and honeycombing have a peak frequency centered around the thickness resonance with additional higher frequency components. Therefore, based on expert opinion (and the clustering results in Table 3), the accuracy of the IE signal classification is evaluated assuming that shallow delamination and voids represent the “poor” condition, deep delamination and honeycombing are the “fair” condition, and no defects are the “good” condition. The classification accuracy based on the centroids identified from the laboratory data is shown in Table 4. Note that in the confusion matrix, the term “True” refers to labels that have been assigned based on expert interpretation. From Table 4 it can be seen that the accuracy of classification for the eight slabs is around 60% on average, ranging from 46% for Slab 4 to 79% for Slab 7. Interestingly, the model performs with a lower accuracy on Slabs 1–4, compared to Slabs 5–8. Regardless, most of the misclassification is caused by a significant number of test points with “no defect” having energy in Band 3 and therefore being classified as in “fair” condition. Figure 6 presents IE signals and their corresponding spectra for two selected locations from Slab 1 that presumably have no defects and two selected locations from Slab 1 that are known to have deep delamination. As can be seen from the figure, spectra corresponding to “no defect” have significant energy in the frequency range greater than 10 kHz, in addition to the peak frequency centered around 10 kHz (thickness resonance). A very similar IE spectra is observed for deep delamination with significant energy in the frequency range greater than 10 kHz, as can be seen in Figure 6. Visual inspection of Figure 6 confirms that some locations with no defects show the signatures of deep delamination in their IE spectra, and the expert classification of these points is “fair.” Therefore, although these locations have no built-in defects, the spectra indicate the possibility of a certain degree of subsurface damage possibly generated during the construction process. Similarly, Dorafshan and Azari (
Confusion Matrices of Impact Echo Signal Classification using Laboratory Data Centroids

Impact echo signals and spectra for regions with “no defect” and “deep delamination” from Slab 1.
Moreover, the classification performance relies on how signals are grouped into the three classes. As we observe from Figure 5, honeycombing and deep delaminations span through both “poor” and “fair” classes. In fact, there seems to exist a fuzziness to the degree of association to both classes. The centroids obtained from the LTBP dataset are used to classify the experimental data in a similar fashion to compare the transferability of results across different datasets (see Table 5). The results suggest that the classification accuracy remains similar regardless of the dataset used to identify the centroids.
Confusion Matrices of Impact Echo Signal Classification using Long Term Bridge Performance Data Centroids
Using the signal classification scheme (and experimental data centroids), the condition maps for the eight slabs are generated as shown in Figure 7, where the different in-built defects can be clearly identified. However, as before, deep delamination is not accurately identified in all cases. Moreover, the condition maps generated using LTBP data (Figure 8) centroids are similar to the ones presented using experimental data.

Condition map using laboratory data centroids.

Condition map using Long Term Bridge Performance data centroids.
The proposed method uses an unsupervised clustering on energy proportions in different frequency bands as features to identify class centroids. In addition, the frequency bands are defined relative to the thickness resonance of the deck slab. Therefore, this approach does not depend on the specifics of the bridge deck itself, that is, signals with the same condition would still be identified correctly even if the thickness resonances of the decks are different. Therefore, the proposed methodology is robust and suitable for transfer learning.
Conclusions
IE is an acoustic nondestructive testing method that is used to identify defects in concrete structures such as bridge decks. Researchers have proposed various data-driven models for autonomous interpretation of IE signals to infer subsurface defects that might not be visible from visual inspection methods. IE signals belonging to different defect locations show distinct spectral characteristics that hint at the degree of subsurface damage. In Sengupta et al. (
Footnotes
Acknowledgements
The experimentation was performed at the FHWA, Turner Fairbank Highway Research Center.
Author Contributions
The authors confirm contribution to the paper as follows: study conception and design: A. Sengupta, H. Azari, S. I. Guler, P. Shokouhi; data collection: H. Azari; analysis and interpretation of results: A. Sengupta, H. Azari, S. I. Guler, P. Shokouhi; draft manuscript preparation: A. Sengupta, H. Azari, S. I. Guler, P. Shokouhi. All authors reviewed the results and approved the final version of the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was partially supported by the funds available through Institute for Computational and Data Sciences Seed Grant 2021-22, The Pennsylvania State University.
