Sage Journals: Discover world-class research

Abstract

Ensuring the structural integrity of bridges is essential for maintaining infrastructure safety and promoting long-term sustainability. In this context, indirect structural health monitoring (ISHM) through drive-by bridge inspection emerges as a promising alternative to traditional inspection methods, offering a cost-effective and scalable solution by leveraging vehicle-mounted sensors to assess the condition of bridges without requiring direct instrumentation. To our knowledge, this study introduces the first purpose-built electric inspection vehicle specifically designed for drive-by bridge inspection. The autonomous platform is capable of maintaining a constant low speed and offers customisable operational parameters to maximise the accuracy and repeatability of indirect sensing capabilities not achieved in previous studies. The vehicle is deployed within an ISHM framework and tested on two full-scale bridges to evaluate its effectiveness in capturing structural dynamic responses. Three rigorous unsupervised frameworks are then employed to analyse the collected data to identify features indicative of bridge properties and structural condition, including adversarial autoencoders, matrix profile and transformer network. The promising findings from this study demonstrate the practical feasibility of the approach. The study underscores the potential of ISHM as a viable tool for efficient bridge monitoring, contributing to the development of next-generation structural health monitoring systems that enhance safety, optimise maintenance strategies and support the longevity of critical infrastructure.

Keywords

Drive-by bridge inspection deep learning adversarial autoencoders matrix profile inspection vehicle bridge monitoring damage assessment transformer

Introduction

Bridges are critical components of modern infrastructure, enabling the transportation of people and goods while supporting economic stability and connectivity.¹ Their structural integrity is vital to public safety, operational efficiency and sustainable development. However, as these structures age, they are subjected to various deteriorating factors, including environmental effects, material degradation, increasing traffic loads and unexpected external events.² Such degradation can lead to catastrophic failures, as seen in the collapse of the Morandi Bridge in Italy (2018) and the failure of the Saint Anthony Falls Bridge in the United States (2007).

Worldwide, bridge infrastructure maintenance is an increasing challenge. For instance, in Australia, the majority of bridges were constructed prior to 1976, and many are now approaching or exceeding their design lifespans.³ At the same time, bridge usage is projected to increase by 300% over the next three decades, intensifying the demand for reliable and cost-effective monitoring strategies.⁴ Currently, periodic visual inspection remains the dominant approach, typically conducted every 2 years.⁵ Despite its widespread use, this method suffers from several limitations, such as being subjective, labour-intensive, expensive and inherently restricted in scope due to its dependence on the inspector’s criteria.⁶ More specifically, in Australia, a substantial proportion of short- to medium-span bridges are located in regional and remote areas where access to power, communication infrastructure and continuous monitoring systems is limited. Accordingly, this project targets the development of a low-cost, adaptable inspection framework specifically suited to short- and medium-span bridges that operate under such constraints. Although the present study focuses on two Australian bridges, the proposed approach is designed to be generalisable to similar bridge types worldwide. In addition, recent work by the authors⁷ has proposed design guidelines for the configuration and optimisation of the inspection vehicle, further supporting the adaptability of the system to a wide range of operational environments.

To address these limitations, the demand for automated, objective and continuous structural health monitoring systems (SHMS) has grown considerably. In particular, indirect structural health monitoring (ISHM) using drive-by bridge inspection has emerged as a promising alternative to visual inspections and conventional SHMS, which rely on fixed sensors installed directly on the structure.⁸ By contrast, drive-by bridge inspection involves instrumented vehicles crossing the bridge, using the dynamic response measured by mounted sensors, to infer the bridge’s condition. This approach enables the extraction of modal properties, such as natural frequencies, damping characteristics and potential damage indicators. Its advantages include low cost, scalability across many structures and reduced deployment and maintenance effort.²

Over the past two decades, numerous laboratory studies have contributed to validating drive-by methodologies under controlled conditions. Zhang et al.⁹ used wavelet-based time–frequency analysis to extract mode shapes from a scaled beam, while Kim et al.¹⁰ introduced artificial damage through saw cuts and added masses to demonstrate damage identification using frequency and damping changes. Cerda et al.^5,11 conducted scaled laboratory tests where damage was simulated by changing boundary conditions and adding mass, validating classification-based identification techniques using frequency-domain analysis. Mei et al.¹² applied Mel-frequency spectral coefficients (MFCC) and principal component analysis to detect damage in a scaled setup using a two-axle instrumented vehicle.

Recent studies have focused on improving robustness under variable operating conditions. Liu et al.¹³ proposed a multi-task unsupervised learning framework that uses short-time Fourier transform signals from multiple vehicles to identify, localise and quantify bridge damage, achieving high identification accuracy across different test scenarios. Li et al.¹⁴ combined MFCCs with support vector machines to achieve high detection sensitivity and low computational cost in a scaled experiment. Singh and Sadhu¹⁵ introduced a hybrid signal processing technique that effectively isolates bridge dynamics from noisy vehicle signals using wavelet packet transform and synchro-extracting transform, even under varying speeds. Yang et al.¹⁶ demonstrated the effectiveness of variational mode decomposition in extracting modal frequencies despite strong vehicle-induced noise.

In terms of model generalisation and scalability, Liu et al.¹⁷ developed a transfer learning method using adversarial and multi-task networks, allowing damage classifiers to generalise across different structures without labelled data. Matarazzo and Pakzad¹⁸ introduced STRIDEX, a blind deconvolution approach that isolates bridge responses from vehicle signals without relying on detailed system models. Eshkevari et al.¹⁹ proposed the crowdsourced modal identification using continuous wavelets technique, showing that fusing data from multiple smartphone-equipped vehicles increases identification reliability compared to single-vehicle systems.

Although laboratory validations are promising, real-world applications are more complex and remain limited. Most experimental tests in real bridges focus on the identification of dynamic properties. Yang et al.²⁰ and Lin and Yang²¹ extracted bridge natural frequencies through frequency-based methods, with the latter using additional truck mass to enhance response amplitude. Locke et al.²² applied operational modal analysis in a field setting, though their methodology was constrained to short-span bridges. Yang et al.²³ conducted field testing on the Turtle Mountain Bridge using frequency-domain analysis of contact point accelerations to extract dynamic properties, providing practical validation of vehicle–bridge interaction models. Additional field validations by Kong et al.²⁴ and Yang et al.²⁵ confirmed the effectiveness of tractor-trailer and contact-point acceleration approaches, respectively, for identifying bridge frequencies without fixed sensors. McGetrick and Kim²⁶ emphasised the challenges of simulating repeatable artificial damage in operational bridges, which remains a key limitation for full-scale ISHM validation. As summarised in Table 1, previous studies focused on the scaled vehicle and bridge model in laboratory conditions or relied on a commercial passenger vehicle for full-scale testing. However, the use of a custom-built inspection vehicle specifically designed for ISHM has not been reported. Laboratory-scale setups cannot fully capture the complexities of real bridge dynamics, while commercial vehicles offer only limited control over speed and allow restricted flexibility in sensor placement.

Table 1.

Drive-by experimental studies summary.

Study	Vehicle type	Bridge type	Vehicle speed (m/s)	Sensor type (location)	Outcome
²⁷	Scaled vehicle	Laboratory scaled Simply supported	$0.22 \pm 0.021$	Accelerometer (body)	Damage identification
^28,29	1:14 Scaled Truck Scaled	Laboratory scaled Simply supported Laboratory scaled	$0.11$ $0.78$ – $1.02$	Accelerometer(body and axle)	Frequency identification
³⁰	RC car scaled	Simply supported Laboratory scaled	$0.7$ – $1.5$	Accelerometer(body and axle)	Damage identification
³¹	Vehicle	Multi-girder simply supported	$0.217$ – $0.505$	Accelerometer (axle)	Damage identification
^32,33	Scaled vehicle	Laboratory scaled Three-span simplysupported	$0.04$ $0.84$	Accelerometer (axle)	Damage identification
³⁴	Passenger vehicle	Full scaled Cable-stayed	$~ 11.16$	Accelerometer (body)	Frequency identification
³⁵	Passenger vehicle	Five-span box girder	$11.11$	Accelerometer (body)	Frequency identification
³⁶	Tractor trailer	Three-span girder	$~ 2.1$	Accelerometer (axle)	Frequency identification

Despite these successes, ongoing research in advanced signal processing, robust machine learning techniques and optimised vehicle configurations continues to enhance the feasibility of these methods for real-world applications.

This study advances drive-by bridge inspection for full-scale, operational structures by building on two rigorous ISHM frameworks.^32,33,37 For the first time, we introduce a purpose-built autonomous inspection vehicle and demonstrate its application in field tests to identify key dynamic properties and evaluate its practical potential for damage assessment. The outcomes of this work contribute to the development of scalable, cost-effective SHMS that support the long-term safety, maintenance and sustainability of bridge infrastructure.

Research contribution

Building on previous work by this research group,^32,33,37 which introduced novel drive-by bridge damage assessment frameworks validated under controlled laboratory conditions, this study extends those unsupervised approaches to two full-scale, in-service bridges. The primary contributions of this research are as follows:

(a) Development of an autonomous inspection vehicle: We designed, constructed and field-characterised what is, to our knowledge, the first purpose-built autonomous platform for drive-by bridge inspection. This platform integrates custom instrumentation and control systems to serve as a versatile mobile sensing unit.

(b) Indirect characterisation of bridge dynamics: During multiple crossings over real bridges, the autonomous vehicle successfully captured vibrational data that revealed key dynamic properties (e.g., natural frequencies) without the need for fixed sensors on the structures.

(c) Field validation of unsupervised damage assessment techniques: The study applied and validated previously proposed indirect sensing methodologies under operational conditions, demonstrating their effectiveness for real-world damage detection and underscoring the practical value of prior ISHM frameworks.^32,33,37

(d) Additionally, all experimental data and analyses presented in this paper are original. The field tests and dataset collection were conducted specifically for this study to validate the proposed customised vehicle scanning system.

(e) Integration of advanced learning frameworks for drive-by SHM: The study incorporates the AMFormer Transformer model³⁸ and the StiefelGen data augmentation method,³⁹ adapted for frequency-domain singular-value sequences from vehicle–bridge interaction data. This integration enables data-efficient, attention-based anomaly detection and addres-ses data scarcity in deep learning models.

The paper is structured as follows: section ‘A novel autonomous inspection vehicle’ details the configuration and characterisation of the proposed novel inspection vehicle. Section ‘Experimental case studies’ presents the two case studies used for field test validation. Sections ‘Methodology’ and ‘Results’, respectively, describe the proposed methodologies for damage assessment, as well as the results obtained from both direct and indirect sensing across the two case studies. Finally, section ‘Conclusions’ summarises the main findings and provides recommendations for future work.

A novel autonomous inspection vehicle

This section presents the inspection vehicle setup, specifically designed and developed to perform drive-by bridge inspections. The inspection vehicle is a customised 1:5 scale electric Remote Control (RC) car, adapted to carry the necessary instrumentation for bridge monitoring. Figure 1 illustrates the complete inspection vehicle and its data management system. Specifically, it is equipped with four timely synchronised PCB-ICP393B05 accelerometers, one mounted on the suspension system of each wheel, capable of measuring within a $\pm 0.5 g$ range and operating across a frequency range of 0.7–450 Hz. These accelerometers are used to measure acceleration along the vertical axis. The factory calibration reports nominal sensitivities of 10.15, 10.06, 9.58 and 10.34 V/g. These sensors effectively capture vehicle dynamic responses, which are crucial for assessing bridge conditions. The setup also includes four Honeywell Model 31 Series load cells (capacity 0.5, 5000 N), mounted on a custom-designed aluminium suspension tower. This frame not only houses the sensors but also allows adjustment of the pivoting point to accommodate various load configurations (see Figure 2(b)). The sensing data are collected using a NI cDAQ-9185 CompactDAQ system and transmitted wirelessly to a remote computer via a 300M PIX-LINK WR09Q wireless repeater. The data are accessed through NI MAX and recorded using NI-FlexLogger, ensuring a reliable workflow for acquisition, processing and storage of dynamic measurements.

Figure 1.

Inspection vehicle setup for drive-by bridge inspection.

Figure 2.

Illustration of (a) system block diagram and (b) suspension tower.

The vehicle’s autonomous control architecture ensures both precise path following and a stable inspection speed, critical for consistent data collection for drive-by bridge scanning purposes. Steering guidance is achieved using a RoboteQ MGSW1600 magnetic guide sensor, which continuously tracks a magnetic tape embedded along the inspection route. This approach ensures that the vehicle follows exactly the same path in every run, improving test repeatability and reducing the influence of surface unevenness. The vehicle can also be operated manually via remote control when the use of magnetic guidance is not feasible in practical conditions. Its real-time output interfaces directly with the steering servo, enabling precise path tracking and delivering high accuracy in lateral positioning, while also facilitating smooth navigation even over uneven surfaces. A closed-loop feedback system handles speed regulation: an AMS Hall-effect sensor mounted on the drive train shaft measures rotational speed and relays this data to an ESP32 microcontroller. The ESP32 implements a PID control algorithm that adjusts the motor speed via a standard R/C electronic speed controller, working in tandem with a reduction gearbox to convert high-RPM motor output into the slow, torque-rich speeds required for scanning. A double-pole double-throw switch provides seamless toggling between manual and autonomous modes for both steering and motor drive control, allowing the operator to intervene for greater flexibility in operation, if required. This integrated system maintains target speeds, maximises data repeatability, and minimises operator workload during extended field campaigns. This architecture is shown in Figure 2(a). The overall system, with a vehicle mass of 20 kg and axle distance of 80 cm, offers a robust and adaptable platform for efficient and repeatable drive-by bridge inspections, with operating speeds varying from 0.1 to 10 $m / s$ .

Inspection vehicle’s characteristics

Accurately characterising the dynamic properties of the inspection vehicle is essential for reliable drive-by sensing. To this end, a driving test is performed in which the vehicle is driven at a constant velocity of 0.17 $m / s$ for 20 s. The acceleration responses of all vehicle-mounted sensors are recorded from multiple passages of the vehicle over a smooth surface where there is no interaction with a bridge. This is to estimate the frequencies associated with the electric motor.

Acceleration responses from four accelerometers mounted on the vehicle’s suspension are recorded and processed using frequency-domain decomposition (FDD) to identify the dominant motor-related frequencies.

The FDD methodology is expressed as follows⁴⁰: considering a matrix $A \in R^{n \times m}$ , where $n$ is the number of observations in the acceleration signals, and $m$ is the number of sensors, the cross-power spectral density (CPSD), $S$ , computed at each frequency is given as

S (f) = [\begin{matrix} S_{11} (f) & S_{12} (f) & \dots & S_{1 m} (f) \\ S_{21} (f) & S_{22} (f) & \dots & S_{2 m} (f) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ S_{m 1} (f) & S_{m 2} (f) & \dots & S_{mm} (f) \end{matrix}] \in C^{m \times m}

(1)

where $f$ is the vector of discrete frequencies, and $S_{ij} (f)$ is the complex-valued CPSD between signals $i$ and $j$ , with $i, j = 1, 2, . . ., m$ . Once the CPSD is obtained, the singular value decomposition (SVD) can be derived for each frequency in the spectrum as in Equation (2)

S = U \sum U^{H}

(2)

where $U$ contains the mode-shapes, $\sum$ is a diagonal matrix containing the singular values and $H$ denotes the Hermitian transpose. The dominant peaks observed in the SVD diagram correspond to the frequencies of the car in operation.

Figure 3 presents the analysis of the test conducted to characterise the inspection vehicle. Figure 3(a) shows an example of the acceleration response from one of the vehicle’s sensors during a driving test. Figure 3(b) illustrates the first singular value, from the FDD analysis, across five different tests when the vehicle’s speed is maintained at a constant velocity of 0.17 m/s. It can be observed that the electric motor produces a dominant frequency, $f_{vd}$ of 15 Hz. The vehicle-dominant frequency is speed-dependent and is primarily associated with electrical components (e.g., the drive motor); however, for a fixed speed, it remains effectively constant, as evidenced by Figure 3(b). As emphasised, $f_{vd}$ depends on the speed of the vehicle, and according to our calibration investigations, over the tested speed range of 0.17 and 0.58 m/s, it varies between 15 $\pm 2 Hz$ . This frequency range is well above the fundamental frequency band of short- to medium-span bridges (<10 Hz).⁴¹ Consequently, no interference with the bridge’s fundamental mode is expected. This information is crucial for analysing the recorded data and identifying the bridge’s dynamic properties.

Figure 3.

Identification of the driving frequency associated with the electric motor in the inspection vehicle: (a) acceleration time response from the accelerometer mounted on the front-right suspension arm and (b) the first singular value from five different driving tests incorporating all four accelerometer responses. The vehicle drives at a constant speed of 0.17 m/s over a smooth surface where no interaction with the bridge exists. The red dashed line represents $f_{vd}$ .

Experimental case studies

Case study I: pedestrian bridge at UNSW

The first case study analyses a real pedestrian bridge located at the Kensington campus of the University of New South Wales in Sydney, Australia. The bridge is a simply supported steel structure with a total length of 17 $m$ and a width of 80 $cm$ (see Figure 4).

Figure 4.

Pedestrian bridge to be inspected, located at the Kensington campus of the University of New South Wales, Australia.

Two measurement campaigns are conducted on this pedestrian bridge (see Figure 5). First, a direct sensing is conducted to identify the structure’s first natural frequency, which is later used as a benchmark to verify the indirect sensing. In the second stage, an indirect sensing using drive-by inspection is performed on the bridge.

Figure 5.

Bridge natural frequency identification.

Direct sensing

For the first test, three wireless accelerometers are installed at the quarter-span locations of the bridge, as shown in Figure 6(a). The sensors used are BeanAir Willow AX-3D triaxial accelerometers, configured with a sampling frequency of 500 $Hz$ . These sensors have a measurement range of $\pm 2 g$ , with a sensitivity of 0.1 mg and a noise density of $45 μ g / \sqrt{Hz}$ . They are equipped with an embedded fifth-order butterworth anti-aliasing filter with a programmable cut-off frequency between 1 and 2000 Hz. During the test, 30 samples of 15 s each are collected while the structure is subjected to an external excitation induced by pedestrians.

Figure 6.

Tests on the pedestrian bridge at UNSW: (a) direct test with three triaxial BeanAir sensors installed on the bridge and (b) drive-by inspection.

To identify the structure’s first natural frequency, the FDD method is applied to the acceleration signals. As described in ‘Inspection vehicle’s characteristics’ section, the FDD method involves computing SVD of the CPSD matrix constructed from the acceleration signals. In this study, only the first singular value is considered, and the peak in this spectrum is taken as the estimate of the first natural frequency of the structure.

Figure 7 shows a representative example of the acceleration time series recorded by one of the sensors located on the bridge, illustrating the typical duration and amplitude of the measured response. Then, the FDD analysis was performed on the recordings from the three sensors, following the procedure described in ‘Inspection vehicle’s characteristics’ section, but using the frequency spectra from the three fixed bridge sensors instead of those from the vehicle sensors. Figure 8 shows the first singular values obtained from five independent tests. It is confirmed that the first natural frequency of the bridge is $f_{b 1} \approx 6.65 Hz$ .

Figure 7.

Representative acceleration time series recorded by one of the sensors located on the bridge.

Figure 8.

First singular values from five different direct inspection tests. The red dashed line represents the averaged first natural frequency of the structure.

Indirect sensing

The second test consists of a drive-by inspection (see Figure 6(b)). To this end, 49 passes of the inspection vehicle are performed at a constant speed of 0.17 $m / s$ . Similar to the direct sensing test, the bridge remains in operation during the test, with random pedestrian-induced excitations present. Figure 9 shows a representative acceleration signal recorded by one of the vehicle-mounted sensors, illustrating the duration and amplitude of the measured response during the drive-by inspection.

Figure 9.

Acceleration time series from one of the vehicle-mounted sensors during the drive-by inspection.

Figure 10 presents the first singular values obtained from the FDD (i.e., as described in ‘Inspection vehicle’s characteristics’ section), considering the acceleration responses of the four mounted accelerometers. The figure displays the results from five different vehicle runs over the bridge. The average first natural frequency of the structure, identified from the peaks of the singular value plots, is $f_{b 1} \approx 6.65 Hz$ . Additionally, the frequencies associated with the motor in operation $f_{vd}$ of the vehicle are identified. The additional frequency components mentioned in earlier studies²⁰ are negligible here due to the low vehicle speed. These findings demonstrate that the drive-by bridge inspection methodology can accurately identify the structure’s first natural frequency.

Figure 10.

First singular values from five different indirect inspection tests. The red dashed line represents the averaged first natural frequency of the structure $f_{b 1}$ and the vehicle motor frequency $f_{vd}$ .

Table 2 presents the peak values of the first singular values obtained from both direct sensing tests (see Figure 8) and indirect sensing tests (see Figure 10). The low standard deviation of the obtained peaks confirms the consistency of the results across all tests, and the agreement between direct and indirect measurements supports the correct identification of the first natural frequency of the structure. The slight differences observed between the two measurement types are attributed to the different experimental setups and the use of distinct sensor systems in each configuration.

Table 2.

Comparison of peak values from the first singular value obtained from direct and indirect sensing tests(case study I).

Test no.	Direct (Hz)	Indirect (Hz)
1	7.141	6.647
2	6.714	6.720
3	6.836	6.658
4	7.202	6.668
5	6.989	6.647
Mean	6.976	6.668
Std. Dev.	0.183	0.027

Std. Dev.: standard deviation.

Case study II: Bulli Colliery Bridge

The second case study examines a real bridge located in Bulli, NSW, Australia. The bridge, named the Bulli Colliery Bridge, is a steel girder bridge that spans the Princes Highway in Bulli. The Bulli Colliery Bridge was originally built to transfer coal trucks to the Illawarra railway line, but is currently used as a pedestrian bridge. It consists of three spans, with the middle span being the focus of this study. The target span is 23.9 $m$ long. Figure 11 presents a view of the bridge and its orientation.

Figure 11.

Bulli Colliery Bridge location: (a) the side view of the bridge pointing in the south direction and (b) the top view of the bridge (taken from Google Maps).

Direct sensing

As in case study I, two tests are performed on this structure. The first test (see Figure 12(a)) corresponds to a direct sensing experiment with fixed wireless sensors installed on the structure. The same sensors and setup as in the first case study (i.e., three sensors located at the quarter points of the target span) are adopted.

Figure 12.

Bulli Colliery Bridge inspection tests: (a) direct test with three triaxial BeanAir sensors on the bridge and (b) drive-by inspection.

Figure 13 shows an example of the acceleration time series of the direct sensing test. Figure 14 presents the corresponding first singular values from five different tests obtained from the FDD, considering the three accelerometers mounted on the bridge following the procedure described in ‘Direct sensing’ section. The first fundamental frequency of the bridge, $f_{b 1} = 6.7 Hz$ , is identified and serves as a benchmark for evaluating the performance of the indirect test.

Figure 13.

Example of an acceleration time series from one of the sensors during the direct sensing test.

Figure 14.

First singular value from five different direct inspection tests. The red dashed line represents the first natural frequency of the structure.

Indirect sensing

Second, an indirect sensing test using drive-by bridge inspection (see Figure 12(b)) is conducted. The same inspection vehicle setup as in case study I, driving at the same speed (i.e., 0.17 m/s), is used for this purpose, and 39 crossings of the vehicle were performed. As in the first case study, the bridge remained in service during both tests, with pedestrians able to walk across it. Figure 15 shows an example of the acceleration response recorded by one of the accelerometers mounted on the vehicle during the test, illustrating the length of the measured signal.

Figure 15.

Acceleration time series from one of the accelerometers mounted on the inspection vehicle during the indirect test performed on the Bulli Colliery Bridge, illustrating the length and characteristics of the recorded signal.

The first singular values obtained from the FDD process are presented in Figure 16 for five different test runs, following the methodology presented in ‘Inspection vehicle’s characteristics’ section. As shown in Figure 16, the results confirm the successful identification of both the bridge’s first natural frequency and the driving frequency associated with the electric motor by the inspection vehicle during the indirect sensing tests. Specifically, peaks are consistently observed at approximately 6.7 Hz. across all tests, corresponding to the bridge’s first vibration mode. Additionally, the frequencies associated with the operation of the electric motor, $f_{vd}$ , are also identified. Other frequency components expected in previous studies²⁰ are not visible because of the low speed and light weight of the vehicle. These findings further validate the drive-by approach’s ability to capture the structure’s dynamic characteristics accurately.

Figure 16.

First singular values from five different indirect inspection tests on the Bulli Colliery Bridge. The red dashed lines represent the averaged first natural frequency of the structure $f_{b 1}$ and the vehicle motor frequency $f_{vd}$ .

Table 3 presents the peak values of the first singular values obtained from both direct sensing tests (see Figure 14) and indirect sensing tests (see Figure 16). The low standard deviation of the obtained peaks confirms the stability and repeatability of the identified frequencies across the five independent tests. The close agreement between direct and indirect results supports the correct identification of the first natural frequency of the Bulli Colliery Bridge. Minor differences between the two sensing configurations are attributed to variations in the experimental setup and the distinct characteristics of the sensors used in each configuration.

Table 3.

Comparison of peak values from the first singular value obtained from direct and indirect sensing tests (case study II).

Test no.	Direct (Hz)	Indirect (Hz)
1	6.836	6.417
2	6.531	6.407
3	6.836	6.428
4	6.714	6.396
5	6.958	6.406
Mean	6.775	6.411
Std. Dev.	0.144	0.011

Std. Dev.: standard deviation.

Methodology

After successfully identifying the bridge’s fundamental frequency using vehicle-mounted accelerometers – and confirming consistency with direct sensing – this paper focuses on drive-by sensing to explore its potential for bridge health monitoring. To this aim, three state-of-the-art unsupervised damage assessment frameworks, previously established in the studies by Hurtado et al.,³² Alamdari,³³ Hamedani et al.,³⁷ Zhong et al.³⁸ and Cheema and Sugiyama,³⁹ respectively, will be used and are presented next.

Indirect SHM using drive-by bridge inspection

This section demonstrates the potential application of the drive-by bridge inspection setup to SHM. In this stage, the data recorded by the inspection vehicle from the target bridges are processed using state-of-the-art damage assessment techniques. The first methodology, presented in the study by Hurtado et al.,³² employs an adversarial autoencoder (AAE) to identify and evaluate damage. In that study, the authors compared the proposed approach with other state-of-the-art techniques, demonstrating its superior performance. Although the methodology was validated through a scaled experimental test, its applicability in more realistic scenarios has not yet been demonstrated.

The second methodology, presented in the study by Alamdari³³ and Hamedani et al.,³⁷ involves a time-series-based analysis using an unsupervised semantic segmentation technique based on matrix profile (MP). Although promising results were obtained using the MP, previous work was limited to direct sensing and simplified laboratory experiments.

The third methodology integrates a Transformer-based framework (i.e., AMFormer)³⁸ combined with a novel time-series data augmentation approach (i.e., StiefelGen).³⁹ This approach enhances anomaly detection performance by leveraging deep spectral reconstruction through Transformer attention mechanisms, while Stiefel-manifold-based augmentation mitigates data scarcity by generating physically consistent synthetic samples. Together, these techniques enable robust and data-efficient analysis of vehicle–bridge interaction signals in real-world conditions.

The current study aims to further validate the frameworks proposed in the studies by Hurtado et al.,³² Alamdari,³³ Hamedani et al.³⁷ and Zhong et al.³⁸ using real data from these two bridges, specifically concerning the proposed sensing platform described earlier. In the first case study, the nominal condition of the bridge is intentionally varied by adding localised mass to simulate a variation on the bridge condition. However, in the second case study, such modification could not be performed due to the bridge’s high mass and operational constraints, which made it impractical to introduce a measurable dynamic change. As a result, the second case study focuses solely on characterising the nominal bridge state and evaluating the robustness of the methodology to false positives.

Adversarial autoencoders

The first damage assessment methodology employed in this work aims to establish the baseline condition of the bridge using an unsupervised machine learning approach based on an AAE.³² To this end, the data collected by the inspection vehicle are first processed following the procedure presented in ‘Experimental case studies’ section. In addition, two pre-processing steps are applied to the first singular values $S_{1}$ , as presented in Equation (2). First, the singular values from the CPSD of the vehicle’s acceleration recordings are filtered within the frequency range of 1–10 $Hz$ , which encompasses the first natural frequency of the structure, as discussed in ‘Experimental case studies’ section. The filtered signal, denoted as $S_{1, filt}$ , is used in the subsequent steps.

Then, sets of three $S_{1, filt}$ from random vehicle crossings are averaged and normalised to the range of $[0, 1]$ using a min-max scaling approach, as follows:

S = \frac{S_{1, filt} - \min (S_{1, filt})}{\max (S_{1, filt}) - \min (S_{1, filt})}

(3)

where $S$ denotes the averaged, normalised and filtered first singular values. Normalisation facilitates faster convergence of the training process. The resulting processed data, $S$ , is subsequently used as an input for training the AAE employed in this study. Figure 17 illustrates the first methodology as a potential application for ISHM, and it is explained next.

Figure 17.

The framework based on AAE to prove the potential application of drive-by bridge inspection in SHM.³² AAE: adversarial autoencoder; SHM: structural health monitoring.

AAE extends traditional autoencoders by incorporating adversarial training to impose a desired prior distribution on the latent space, thereby improving the structure and continuity of the encoded representation.⁴² An AAE consists of three components: an encoder, a decoder and a discriminator network. While the encoder–decoder function as in a standard autoencoder for data reconstruction, the discriminator encourages the latent codes to match a predefined prior distribution (e.g., Gaussian distribution).⁴³ Figure 18 shows a typical AAE structure.

Figure 18.

A typical AAE structure.⁴⁴ AAE: adversarial autoencoder.

Let the input data vector be denoted as $x$ , and the corresponding reconstructed output as $\bar{x}$ . The encoder maps $x$ to a latent code $y$ with encoding distribution $q (y | x)$ , and the decoder reconstructs $\bar{x}$ from $y$ . The latent space is regularised by aligning the aggregated posterior $q (y)$ with a chosen prior $p (y)$ , where

q (y) = \int_{x} q (y | x) p_{D} (x) d x

(4)

where $p_{D} (x)$ is the empirical data distribution. The discriminator network receives samples from both $q (y)$ and $p (y)$ , learning to classify whether a sample comes from the prior or from the encoder. This adversarial setup involves two alternating phases during training: (1) the reconstruction phase where the autoencoder minimises the reconstruction error between $x$ and $\bar{x}$ , and (2) the regularisation phase where the discriminator tries to distinguish $y ~ q (y)$ from $y_{true} ~ p (y)$ , while the encoder is trained to fool the discriminator. The discriminator loss is defined as

Los s_{D} = - \frac{1}{n} \sum_{n} [\log (D (y_{true})) + \log (1 - D (y))]

(5)

where $D (\cdot)$ is the discriminator output, $y_{true} ~ p (y)$ , $y ~ q (y)$ , and $n$ is the number of training samples. The discriminator has a sigmoid output layer, producing values in the range of $(0, 1)$ , corresponding to the probability of a sample being drawn from the prior distribution. The generator loss, used to update the encoder, combines the reconstruction error and the adversarial objective

Los s_{G} = \frac{1}{n} \sum_{n} {(x - \bar{x})}^{2} - \frac{1}{n} \sum_{n} \log (D (y))

(6)

In this work, $x$ is represented by the preprocessed signals, $S$ . Furthermore, the hyperparameters used in this work correspond to the ones used in the study by Hurtado et al.³² The structure details of the AAE are summarised in Table 4. All hidden layers use LeakyReLU activations with a slope of 0.2, the decoder output uses a tanh activation and the discriminator output uses a sigmoid. The latent code dimension is 8, batch size is 16 and the model is trained for 100 epochs using Adam optimisers, with no early stopping.

Table 4.

Summary of AAE structure.

Parameter	Value
Encoder	$256 \to 128 \to 16 \to 8$
Decoder	$8 \to 16 \to 128 \to 256$
Discriminator	$256 \to 128 \to 1$
Latent code size	$8$
Batch size	$16$
Epochs	$100$
Learning rate	$2 \times 10^{- 4}$

AAE: adversarial autoencoder.

Training and testing the model

As previously mentioned, the aim of implementing a damage assessment methodology in this work is to demonstrate the potential of drive-by bridge inspection for identifying structural damage. However, due to the challenges of modelling real damage in in-service structures, this study adopts two complementary approaches. In the first case study (i.e., the University of New South Wales (UNSW) Bridge), bridge condition variation is simulated by introducing added mass to the structure. In the other case study (i.e., Bulli bridge), the focus is on detecting false positives using only responses from the structure’s healthy, or nominal, condition. To this end, the preprocessed acceleration signals recorded by the instrumented inspection vehicle (i.e., $S$ ) are used to train and test the AAE model. These signals, presented in ‘Experimental Case studies’ section, representing the nominal state of the structure, are split into training and testing sets in an 80:20 ratio. During model validation, a damage threshold is defined as the 90th percentile of the reconstruction error $E_{r}$ (see Equation (7) and Figure 17). This threshold follows the recommendations presented in the study by Hurtado et al.⁴⁴

E_{r} = {(x - \bar{x})}^{2}

(7)

Once the damage threshold is defined, the testing set is passed through the AAE network, and the corresponding reconstruction errors are computed. These errors are then compared against the predefined threshold to identify if damage is present.

To demonstrate the methodology’s potential for ISHM using drive-by inspection, it is crucial to evaluate both its ability to minimise false positives (i.e., healthy samples misclassified as damaged) and to identify true negatives consistently. A low false-positive rate suggests that the methodology reliably characterises the nominal state of the structure, which is a necessary foundation for any SHM system. However, robustness to false positives alone does not guarantee sensitivity to actual damage. Therefore, complementary validation, such as a separate test on the UNSW Bridge where damage is simulated by adding localised mass, is essential to assess the method’s capacity to detect real structural changes.

Matrix profile

The second approach is based on the framework presented in the studies by Alamdari³³ and Hamedani et al.³⁷ In drive-by inspection, the amount of available data is often limited due to constraints in testing opportunities or operational conditions. This presents a challenge for the AAE approach, which relies on large training datasets. One potential solution is the MP method,⁴⁵ which can detect change points in bridge states using limited observations. Therefore, this study investigates the use of this technique as a second approach.

Figure 19 provides a schematic overview of the MP-based framework. The MP method is typically used in combination with the corrected arc crossings (CAC) to detect potential change points in the bridge’s state. If the bridge remains in a consistent condition throughout the observation period, the algorithm is not expected to detect any change points. This method has been validated in the study by Alamdari³³ and Hamedani et al.³⁷ using numerical simulations and laboratory experiments, but it has not yet been tested on full-scale bridges.

Figure 19.

Overview of the MP method. MP: matrix profile.

Change point detection

In the change point detection framework, drive-by inspections are carried out periodically over a defined inspection period. During each inspection, vehicle responses are recorded as the vehicle passes over the bridge. The responses from multiple inspections are then combined to form a single observation sequence. The framework analyses this sequence to identify potential change points in the bridge state.

Let $G = [{G_{1} (τ)}, {G_{2} (τ)}, \dots, {G_{n} (τ)}],$ denote the set of responses collected from $n$ independent passages of the test vehicle over the bridge. Each $G_{i} (τ)$ is a vector of length $m$ , representing the response recorded during the $i$ th passage, where $τ \in {t_{1} : t_{m}}$ denotes the discrete time index.³³ The total length of the observation sequence $G$ is defined as $p = m \times n$ . It is assumed that a change in bridge condition happens at an unknown time $t_{c} \in {t_{1} : t_{p}}$ . The subsequence ${{\hat{G}}_{k}}_{l} \in R^{l}$ is defined as a continuous subset of the values in $G$ with length $l$ . The starting position of the subsequence is denoted by $k$ , where $k \in {1 : p - l + 1}$ . The subsequence can be represented as ${{\hat{G}}_{k}}_{l} = {G_{k}, G_{k + 1}, \dots, G_{k + l + 1}}$ , in which $G_{k}$ denotes the $k$ -th element of $G$ . ${{\hat{G}}_{k}}_{l}$ can be treated as a sliding window applied to $G$ , through which all possible subsequences of length $l$ are obtained. These results could be collected in $L_{l} = [{{\hat{G}}_{1}}_{l}, {{\hat{G}}_{2}}_{l}, \dots, {{\hat{G}}_{p - l + 1}}_{l}]$ , where the $k$ th subsequence ${{\hat{G}}_{k}}_{l}$ is donated as $L_{l}^{k}$ .

Once the full set of subsequences is constructed, the all-pairs similarity search is performed to identify the first-nearest neighbour of each subsequence. During the search procedure, trivial matches, such as self-matching ones, are excluded. For each pair of subsequences $i$ and $j$ ( $\forall i, j \in {1 : p - l + 1}$ ), the similarity is measured using the $z$ -normalised Euclidean distance.⁴⁶ The distance $D_{i, j}$ for two subsequences $L_{l}^{i}$ and $L_{l}^{j}$ could be calculated as Equation (8):

D_{i, j} = \sqrt{\sum_{r = 1}^{l} {[Z ({(L_{l}^{i})}_{r}) - Z ({(L_{l}^{j})}_{r})]}^{2}}

(8)

where ${(L_{l}^{i})}_{r}$ represents the $r$ th element in the subsequence $L_{l}^{i}$ , and $Z (\cdot)$ is the $z$ -score normalisation operator. Considering the properties of the z-score, the distance expression can be simplified as Equation (9).

D_{i, j} = \sqrt{2 (l - \frac{\sum_{r = 1}^{l} ({(L_{l}^{j})}_{r} \cdot Z ({(L_{l}^{i})}_{r}))}{σ_{j}})}

(9)

Here, the $σ_{j}$ is the standard deviation of subsequence $L_{l}^{j}$ . The computational complexity of Equation (9) can be reduced by applying the convolution theorem.³³ Based on the calculated distances, the MP $P_{l} \in R^{p - l + 1}$ is constructed as a vector, where each element represents the minimum distance between a given subsequence and its nearest neighbour. The position of this nearest neighbour is recorded in the corresponding MP index $I_{l} \in Z^{p - l + 1}$ ,⁴⁷

P_{l} (i) = \min [D_{i, j}], \forall j \in {1 : p - l + 1},

(10)

I_{l} (i) = \arg \min [D_{i, j}], \forall j \in {1 : p - l + 1} .

(11)

Since the total number of subsequences is $p - l + 1$ , the searching process is performed for each subsequence individually. The resulting MP and its index are $p - l + 1$ -dimensional vectors. These results will be used for the next step of change point detection.

Each entry in the MP index defines a pair ( $i, j$ ), where $j$ is the nearest neighbour of subsequence $i$ . This pair can be represented as an arc from location $i$ to location $j$ . The arc curve (AC) can be defined as a vector in which each element indicates the number of arcs that cross over the corresponding location $i$ .⁴⁸ When the condition of the system changes at time $t_{c}$ , most subsequences are expected to find their nearest neighbours within the host regime. Since only a small number of arcs cross $t_{c}$ , the AC reaches the lowest value at this point. Low AC values can also be observed near the beginning and end of the series. This is due to the limited number of arcs that can cross at this boundary area. To solve this edge effect, the AC is normalised with respect to an idealised AC (IAC), resulting in the CAC:

CAC (i) = \min [\frac{AC (i)}{IAC (i)}, 1] .

(12)

The IAC denotes the number of arcs obtained if all matches were uniformly sampled from the entire time series. This idealised curve forms an inverted parabola, peaking at $1 / (2 p)$ .³³ The resulting CAC is a vector of length $p - l + 1$ , consistent with the length of the MP and its index. The range of CAC is located in $[0, 1]$ , and the minimum value indicates the most likely location of a change point in the series.

In the context of this work, the MP-based framework is applied to vehicle responses collected by the customised inspection vehicle. The recorded vehicle responses are processed using the methodology outlined in ‘Experimental case studies’ section to extract the first singular value from the CPSD. The extracted singular values are assembled sequentially to form the observation sequence $G$ . Change point detection within this sequence indicates potential changes in bridge condition during the inspection period.

Transformer-based anomaly detection (AMFormer)

The transformer-based architecture used in this study follows the AMFormer framework proposed in the study by Zhong et al.,³⁸ originally developed for time-series anomaly detection tasks. In its original formulation, AMFormer learns temporal dependencies directly from sequential data, modelling long-term relationships through self-attention. In this work, the model is adapted to operate on spectral sequences derived from VBI experiments, where each sequence corresponds to the first singular values obtained from the FDD of acceleration data recorded during a single vehicle passage.

Although the original temporal evolution is not explicitly present in the singular-value data, the ordering of frequency bins is treated as a pseudo-time axis, enabling the transformer to learn sequential dependencies among spectral features in an analogous manner to time-domain series. This adaptation enables the AMFormer to leverage its attention mechanism for identifying structural changes in the spectral representation of bridge vibrations over time. Figure 20 shows the structure of the AMFormer used in this work.

Figure 20.

AMFormer structure.

Model architecture

Let the input sequence be defined as $W_{i} \in R^{h \times D}$ , where $h$ denotes the sequence length (i.e., the number of consecutive vehicle passages considered within each window) and $D$ is the feature dimension (i.e., the number of frequency bins retained from the singular-value spectra). Each $W_{i}$ thus represents a sequence of $h$ consecutive passages, in which each passage is composed of the first singular values of the CPSD matrices computed from all sensors mounted on the vehicle.

For each windowed segment, the historical portion $W_{ie} \in R^{(h - s) \times D}$ and the current portion $W_{im} \in R^{s \times D}$ are defined, where $s$ is the step size used in reconstruction. The encoder transforms the input into a latent representation $H_{i}$ :

H_{i} = f_{e}^{t} (W_{i}; θ_{e}) = TransformerEncoder (W_{i} + P_{i}),

(13)

where $P_{i}$ denotes the sinusoidal positional encoding that establishes an artificial sequence ordering along the frequency axis, mimicking temporal dependencies. The notation $f_{e}^{t} (\cdot; θ_{e})$ corresponds to the encoder mapping function of the transformer, with $θ_{e}$ representing all encoder learnable parameters (multi-head attention, feed-forward, and normalisation weights).

In the decoding stage, an attention-guided masking mechanism is introduced to modulate the spectral content of the input before reconstruction. Guided by the encoded features $H_{i}$ , the decoder adaptively weights each frequency component to emphasise structurally relevant patterns while suppressing redundant information in the masked segment $W_{im}$ . This gating process is implemented through a differentiable masking operator $I_{i}$ , computed from the current segment $W_{im}$ as:

\begin{matrix} Z_{i} = LN (ϕ_{1} (W_{im})), \\ U_{i} = ReLU (Z_{i}), \\ V_{i} = ϕ_{2} (U_{i}), \\ O_{i} = ReLU (V_{i}), \\ I_{i} = \frac{O_{i}}{O_{i} + ε}, \end{matrix}

(14)

where $ϕ_{1} : R^{s \times D} \to R^{s \times d_{m}}$ and $ϕ_{2} : R^{s \times d_{m}} \to R^{s \times D}$ . $LN (\cdot)$ denotes layer normalisation, $ReLU (\cdot)$ the rectified linear unit, $d_{m}$ the mask hidden width and $ε = 10^{- 6}$ ensures numerical stability. By construction, $O_{i} \geq 0$ element-wise; hence, $I_{i} \in {(0, 1)}^{s \times D}$ acts as a soft, differentiable mask. The masked input is then

{\tilde{W}}_{im} = W_{im} ⊙ I_{i},

(15)

where ⊙ denotes the Hadamard (element-wise) product. Finally, the decoder network $f_{d}^{t} (\cdot; θ_{d})$ , parameterised by learnable weights $θ_{d}$ , reconstructs the current segment using both the masked input and the encoder context:

{\hat{W}}_{im} = f_{d}^{t} ({\tilde{W}}_{im}, H_{i}; θ_{d}) .

(16)

The hyperparameters used in the AMFormer implementation are summarised in Table 5. These values were selected after empirical tuning to ensure stable convergence and optimal reconstruction performance. The number of encoder and decoder blocks was set to 15 each, enabling deep feature abstraction and multi-scale attention over frequency-domain representations. A relatively small hidden dimension and two attention heads were adopted to balance expressiveness with computational efficiency. The model was trained for 2000 epochs using the Adam optimiser with a learning rate of $1 \times 10^{- 3}$ and a batch size of 32.

Table 5.

AMFormer model hyperparameters.

Hyperparameter	Value
Number of encoder blocks ( $N_{e}$ )	15
Number of decoder blocks ( $N_{d}$ )	15
Sequence length ( $h$ )	5
Step size ( $s$ )	5
Hidden dimension ( $d_{h}$ )	64
Number of attention heads ( $n_{h}$ )	4
Dropout rate	0.5
Batch size	32
Number of epochs	2000
Optimiser	Adam
Learning rate	$1 \times 10^{- 3}$

Active reconstruction objective for anomaly assessment

The network is trained using the Active-Mean Squared Error (MSE) loss introduced in the study by Zhong et al.,³⁸ which dynamically re-weights reconstruction errors to emphasise consistent patterns while reducing the influence of extreme outliers. For each training sample $i$ , the reconstruction error is:

A S_{i} = \sum_{t = 1}^{s} ‖ W_{im} (t) - {\hat{W}}_{im} (t) ‖_{2}^{2},

(17)

where $t$ indexes each step within the reconstruction horizon of length $s$ . Over a mini-batch of size $B$ , the error scores are normalised as:

z_{i} = \frac{A S_{i} - μ_{AS}}{σ_{AS}},

(18)

where $μ_{AS}$ and $σ_{AS}$ denote the batch mean and standard deviation, respectively. Each sample is assigned a dynamic weight:

ω_{i} = {\begin{matrix} 1, z_{i} \leq 0, \\ c^{z_{i}}, z_{i} > 0, \end{matrix}

(19)

with $c$ controlling the decay rate for large deviations. The final training loss is computed as:

L_{Active - MSE} = \frac{1}{B} \sum_{i = 1}^{B} ω_{i} A S_{i} .

(20)

This adaptive weighting improves training stability and robustness, particularly when the available data are limited or partially augmented. After training, the reconstruction error $E_{i} = ∥ W_{im} - {\hat{W}}_{im} ∥_{2}$ is used as the anomaly indicator, where higher values denote greater deviation from the learned healthy baseline.

After training, the model operates in inference mode to identify anomalous vehicle passages. For each test window $W_{i}$ , the reconstruction ${\hat{W}}_{im}$ is obtained through the trained encoder–decoder network, and the sample-wise reconstruction error is computed as:

E_{i} = \frac{1}{sD} \sum_{p = 1}^{s} \sum_{q = 1}^{D} {({\hat{W}}_{im}^{(p, q)} - W_{im}^{(p, q)})}^{2}

(21)

where $p \in [1, s]$ indexes the pseudo-temporal sequence of passages within the current window, and $q \in [1, D]$ indexes the frequency components within each spectral vector.

To ensure scale invariance across sequences, all reconstruction errors are normalised using the mean $μ_{h}$ and standard deviation $σ_{h}$ computed from the healthy validation data:

E_{i}^{norm} = \frac{E_{i} - μ_{h}}{σ_{h}} .

(22)

An adaptive threshold $T_{anom}$ is then derived from the healthy distribution using the median-of-means (MoM) criterion,³⁸ which mitigates the influence of outliers during calibration. A test instance is flagged as anomalous if:

E_{i}^{norm} > T_{anom} .

(23)

This procedure yields a binary anomaly decision for each input window. Higher normalised errors indicate structural deviations in the vehicle–bridge system’s frequency-domain response. Aggregating the anomaly scores across all test sequences enables quantitative assessment of anomaly evolution and localisation.

Data augmentation using StiefelGen

Despite the sequential formulation adopted in the transformer, the available dataset represents a limited number of vehicle passages per bridge condition. Such scarcity restricts the model’s ability to learn diverse spectral representations and may cause overfitting to a small subset of singular-value patterns. To address this limitation, the StiefelGen framework proposed in the study by Cheema and Sugiyama³⁹ was employed to synthetically expand the training data while preserving the geometric structure of the original feature space. Following the study by Peralta-Braz et al.,⁴⁹ this approach enables the synthesis of physically consistent acceleration signals that retain the intrinsic structure of the original sensor measurements.

Let the measured acceleration dataset for sensor $j$ be defined as:

A_{j} = {a_{j}^{(1)} (t), a_{j}^{(2)} (t), \dots, a_{j}^{(n)} (t)},

(24)

where $a_{j}^{(i)} (t)$ denotes the $i$ th recorded acceleration signal from sensor $j$ , and $n$ is the total number of passages. The StiefelGen model operates independently on each sensor dataset $A_{j}$ , learning the underlying manifold structure of the temporal responses.

Each signal is first embedded in a subspace representation $X_{i} \in St (k, T)$ , where $T$ denotes the number of time samples per signal and $k$ the intrinsic subspace rank. The Stiefel manifold is defined as

St (k, T) = {X \in R^{T \times k} : X^{⊤} X = I_{k}},

(25)

where ${(\cdot)}^{⊤}$ denotes the matrix transpose, and $I_{k}$ is the $k \times k$ identity matrix. This formulation ensures that the column vectors of $X_{i}$ form an orthonormal basis that captures the dominant temporal modes of the acceleration signal.

The StiefelGen framework models the local geometry of the data manifold by mapping samples to the tangent space at a reference point $X_{0} \in St (k, T)$ using the Riemannian logarithmic map:

ξ_{i} = Lo g_{X_{0}} (X_{i}),

(26)

where $ξ_{i} \in T_{X_{0}} St (k, T)$ denotes the tangent representation of $X_{i}$ . Synthetic samples are generated by sampling new tangent vectors ${\tilde{ξ}}_{i} ~ N (μ_{ξ}, Σ_{ξ})$ , estimated from the distribution of the real data in the tangent space. These samples are projected back to the manifold using the exponential map:

{\tilde{X}}_{i} = Ex p_{X_{0}} ({\tilde{ξ}}_{i}),

(27)

which guarantees that ${\tilde{X}}_{i}^{⊤} {\tilde{X}}_{i} = I_{k}$ . This process yields manifold-consistent synthetic signals that preserve the orthogonality and subspace geometry of the original acceleration data.

Both real and synthetic accelerations are subsequently processed through the FDD method to extract the first singular values of the CPSD matrices, which serve as the spectral inputs $W_{i}$ for AMFormer training and testing (see ‘Transformer-based anomaly detection (AMFormer)’ section). This two-stage pipeline, Stiefel manifold augmentation in the time domain, followed by spectral feature extraction, ensures that the augmented data maintain realistic modal properties while providing a richer training distribution for robust anomaly-sensitive feature learning.

Results

Case study I

Bridge condition assessment using AAE

Following the procedure described in ‘Adversarial autoencoders’ section, this section demonstrates the potential application of the AAE-based methodology for drive-by bridge inspection. First, 49 vehicle crossings over the bridge were analysed. From these crossings, the first singular values of the CPSD from the four acceleration responses of $S$ (see Equation (3)) were extracted and preprocessed as described in ‘Adversarial autoencoders’ section. These samples were then divided into training and testing sets, and the AAE was subsequently trained. The training loss curves for the AAE models in case study I are shown in Figure 24. Once the AAE is trained, the threshold is identified as 0.023. Figure 21 shows an example of the reconstruction of a $S$ sample after training, showing the potential of the AAE to reconstruct the nominal state of the structure.

Figure 21.

Reconstruction of a $S$ sample using the presented AAE. AAE: adversarial autoencoder.

To test the bridge condition assessment approach, damage was simulated by adding mass to the bridge (i.e., five people standing at mid-span), as shown in Figure 22. The total additional mass was approximately 375 $kg$ , representing a severe simulated damage scenario. This load was applied at the mid-span of the UNSW Bridge and maintained for the full duration of one indirect sensing run ( $~ 100 s$ , see Figure 9). The bridge’s dynamic response was measured using the same indirect sensing configuration employed in the healthy condition. The first natural frequency obtained under this modified state decreased to an average value of 4.47 $Hz$ , as shown in Table 6. When compared with the healthy condition (see Table 2), which showed a mean frequency of 6.67 $Hz$ from indirect sensing, this represents a reduction of approximately 33% in the first natural frequency. This significant decrease confirms the sensitivity of the framework to structural variations induced by the added mass.

Figure 22.

Drive-by bridge inspection with simulated damage.

Table 6.

First natural frequencies under added-mass condition (indirect sensing tests).

Test no.	Frequency (Hz)
1	4.358
2	4.180
3	4.650
4	4.450
5	4.690
Mean	4.466
Std. Dev.	0.189

Std. Dev.: standard deviation.

The vehicle crossed the modified bridge state 10 times, and the corresponding acceleration responses were pre-processed following the methodology described in ‘Adversarial Autoencoders’ section. These pre-processed signals were then fed into the trained AAE, and the resulting reconstruction error was used to demonstrate the framework’s ability to detect structural variations. Figure 23 shows an example of the reconstruction of the frequency spectrum from the modified state of the bridge, from which it is clear that the AAE fails to reconstruct the sample from the varied bridge state. Figure 24 is the result of this study.

Figure 23.

Reconstruction of a $S$ sample from the modified state of the bridge using the presented AAE. AAE: adversarial autoencoder.

Figure 24.

Training loss of the AAE model trained on case study I. AAE: adversarial autoencoder.

Figure 25 presents the reconstruction error obtained during the testing phase using the AAE-based methodology. It is evident that the reconstruction error for the simulated damage samples (i.e., modified bridge state with added mass) is significantly higher than that for the nominal state of the structure, indicating a variation in the bridge condition. Notably, all samples from the nominal bridge state were correctly classified as healthy, resulting in no false positives. Likewise, all samples from the modified bridge state were correctly identified as damaged, yielding no false negatives. These results demonstrate a perfect separation between true negatives and true positives, highlighting the potential of the proposed methodology for accurately assessing bridge condition through drive-by inspections.

Figure 25.

Reconstruction errors from healthy and simulated damage samples.

Finally, it is worth noting that, as reported in the study by Hurtado et al.,³² the AAE-based methodology is only capable of identifying damage and assessing its severity, based on the direct relationship between the reconstruction error and the extent of the damage. Identifying the type, source and location of the damage is beyond the scope of the presented methodology.

Bridge condition assessment using MP

This section examines the performance of the proposed MP method using experimental data obtained from the pedestrian bridge located at UNSW. As outlined in ‘Case study I: pedestrian Bridge at UNSW’ section, 49 vehicle passages were conducted, and the first singular value of the CPSD matrix was extracted for each run.

First, a case is studied in which every sample in the observation sequence corresponds to the benchmark state. This investigation aims to evaluate the robustness of MP to false positives. A new singular value sample is generated by averaging five randomly selected samples out of 49. This new sample consists of 900 points or spectral lines, covering the frequency range from 1 to 10 $Hz$ , and is normalised to the range of $[0, 1]$ . For the construction of observation sequences, a total of 30 samples are produced through this process. Therefore, the size of the observation sequence will be 27,000 data points, for example, 900 (spectral lines) × 30 (samples). The resulting observation sequence is shown in the first row of Figure 26, and the corresponding CAC profile is presented in the second row. The resulting CAC remains close to a constant value of 1 throughout the entire sequence, indicating that the method does not detect any change points.

Figure 26.

Observation sequence and corresponding CAC profile for the nominal state of UNSW Bridge. CAC: corrected arc crossings.

An extended assessment is conducted by repeatedly applying the random averaging and assembly procedure. In this way, 1000 observation sequences are obtained under the benchmark state for analysis. The CAC is computed for each sequence, and the results are shown as a three-dimensional surface in Figure 27. As illustrated, the surface remains close to the CAC value of one throughout all observation sequences, with only minor variations. This result confirms that the method produces stable outputs for the benchmark bridge condition.

Figure 27.

CAC profiles computed from 1000 randomly generated observation sequences using data from the UNSW Bridge. CAC: corrected arc crossings.

A second case is studied, in which the effectiveness of change point detection in identifying bridge state transitions is evaluated using an observation sequence that encompasses both benchmark and altered conditions. As described in ‘Methodology’ section, additional mass is added to the bridge to simulate a change in structural condition. An observation sequence, consisting of 20 samples from the benchmark condition and 10 from the damaged state, is established. This observation sequence is shown in the first row of Figure 28, with the corresponding CAC profile presented in the second row. A dashed line indicates the transition point between the two conditions. The CAC profile shows a clear minimum at the transition point, indicating that the method effectively detects the induced change.

Figure 28.

Observation sequence and corresponding CAC profile for the UNSW Bridge under benchmark and changed conditions. CAC: corrected arc crossings.

The consistency of the method in detecting change points is evaluated using 1000 observation sequences generated from both benchmark and damaged conditions. The corresponding CAC values from all sequences are formed into the surface illustrated in Figure 29. This surface displays a clear line-shaped minimum aligned with the transition point across all sequences, indicating the location of change. This result demonstrates the reliability of the MP method in detecting bridge state changes across multiple inspections.

Figure 29.

CAC profiles from 1000 observation sequences, including both benchmark and damaged conditions for the UNSW Bridge. CAC: corrected arc crossings.

Bridge condition assessment using AMFormer

This section presents the results obtained for case study I, based on the methodology introduced in ‘Transformer-based anomaly detection (AMFormer)’ section. The analysis aims to evaluate the performance of the proposed AMFormer-based anomaly detection framework in identifying bridge anomalies from VBI data. The complete pipeline combines transformer-based spectral reconstruction with Stiefel-manifold-driven data augmentation, enabling the network to generalise from limited experimental observations. By embedding the acceleration signals on the Stiefel manifold and generating physically consistent synthetic samples, the effective number of available passages for training is doubled, thereby improving the model’s representational diversity and mitigating overfitting due to data scarcity.

As mentioned previously, for this case, the experimental dataset comprised 39 healthy and 10 damaged vehicle passages. The augmentation procedure described in ‘Data augmentation using StiefelGen’ section was applied separately to both subsets, producing a twofold increase in the total number of training samples. This enriched dataset provided a more balanced and statistically representative input distribution for AMFormer training, enhancing its ability to learn discriminative spectral patterns associated with the bridge’s structural condition. Each sequence used for training and evaluation was composed of singular values filtered within the frequency range of $[1, 10] Hz$ , in line with previous methodologies. This ensured that the analysis focused on the most relevant vibration modes while removing low-frequency drift and high-frequency noise.

An illustrative example of the augmentation process is presented in Figure 30, which compares the normalised first singular values of an original and an augmented sample obtained from one of the vehicle-mounted sensors under the benchmark (healthy) state of the bridge. The two spectral signatures show a close spectral agreement, indicating that the StiefelGen procedure successfully preserves the dominant modal content of the original data while introducing minor stochastic variations. This confirms that the augmentation process generates physically consistent and diverse samples, thereby enhancing the model’s capacity to generalise from a limited number of real passages.

Figure 30.

Comparison between the normalised first singular values of an original and an augmented sample from one sensor under the benchmark state of the bridge.

The training and validation loss curves shown in Figure 31 illustrate the convergence behaviour of the AMFormer model under the Active-MSE objective function. A smooth and monotonic decrease in the loss function is observed across epochs, indicating stable optimisation and effective learning of spectral dependencies.

Figure 31.

AMFormer loss curve during the training process.

To evaluate the model’s capacity to discriminate between healthy and damaged bridge conditions, the normalised reconstruction errors were computed and are presented in Figure 32. The histogram reveals two clearly separated distributions: healthy passages exhibit small reconstruction errors concentrated near zero, while damaged passages display substantially higher error magnitudes. The MoM-based threshold successfully distinguishes between the two populations, validating the robustness of the anomaly detection criterion.

Figure 32.

Normalised reconstruction errors from AMFormer testing.

Overall, these findings confirm that the proposed methodology can effectively assess the bridge condition even when only a limited amount of data is available. The combination of geometry-based data augmentation and transformer-based spectral learning allows the model to detect small changes in the vehicle–bridge interaction response, showing strong potential for practical and data-efficient structural health monitoring.

It is noted that the added-mass configuration used in ‘Case study I’ section represents a controlled and repeatable way to simulate a permanent change in structural characteristics. By contrast, transient live-load effects, such as the temporary presence of pedestrians or vehicles, would not consistently appear in repeated drive-by inspections and therefore would not be misclassified as damage.

Case study II

Bridge condition assessment using AAE

Following the procedure described in ‘Adversarial autoencoders’ section, this section presents the application of the AAE-based methodology to the second case study. As previously mentioned, multiple vehicle crossings were conducted. From these measurements, the first singular values of the CPSD derived from the four acceleration responses, $S$ was extracted and preprocessed, as detailed in ‘Adversarial autoencoders’ section. The resulting samples were then divided into training and testing sets using an 80:20 split, and the AAE model was trained accordingly. The training loss curves are presented in Figure 33. Figure 34 presents an example of the reconstruction of a $S$ sample from the nominal state after training.

Figure 33.

Training loss of the AAE model trained on case study II. AAE: adversarial autoencoder.

Figure 34.

Reconstruction of a $S$ sample using the AAE. AAE: adversarial autoencoder.

Once the AAE is trained and the damage threshold (see ‘Adversarial autoencoders’ section) is defined using the training set (i.e., 0.069), the test set is passed through the network. Given the high mass of the structure and its operational condition, it was challenging to add sufficient mass to alter its local dynamic behaviour. Therefore, this case study focuses solely on identifying the nominal bridge state, without considering modified or damaged states. Figure 35 shows the reconstruction errors corresponding to the nominal test set.

Figure 35.

Reconstruction errors from the validation and test sets.

As illustrated in Figure 35, one out of ten samples in the test set of the nominal bridge condition exceeded the defined damage threshold. This result suggests that the AAE may not have fully learned the representative features of the bridge’s benchmark state. The limited number of available samples introduces uncertainty in the prediction and highlights the importance of using a larger dataset during the training phase. Although the method identified a false positive primarily due to the small number of samples used for training, the methodology still demonstrates potential for application in bridge anomaly detection by accurately reconstructing the healthy samples from the bridge.

Bridge condition assessment using MP

This section investigates the performance of the change point detection method presented in ‘Matrix profile’ section. The singular values obtained from the indirect approach presented in ‘Case study II: Bulli Colliery Bridge’ section are used to construct the observation sequences. To reduce the influence of noise, five singular value samples are randomly selected from a total of 39 samples and averaged to generate a new spectral sample. Each spectral sample contains 900 singular value points within the frequency range of 1–10 Hz. and is normalised between 0 and 1.

In this case study, the observation sub-sequence is constructed by assembling 30 samples, each generated by the averaging process mentioned previously. The complete subsequence, shown in the first row of Figure 36, represents a scenario in which all 30 inspections are conducted under benchmark conditions.

Figure 36.

Observation sequence and corresponding CAC profile for the nominal condition of the Bulli Colliery Bridge.

The observation sequence has a length of 27,000 points (30 samples × 900 points). The MP method described in ‘Matrix profile’ section is then applied to calculate the CAC. The resulting CAC is shown in the second row of Figure 36. Although minor deviations from the constant value of one are observed in the CAC curve, no significant decrease is present. Since the observation sequence does not contain any change in bridge condition, the CAC is expected to remain close to one. The result confirms this expectation, suggesting that the bridge condition remains consistent throughout the observation period.

To further evaluate the robustness of the proposed method, 1000 independent observation sequences are generated, and the corresponding CACs are computed. As shown in Figure 37, the CAC profiles form a surface with a value of one. Since the bridge condition remains unchanged in all observation sequences, no change points are expected, and the CAC reflects this by maintaining a maximum value across the surface.

Figure 37.

CAC profiles computed from 1000 randomly generated observation sequences using data from the Bulli Colliery Bridge. CAC: corrected arc crossings.

Small deviations of the CAC from unity were observed, which can be attributed to measurement noise and variability in excitation. This behaviour is consistent with previous findings in the literature.^33,37

It is important to note that both direct and indirect bridge monitoring approaches face challenges associated with operational and environmental variability. In particular, live-load variations (e.g., vehicular traffic or pedestrian movement) can influence the measured responses and, consequently, the reconstruction error in data-driven methods such as the AAE. To mitigate these effects, the present experiments were conducted under controlled conditions, including partial lane closures and a constant vehicle speed, to ensure repeatability and minimise the influence of uncontrolled excitations. The proposed methodology is therefore most suitable for short- and medium-span bridges where controlled testing can be achieved. Future research will focus on incorporating larger datasets under varied traffic and environmental conditions to enhance the robustness of the approach for broader operational scenarios.

Conclusions

An electric, custom-built inspection car for drive-by bridge inspection was developed and instrumented with onboard accelerometers, and its performance was evaluated on two full-scale bridges in NSW, Australia (the UNSW Bridge and the Bulli Colliery Bridge). The first natural frequencies of both bridges were estimated using direct and indirect approaches; in each case, FDD was employed to extract the fundamental mode. Average values of 6.65 Hz (case study I) and 6.42 Hz (case study II) were obtained, confirming that fundamental modal frequencies could be accurately identified via drive-by techniques.

An AAE was applied to assess structural variation. In case study I, the nominal condition was intentionally altered by introducing localised added mass so that damage could be simulated and the method’s classification capability evaluated; in case study II, damage simulation was not undertaken because the structure’s high mass and operational constraints made measurable dynamic changes impractical, and characterisation of the nominal state with minimised false positives was prioritised. Structural variation was successfully identified in case study I, and the nominal state was accurately characterised in case study II. Additionally, the MP was applied to both case studies to detect change points; when bridges remained in benchmark states, corrected ACs stayed near unity with no significant minima, indicating an absence of false detections.

Beyond these approaches, an additional analysis was conducted using the AMFormer framework and the StiefelGen augmentation model. This recently developed transformer-based methodology was introduced to enhance anomaly detection performance under data-scarce conditions by learning spectral dependencies from vehicle–bridge interaction responses. The Stiefel-manifold augmentation procedure effectively expanded the available dataset, ensuring physical consistency and improving generalisation. Together, these methods provided a more robust and data-efficient strategy for identifying subtle variations in bridge dynamics.

Overall, the feasibility of drive-by inspection using a configurable inspection vehicle was demonstrated, providing a foundation for indirect bridge health monitoring. Practical limitations for full-scale implementation were noted: (i) applicability was largely confined to short- and medium-span bridges where speed, traffic and test conditions could be controlled (with possible partial lane closures); (ii) optimal performance required low, constant vehicle speed; (iii) environmental and operational variations were not investigated and (iv) the number of recordings was limited and performance under varying vehicle properties required further study. Future work was planned to extend testing to larger bridges and unrestricted traffic environments, to examine long-term patterns and environmental/operational effects, and to validate the methodology across a broader set of bridge types. The modular nature of the inspection vehicle was intended to enable exploration and optimisation of the design space for enhanced sensing performance. Additionally, the generalisability of the system is supported by the ability to adapt the sensing configuration to most short- and medium-span road bridges, where only minor adjustments to sensor layout, acquisition settings, or – depending on bridge characteristics – modifications to the vehicle’s dynamic properties may be required. Because low-speed operation is typically necessary for high-quality indirect measurements, practical deployment would likely rely on controlled testing conditions, such as off-peak operation or temporary partial lane closures, to minimise traffic disruption and enable routine use in road networks.

Footnotes

Acknowledgements

The authors express their gratitude to the Australian Research Council (ARC) for its support through the Discovery Early Career Researcher Award initiative, grant number DE210101625. This research also received funding from the ARC Industrial Transformation Research Programme (IH210100048). The authors additionally thank the Transport New South Wales teams for their assistance and support during the experimental testing.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship and/or publication of this article.

ORCID iD

M Makki Alamdari

Data availability Statement

The data and codes supporting this study are available from the authors upon request. The datasets are not publicly available due to institutional and project-related restrictions, but can be shared subject to the necessary approvals.

References

Lin

Yoda

Bridge engineering: classifications, design loading, and analysis methods. Butterworth-Heinemann, 2017. https://books.google.com.au/books?id=6op4CgAAQBAJ

Zhu

Law

. Structural health monitoring based on vehicle-bridge interaction: accomplishments and challenges. Adv Struct Eng 2015; 18(12): 1999–2015.

Lemass

MRB

. A decision support methodology for remediation planning of concrete bridges. J Constr Eng Project Manag 2011; 1: 1–11.

Hodgson

Transport and logistics insights the road ahead. 2014. https://www.scribd.com/document/294129251/Transport-and-Logistics-Insights-January-2014

Cerda

Chen

Bielak

, et al. Indirect structural health monitoring of a simplified laboratory-scale bridge model. Smart Struct Syst 2014; 13(5): 849–868.

Neves

Structural health monitoring of bridges: Model-free damage detection method using machine learning. PhD thesis, KTH Royal Institute of Technology, 2017.

Hurtado

Atroshchenko

Chang

, et al. Optimal characteristics of inspection vehicle for drive-by bridge inspection. Comput Struct 2026; 320: 108037.

Yang

Lin

CW.

Vehicle-bridge interaction dynamics and potential applications. J Sound Vib 2005; 284(1–2): 205–226.

Zhang

Zhao

Lie

ST.

Estimation of mode shapes of beam-like structures by a moving lumped mass. Eng Struct 2019; 180: 654–668.

10.

Kim

Isemoto

Toshinami

, et al. Experimental investigation of drive-by bridge inspection. In: 5th International conference on structural health monitoring of intelligent infrastructure (SHMII-5), Cancun, Mexico, December 11–15, 2011. Instituto de Ingeniería, UNAM, 2011.

11.

Cerda

Garrett

Bielak

, et al. Indirect structural health monitoring in bridges: scale experiments. In: Sixth International IABMAS Conference (Bridge Maintenance, Safety, Management, Resilience and Sustainability), Stresa, Italy, July 8–12, 2012.

12.

Mei

Gül

Boay

Indirect health monitoring of bridges using mel-frequency cepstral coefficients and principal component analysis. Mech Syst Signal Process 2019; 119: 523–546.

13.

Liu

Bergés

, et al. Hiermud: hierarchical multi-task unsupervised domain adaptation between bridges for drive-by damage diagnosis. Struct Health Monit 2023; 22(3), pp.1941–1968.

14.

Lin

Zhang

Drive-by bridge damage detection using mel-frequency cepstral coefficients and support vector machine. Struct Health Monit 2023; 22(5): 3302–3319.

15.

Singh

Sadhu

A hybrid time-frequency method for robust drive-by modal identification of bridges. Eng Struct 2022; 266: 114624.

16.

Yang

, et al. An effective procedure for extracting the first few bridge frequencies from a test vehicle. Acta Mech 2021; 232: 1227–1251.

17.

Liu

Bergés

Bielak

, et al. Knowledge transfer between bridges for drive-by monitoring using adversarial and multi-task learning. arXiv:2006.03641, 2020.

18.

Matarazzo

Pakzad

SN.

Scalable structural modal identification using dynamic sensor network data with stridex. Comput Aided Civ Infrastruct Eng 2018; 33(1): 4–20.

19.

Eshkevari

Cronin

Pakzad

, et al. Bridge structural health monitoring using asynchronous mobile sensing data. arXiv e-prints, 2020, pp.arXiv-2007.

20.

Yang

Lin

Yau

JD.

Extracting bridge frequencies from the dynamic response of a passing vehicle. J Sound Vib 2004; 272(3–5): 471–493.

21.

Lin

Yang

YB.

Use of a passing vehicle to scan the fundamental bridge frequencies: An experimental verification. Eng Struct 2005; 27(13): 1865–1878.

22.

Locke

Redmond

Schmid

Experimental evaluation of drive-by health monitoring on a short span bridge using OMA techniques. In Dynamics of Civil Structures, Volume 2: Proceedings of the 39th IMAC, A Conference and Exposition on Structural Dynamics, Springer International Publishing, Cham, 2021, pp. 109–127.

23.

Yang

Zhang

, et al. Measuring bridge frequencies by a test vehicle in non-moving and moving states. Eng Struct 2020; 203: 109859.

24.

Kong

Cai

Deng

, et al. Using dynamic responses of moving vehicles to extract bridge modal properties of a field bridge. J Bridge Eng 2017; 22(6): 04017018.

25.

Yang

Wang

, et al. Refined detection technique for bridge frequencies using rocking motion of single-axle moving vehicle. Mech Syst Signal Process 2022; 162: 107992.

26.

McGetrick

Kim

CW.

An indirect bridge inspection method incorporating a wavelet-based damage indicator and pattern recognition. In: Proceedings of IX international conference on structural dynamics EURODYN, Porto, Portugal, 30 June – 2 July 2014, pp. 2605–2612.

27.

Sitton

Rajan

Story

BA.

Damage scenario analysis of bridges using crowdsourced smartphone data from passing vehicles. Comput Aided Civ Infrastruct Eng 2024; 39(9): 1257–1274.

28.

Lan

Feng

, et al. Investigation of time-varying frequencies of two-axle vehicles and bridges during interaction using drive-by methods and improved multisynchrosqueezing transform. Mech Syst Signal Process 2024; 220: 111677.

29.

Lan

Lin

Physics-guided diagnosis framework for bridge health monitoring using raw vehicle accelerations. Mech Syst Signal Process 2024; 206: 110899.

30.

Ghiasi

Corbally

Malekjafarian

Optimal feature selection for supervised bridge damage classification using responses collected from a passing vehicle. Structures 2025; 75: 108845.

31.

Peng

Zhong

, et al. Interpretable damage sensitive feature extraction for drive-by structural health monitoring using knowledge distillation-based deep learning. J Sound Vib 2025; 618: 119303.

32.

Hurtado

Kaur

Alamdari

, et al. Unsupervised learning-based framework for indirect structural health monitoring using adversarial autoencoder. J Sound Vib 2023; 550: 117598.

33.

Alamdari

MM.

An evolutionary vehicle scanning method for bridges based on time series segmentation and change point detection. Mech Syst Signal Process 2024; 210: 111173.

34.

Erduran

Gonen

Komarizadehasl

, et al. Frequency identification of non-beam bridges using vehicle scanning methods: analytical formulation and experimental validation. Struct 2025; 80: 109612.

35.

Singh

Sadhu

Contact point response-based indirect bridge health monitoring using robust empirical mode decomposition. J Sound Vib 2023; 567: 118064.

36.

Yang

Wang

, et al. Closely spaced modes of bridges estimated by a hybrid time–frequency method using a multi-sensor scanning vehicle: theory and practice. Mech Syst Signal Process 2023; 192: 110236.

37.

Hamedani

Alamdari

Atroshchenko

, et al. Bridge health monitoring through series analysis: full-scale validation on a truss bridge. Eng Struct 2025; 340: 120516.

38.

Zhong

Liu

Jiang

, et al. Refining one-class representation: a unified transformer for unsupervised time-series anomaly detection. Inform Sci 2024; 656: 119914.

39.

Cheema

Sugiyama

Stiefelgen: a simple, model agnostic approach for time series data augmentation over riemannian manifolds. arXiv preprint arXiv:2402.19287, 2024.

40.

Brincker

Zhang

Frequency domain decomposition revisited. In: Proceedings of the 3rd international operational modal analysis conference (IOMAC), Portonovo, Ancona, May 4–6, 2009.

41.

Akbari

Maadani

Maalek

On the fundamental natural frequency of bridge decks: review and applications. Proc Inst Civ Eng Struct Build 2018; 171(12): 931–945.

42.

Makhzani

Shlens

Jaitly

, et al. Adversarial autoencoders. In: International conference on learning representations, 2016.

43.

Pidhorskyi

Almohsen

Adjeroh

, et al. Generative probabilistic novelty detection with adversarial autoencoders. In 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), December 3–8, 2018, Montréal, Canada.

44.

Hurtado

Alamdari

Atroshchenko

, et al. A data-driven methodology for bridge indirect health monitoring using unsupervised computer vision. Mech Syst Signal Process 2024; 210: 111109.

45.

Yeh

C-CM

Zhu

Ulanova

, et al. Matrix profile i: all pairs similarity joins for time series: A unifying view that includes motifs, discords and shapelets. In: 2016 IEEE 16th international conference on data mining (ICDM), 2016, pp. 1317–1322.

46.

Mueen

Zhong

Zhu

, et al. The fastest similarity search algorithm for time series subsequences under Euclidean distance, http://www.cs.unm.edu/∼mueen/FastestSimilaritySearch.html (accessed August 2022).

47.

Gharghabi

Imani

Bagnall

, et al. Matrix profile xii: Mpdist: a novel time series distance measure to allow data mining in more challenging scenarios. In: 2018 IEEE international conference on data mining (ICDM), 2018, pp. 965–970.

48.

Gharghabi

Ding

Yeh

C-CM

, et al. Matrix profile viii: domain agnostic online semantic segmentation at superhuman performance levels. In: 2017 IEEE international conference on data mining (ICDM), New Orleans, LA, November 18–21, 2017, pp. 117–126.

49.

Peralta-Braz

Alamdari

Chou

, et al. Simultaneous energy harvesting and bearing fault detection using piezoelectric cantilevers. IEEE Internet Things J 2025; 12(21): 45533–45549.

Development and field validation of a fully customised vehicle scanning system on two full-scale bridges

Abstract

Keywords

Introduction

Research contribution

A novel autonomous inspection vehicle

Inspection vehicle’s characteristics

Experimental case studies

Case study I: pedestrian bridge at UNSW

Direct sensing

Indirect sensing

Case study II: Bulli Colliery Bridge

Direct sensing

Indirect sensing

Methodology

Indirect SHM using drive-by bridge inspection

Adversarial autoencoders

Training and testing the model

Matrix profile

Change point detection

Transformer-based anomaly detection (AMFormer)

Model architecture

Active reconstruction objective for anomaly assessment

Data augmentation using StiefelGen

Results

Case study I

Bridge condition assessment using AAE

Bridge condition assessment using MP

Bridge condition assessment using AMFormer

Case study II

Bridge condition assessment using AAE

Bridge condition assessment using MP

Conclusions

Footnotes

Acknowledgements

Declaration of conflicting interests

Funding

ORCID iD

Data availability Statement

References