Rapid seismic performance assessment of existing building structures via field-measured features and ensemble machine learning

Abstract

Accurate and timely assessment of structural damage is critical in response to severe earthquake events. To this end, this study proposes a framework integrating ambient-vibration tests, multivariate features, and machine-learning (ML) models. The focus is to examine the capability of various ML models, including decision trees, random forest, eXtreme Gradient Boosting (XGBoost), Light Gradient Boosted Machine (LightGBM), and Category Boosting (CatBoost), in classifying the seismic performance levels of buildings. To reduce biases due to imbalanced class distribution, a simulated dataset is adopted to train ML models. Particularly, this dataset is generated from the nonlinear time-history analyses of surrogate structural models, whose dynamic properties are calibrated from prior on-site testing. The analyses show that the XGBoost model mostly outperforms others and achieves an average F1-score of 0.859 across all performance levels in the test sets. Moreover, SHapley Additive exPlanations (SHAP) analyses are performed to determine the dominant features for classification task with six critical features identified. The reduced-dimension XGBoost model attains similar average F1-scores as that using all examined features. The study also investigates cost-sensitive models that account for the asymmetrical consequences of performance levels misclassification. Lastly, the proposed method is validated using publicly available data from real-world structures with seismic monitoring and demonstrated for regional real earthquakes and hypothetical seismic risk assessments. The predictions from XGBoost models for real earthquake assessments generally agree with actual observations.

Keywords

seismic performance assessment machine learning capacity curves performance levels SHapley Additive exPlanations

Get full access to this article

View all access options for this article.

References

Arias

(1970) Measure of earthquake intensity. In: Hansen

(ed) Seismic Design for Nuclear Power Plants. Cambridge, Massachusetts: Massachusetts Institute of Technology Press, 438–483.

ATC (1995) 20-2: Addendum to the ATC-20 Postearthquake Building Safety Evaluation Procedures. Applied Technology Council.

Behzadan

Dong

Kamat

(2015) Augmented reality visualization: a review of civil infrastructure system applications. Advanced Engineering Informatics 29(2): 252–267.

Bhatta

Dang

(2023) Seismic damage prediction of RC buildings using machine learning. Earthquake Engineering & Structural Dynamics 52(11): 3504–3527.

Bhatta

Dang

(2024) Quantum-enhanced machine learning technique for rapid post-earthquake assessment of building safety. Computer-Aided Civil and Infrastructure Engineering 39(21): 3188–3205.

Bommer

Martínez-Pereira

(1999) The effective duration of earthquake strong motion. Journal of Earthquake Engineering 3(2): 127–172.

CESMD (2008) Center for engineering strong motion data. Available at: https://www.strongmotioncenter.org/ (accessed 21 September 2025).

Chen

Guestrin

(2016) XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, CA, USA: Association for Computing Machinery, 785–794.

Chinese Earthquake Administration (2016) Earthquake prevention and disaster mitigation planning (2016–2020). National Development and Reform Commission of the People’s Republic of China.

10.

Elkan

(2001) The foundations of cost-sensitive learning In: International Joint Conference on Artificial Intelligence. Lawrence Erlbaum Associates Ltd.

11.

FEMA (2009) Quantification of Building Seismic Performance Factors - FEMA P695. Applied Technology Council.

12.

FEMA (2015) Rapid Visual Screening of Buildings for Potential Seismic Hazards: A Handbook – FEMA P-154. 3rd Ed. Applied Technology Council.

13.

FEMA (2020) HAZUS Earthquake Model Technical Manual, 4.2 Ed. Federal Emergency Management Agency.

14.

Galar

Fernández

Barrenechea

, et al. (2012) A review on ensembles for the class imbalance problem: bagging-boosting-and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 42(4): 463–484.

15.

GB50011-2010 (2016) Code for Seismic Design of Building. Beijing, China: China Architecture and Building Press.

16.

Giordano

Iacovino

Quqa

, et al. (2022) The value of seismic structural health monitoring for post-earthquake building evacuation. Bulletin of Earthquake Engineering 20(9): 4367–4393.

17.

Goulet

Michel

Kiureghian

(2015) Data-driven post-earthquake rapid structural safety assessment. Earthquake Engineering & Structural Dynamics 44(4): 549–562.

18.

Ivanović

Trifunac

Novikova

, et al. (1999) Instrumented 7-storey Reinforced Concrete Building in Van Nuys, California: Ambient Vibration Surveys Following the Damage from the 1994 Northridge Earthquake. Report CE99-03, Department of Civil Engineering. Los Angeles, California, United States: University of Southern California.

19.

JBDPA (2001) Standard for seismic evaluation of existing reinforced concrete buildings. In: The Japan Building Disaster Prevention Association.

20.

Kazemi

Asgarkhani

Jankowski

(2023) Machine learning-based seismic fragility and seismic vulnerability assessment of reinforced concrete structures. Soil Dynamics and Earthquake Engineering 166: 107761.

21.

Kazemi

Asgarkhani

Jankowski

(2024) Optimization-based stacked machine-learning method for seismic probability and risk assessment of reinforced concrete shear walls. Expert Systems with Applications 255: 142897.

22.

Meng

Finley

, et al. (2017) LightGBM: a highly efficient gradient boosting decision tree. Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, CA, USA: Curran Associates Inc., 3149–3157.

23.

Kiani

Camp

Pezeshk

(2019) On the application of machine learning techniques to derive seismic fragility curves. Computers & Structures 218: 108–122.

24.

Kojić

Trifunac

Anderson

(1984) A Postearthquake Response Analysis of the Imperial County Services Building in El Centro. Report CE84-02, Department of Civil Engineering. Los Angeles, California, United States: University of Southern California.

25.

Lin

Xie

Gong

, et al. (2010) Performance-based methodology for assessing seismic vulnerability and capacity of buildings. Earthquake Engineering and Engineering Vibration 9(2): 157–165.

26.

Tian

Wang

, et al. (2018) A numerical coupling scheme for nonlinear time history analysis of buildings on a regional scale considering site-city interaction effects. Earthquake Engineering & Structural Dynamics 47(13): 2708–2725.

27.

Lundberg

Lee

(2017) A unified approach to interpreting model predictions Proceedings of the 31st International Conference on Neural Information Processing System. Beach, CA, USA: Long.

28.

Malhotra

(1999) Response of buildings to near-field pulse-like ground motions. Earthquake Engineering & Structural Dynamics 28(11): 1309–1326.

29.

Mangalathu

Sun

Nweke

, et al. (2020) Classifying earthquake damage to buildings using machine learning. Earthquake Spectra 36(1): 183–208.

30.

Martakis

Reuland

Stavridis

, et al. (2023) Fusing damage-sensitive features and domain adaptation towards robust damage classification in real building. Soil Dynamics and Earthquake Engineering 166: 107739.

31.

Mazzoni

Mckenna

Scott

, et al. (2006) Open system for earthquake engineering simulation user command-language manual. In: Pacific Earthquake Engineering Research Center. Berkeley: University of California.

32.

Molnar

(2020) Interpretable machine learning: a guide for making black box models explainable. North Carolina: Lulu com.

33.

Nafeh

AMB

O’Reilly

Monteiro

(2019) Simplified seismic assessment of infilled RC frame structures. Bulletin of Earthquake Engineering 18(4): 1579–1611.

34.

Wang

(2024) Long-range ising model for regional-scale seismic risk analysis. Earthquake Engineering & Structural Dynamics 53(12): 3904–3923.

35.

Pardoen

(1983) Ambient vibration test results of the Imperial county services building. Bulletin of the Seismological Society of America 73(6A): 1895–1902.

36.

Park

(2025) Machine learning-based strain response prediction method for structural members of a building using ground motion data and intensity measures. Advances in Structural Engineering 28(10): 1890–1909.

37.

Prasittisopin

Zain

Keawsawasvong

, et al. (2025) Modal decomposition and genetic algorithm–based vulnerability evaluation of high‐rise structures—an assessment of structural optimization methodologies for fragility curves development. The Structural Design of Tall and Special Buildings 34(7): e70032.

38.

Prokhorenkova

Gusev

Vorobev

, et al. (2018) CatBoost: unbiased boosting with categorical features. Proceedings of the 32nd International Conference on Neural Information Processing Systems. Montréal, Canada: Curran Associates Inc., 6639–6649.

39.

Rathje

Abrahamson

Bray

(1998) Simplified frequency content estimates of earthquake ground motions. Journal of Geotechnical and Geoenvironmental Engineering 124(2): 150–159.

40.

Reuland

Lestuzzi

Smith

IFC

(2019) An engineering approach to model-class selection for measurement-supported post-earthquake assessment. Engineering Structures 197: 109408.

41.

Rojahn

Mork

(1981) An Analysis of strong-motion Data from a Severely Damaged Structure, the Imperial County Services Building, El Centro, California. Menlo Park, California, United States: Report for US Geological Survey.

42.

Roohi

Hernandez

(2020) Performance-based post-earthquake decision making for instrumented buildings. Journal of Civil Structural Health Monitoring 10(5): 775–792.

43.

Rosti

Rota

Penna

(2018) Damage classification and derivation of damage probability matrices from L’Aquila (2009) post-earthquake survey data. Bulletin of Earthquake Engineering 16(9): 3687–3720.

44.

Saito

Spence

RJS

Going

, et al. (2004) Using high-resolution satellite images for post-earthquake building damage assessment: a study following the 26 January 2001 Gujarat earthquake. Earthquake Spectra 20(1): 145–169.

45.

Shan

Huang

Wang

, et al. (2024a) Data-driven prediction of natural period for existing RC high-rise buildings using probabilistic machine learning methods. Journal of Building Engineering 90: 109394.

46.

Shan

Wang

Loong

, et al. (2023) Rapid seismic performance evaluation of existing frame structures using equivalent SDOF modeling and prior dynamic testing. Journal of Civil Structural Health Monitoring 13(2): 749–766.

47.

Shan

Huang

Loong

, et al. (2024b) Rapid full-field deformation measurements of tall buildings using UAV videos and deep learning. Engineering Structures 305: 117741.

48.

Shan

Zhuang

Chao

, et al. (2024c) Model updating of a shear-wall tall building using various vibration monitoring data: accuracy and robustness. The Structural Design of Tall and. Special Buildings 33(11): e2114.

49.

Shome

Cornell

Bazzurro

, et al. (1998) Earthquakes, records, and nonlinear responses. Earthquake Spectra 14(3): 469–500.

50.

Snoek

Larochelle

Adams

(2012) Practical bayesian optimization of machine learning algorithms. Proceedings of the 26th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada, USA: Curran Associates Inc., 2951–2959.

51.

Tocchi

Misra

Padgett

, et al. (2023) The use of machine-learning methods for post-earthquake building usability assessment: a predictive model for seismic-risk impact analyses. International Journal of Disaster Risk Reduction 97: 104033.

52.

Todorovska

Trifunac

(2008) Impulse response analysis of the Van Nuys 7-storey hotel during 11 earthquakes and earthquake damage detection. Structural Control and Health Monitoring 15(1): 90–116.

53.

Wang

Shan

(2024) Post-event evaluation of residual capacity of building structures based on seismic monitoring. Journal of Civil Structural Health Monitoring 14(7): 1611–1628.

54.

Dong

, et al. (2024) Seismic intensity measure selection incorporating interaction effects for damage assessment across different structural sensitive regions. Structures 67: 106917.

55.

Xiong

Lin

, et al. (2017) Parameter determination and damage assessment for THA-based regional seismic damage prediction of multi-story buildings. Journal of Earthquake Engineering 21(3): 461–485.

56.

Xiong

(2020) Automated regional seismic damage assessment of buildings using an unmanned aerial vehicle and a convolutional neural network. Automation in Construction 109: 102994.

57.

, et al. (2020) Collapse capacity of inelastic single-degree-of-freedom systems subjected to mainshock-aftershock earthquake sequences. Journal of Earthquake Engineering 24(5): 803–826.

58.

Zain

Dackermann

Prasittisopin

(2024a) Machine learning (ML) algorithms for seismic vulnerability assessment of school buildings in high-intensity seismic zones. Structures 70: 107639.

59.

Zain

Kupwiwat

Kang

, et al. (2025) Establishing analytical vulnerability information for non-linear low-rise (1-to 3-storey) school building models. Steel and Composite Structures 56(6): 551–563.

60.

Zhang

Cheng

, et al. (2023) Rapid seismic damage state assessment of RC frames using machine learning methods. Journal of Building Engineering 32: 105797.

61.

Zain

Prasittisopin

Mehmood

, et al. (2024b) A novel framework for effective structural vulnerability assessment of tubular structures using machine learning algorithms (GA and ANN) for hybrid simulations. Nonlinear Engineering 13(1): 20220365.

62.

Zhang

Chen

Crempien

JGF

, et al. (2023) Regional-scale seismic fragility, loss, and resilience assessment using physics-based simulated ground motions: an application to Istanbul. Earthquake Engineering & Structural Dynamics 52(6): 1785–1804.

63.

Zhang

Reuland

Shan

, et al. (2024) Post-earthquake structural damage assessment and damage state evaluation for RC structures with experimental validation. Engineering Structures 304: 117591.

64.

Zhang

Lei

Chan

, et al. (2024) Integrating physics-informed machine learning with resonance effect for structural dynamic performance modeling. Journal of Building Engineering 84: 108627.