Abstract
This article presents a suite of response spectrum (RS) and Fourier amplitude spectrum (FAS) site amplification models for Central and Eastern North America (CENA). The amplification database used in model development was produced through large-scale one-dimensional site response analyses and overcomes limitations of prior databases by providing broader coverage of anticipated site conditions. New amplification functions conditioned on either VS30 or site natural period (Tnat) as the primary independent variable are provided, the latter of which better captures features of the computed site amplification from the simulated database (i.e. model dispersion is reduced). Models for standard deviation also are developed. RS and FAS linear amplification functions that adjust the reference condition of VS = 3000 m/s to VS30 = 760 m/s, and account for amplification for profiles either with a sharp VS impedance or a more gradual increase in VS are also developed.
Keywords
Introduction
Site amplification factors are used to compute ground motion intensity measures (GMIMs) of interest at the ground surface condition through adjustment of GMIMs at reference conditions. Site amplification factors can be derived for specific sites (nonergodic site response) or can be derived for large ensembles of sites, where the site amplification is conditioned on different predictive variables (ergodic site response models). In recent practice, ergodic models are regionalized to reflect specific geologic conditions in application regions, such as Central and Eastern North America (CENA), Western United States (WUS), and Cascadia.
The focus of this article is on site amplification in CENA, which has been investigated in a series of studies prior to, during, and since the Next Generation Attenuation for Eastern North America (NGA-East) project (Goulet et al., 2021). The work performed prior to NGA-East (∼2020), previously synthesized by Stewart et al. (2020), entailed one-dimensional (1D) ground response simulations for National Earthquake Hazards Reduction Program (NEHRP) site classes (Building Seismic Safety Council (BSSC), 1995) or specific regions such as the Mississippi Embayment or Charleston, SC. The work conducted as part of, or coincident with, NGA-East included both empirical studies (Hassani and Atkinson, 2016; Parker et al., 2019) and simulation-based studies (Harmon et al., 2019a), which was coordinated by the NGA-East Geotechnical Working Group (GWG); the results of the NGA-East work and prior work were synthesized to develop recommendations for linear and nonlinear (NL) site response for use in the National Seismic Hazard Model (NSHM) (Hashash et al., 2020a; Stewart et al. 2020). Those models were developed using VS30 (time-averaged shear-wave velocity (VS) to a depth of 30 m) as the primary conditioning variable.
The applicability of VS30 as a representative site parameter for CENA has been questioned by several studies (Harmon et al., 2019b; Hashash et al., 2020a; Hassani and Atkinson, 2016, 2018; Nikolaou et al., 2012; Nikolaou et al., 2001). It is known that VS30 does not capture site-specific resonance effects, which can be appreciable for sediments with different depths to reference rock, or bedrock (Hashash et al., 2014). As an example, Figure 1(a) shows VS profiles from two sites (TYNO and PKRO), which are located in Tyneside and Pickering, Ontario, respectively, and are plotted up to their maximum explored depth of 70 m (Beresnev and Atkinson, 1997). The TYNO and PKRO sites have almost the same values of VS30 of 404 and 403 m/s, but their site fundamental frequencies (fpeak) are distinct (3.86 and 1.42 Hz, respectively) due to different depths to bedrock. These different resonant frequencies are evident in observed site amplifications (Hassani and Atkinson, 2016), as shown in Figure 1(b). Tyneside, Ontario (TYNO) shows peak amplification at higher frequencies relative to Pickering, Ontario (PKRO) because of the shallower bedrock location, but the latter exhibits observably larger site amplification for lower frequencies (<2.0 Hz). Examples such as these have motivated the development of models conditioned solely on fpeak, such as Zhao and Xu (2013) and Hassani and Atkinson (2016). While such models can capture site response at the peak, they tend to be less effective at capturing mean site response over a wide frequency range. As a result, a more common approach has been to consider VS30 and fpeak jointly. In CENA, coupled models of this sort have been presented by Harmon et al. (2019b) using simulations and by Hassani and Atkinson (2018) using empirical methods.

(a) VS profiles of two sites (TYNO and PKRO) from Beresnev and Atkinson (1997), and (b) natural logarithm of site amplification from mixed-effect regression of residuals of observed ground motion relative to GMM with respect to hard-rock reference condition from Hassani and Atkinson (2016).
Work conducted since the end of the NGA-East project (∼2020) by investigators independent of NGA-East has examined the effects of sediment depth (i.e. from ground surface to VS horizon of ∼3000–3500 m/s) on site response in the Atlantic coastal plain (CP) and Gulf Coast using newly available maps from Boyd et al. (2022). Those maps demonstrate that sediment depths across the Atlantic and Gulf CP can exceed 10,000 m, which exceeds the maximum depth (1000 m) of randomized VS profiles in this study as discussed in the next section. These studies employed a reference site approach using earthquake and microseism data, and the results suggest strong effects of sediment depth on low-frequency components of ground motion (Chapman and Guo, 2021; Pratt and Schleicher, 2021). These models are conditioned on depth alone as defined from the Boyd et al. (2022) maps (i.e. VS30 and Tnat are not used in tandem).
The simulation work by the GWG, which is denoted subsequently as the prior study, consisted of a parametric study of more than 1.7 million linear (L), equivalent-linear (EL), and NL site response analyses (SRA) to represent the variability and uncertainty in site stratigraphy, material properties, and ground motions at CENA sites. The simulations were used to produce several response spectrum (RS) and smoothed Fourier amplitude spectrum (FAS) site amplification functions derived using conventional NL regression methods (Harmon et al., 2019b). Since that work was completed, some limitations have been identified and are addressed here:
1. The diversity of CENA site conditions was not fully represented:
a. Gradient profiles, which are defined as those that lack a sharp impedance between the reference rock conditions and the overlying material, are known to exist in CENA (Stewart et al., 2020), particularly south of the southern limit of Wisconsin glaciation (Reed and Bush 2005), and were not represented in the generic site profiles.
b. Gaps in the distribution of VS30 and Tnat (site natural period) within the site database (Figure 2(b)) were observed.
2. A model for the standard deviation associated with the median model was not developed.

Comparison of site profile databases from Harmon et al. (2019a) and New Study as functions of (a) VS30 (m/s) and Z3.0 (m), and (b) VS30 (m/s) and Tnat (s).
These limitations motivated a second parametric study with additional VS profiles and updated parameter selection protocols to more accurately represent CENA site conditions and variability from the median predictions. Using these updated simulation results, a new suite of RS and FAS site amplification models is developed to provide functions (this article) and deep learning-based (artificial neural network (ANN)) site amplification models (upcoming article). The models are derived using (1) both VS30 and Tnat, and (2) only Tnat. Standard deviation relationships that represent the aleatory variability associated with the median model are proposed.
All the models proposed in this article estimate site amplification relative to a VS = 3000 m/s reference condition. For practical application, amplification factors that convert ground motions applicable to the VS = 3000 m/s reference condition to ground motions applicable to the softer reference condition of VS30 = 760 m/s (referred to as F760) for both impedance and gradient conditions (Stewart et al., 2020) are suggested.
Site amplification simulation study design
Limitations of prior simulation-based study
The GWG or prior simulation-based study (Harmon et al., 2019a,b; denoted as HEA19) has the following limitations pertaining to the parametric study design:
Discrete soil depth (ZSoil) values were used, which introduce gaps in the parameter space that affect the distributions of profiles when plotted in VS30-Tnat or VS30-Z3.0 (depth to VS = 3000 m/s) domains. ZSoil is defined as depth to weathered rock or reference rock.
Randomized VS profiles were derived using the Monte Carlo sampling (MCS) technique (Toro, 1995). This provides no constraint on the tails of the normally distributed random variable, which can cause unreasonably soft or stiff profiles to be generated. Moreover, if a limited number of realizations (e.g. less than 30) are used, the logarithmic mean of the randomized profiles might diverge from the representative seed VS profile.
The procedure used to generate VS profiles for simulations did not allow for velocity VS reversals (i.e. a VS decrease in an underlying layer), which can occur in real sites.
All of the seed profiles (i.e. prior to randomization) are underlain by either weathered rock or a hard-rock horizon (VS = 3000 m/s), leading to significant impedance contrasts in all profiles. However, profiles with a more gradual increase in VS with depth can occur, particularly below the southern limit of Wisconsin glaciation (e.g. Darragh et al., 2015; Frankel et al., 1996). Such profiles were referred to as gradient VS profiles in the development of F760 models (Stewart et al., 2020).
The procedures used to generate VS profiles through weathered rock zones (WRZs) are unable to capture some sites with high VS gradients and low WRZ thickness (Hashash et al., 2014).
In the development of soil NL properties, shear strength was considered as a limiting shear stress at large shear strains (∼10%). The strength values that were used include cohesion intercepts that are related to VS. These strengths have been found to be unreasonably high for soil layers at shallow depths.
Needs have also been identified in the formulation of the site amplification functions, including:
Amplification functions derived for some profiles have multiple peaks that result from multiple impedance contrasts within the profile. The prior VS30 and Tnat-based amplification functions are not able to capture these features.
The primary conditioning variable in the formulation of the amplification models was VS30, with other effects (site natural period, depth) modifying the VS30-conditioned amplification. Because some researchers have advocated for Tnat as the primary site parameter (Hashash et al., 2020a; Hassani and Atkinson, 2016), the model has been formulated with this feature.
The HEA19 FAS amplification model required VS30 and ZSoil, but due to the lack of ZSoil parameter for most CENA sites (Bayless, 2019, personal communication), it cannot be used in many practical applications. Hence, FAS models conditioned only on primary site parameters (VS30 or Tnat) are needed.
Standard deviation models are needed for probabilistic applications.
This study updates certain elements of the HEA19 parametric study and functional form to overcome the identified limitations and needs.
New parametric study plan
We adopt the parametric study design of Harmon et al. (2019a), but with significant updates as summarized in Figure 3. The study has two main branches in which the surficial materials are soil or rock (illustrated in the lower and upper parts of Figure 3, respectively). In the soil branch, the generic site profiles consist of three VS segments: (1) soil, (2) WRZs representative of the transition from soil to bedrock, and (3) a hard-rock reference condition of VS = 3000 m/s (Hashash et al., 2014). These profiles are developed through VS randomization using 10 geology-based representative seed VS profiles, which were originally created in the work by Harmon et al. (2019a). More than 800 VS profiles from 60 different references (i.e. the literature (Ahdi et al., 2018; Kottke et al., 2023) and open geotechnical reports (Figure A.1) as summarized in Table B1 of Harmon et al. (2019a)) were compiled to develop these profiles. Then, gathered data were classified using the geologic classes proposed in the work by Kottke et al. (2012), and the ultimate seed VS profiles were produced as detailed in Appendix A.2.1. To assign NL properties to soil layers, the following parameters were considered: (1) index and stress history soil properties (unit weight γ, over-consolidation ratio, soil friction angle ϕ, and plasticity index (PI)); (2) shear modulus G reduction and damping D (MRD) curves, which are conditioned on stresses in combination with properties from (1); (3) soil shear strength, which is used to constrain the large-strain MRD components; and (4) small-strain damping (Dmin) values. As a result, the sites in the parametric study, which includes 147,420 unique site profiles, can be described as follows:
1 Representative seed VS profiles, 13 of which are for soil (Figure A.4) and have assigned geology-dependent soil properties (Figure 3) and 3 of which are for WRZ (Figure A.6). These profiles are used as input for VS statistical randomization (Toro, 1995).
2 Thirty random statistical realizations of the VS profile for each representative seed VS profile (from Figures A.8 to A.20).
3 Three random statistical realizations of modulus reduction (G/Gmax) and damping curves (MRD) by Darendeli (2001) for mean, systematically high (+ε), and systemically low (−ε) curves. The MRD randomization is assumed to be independent of the VS randomization. Shear strength is not randomized.
4 Eighteen uniformly distributed bins of soil depth from the ground surface (from 5.0 to 1000 m), ZSoil, which establish the depth limit of the generic site profiles.
5 Seven models to represent the WRZ horizon and corresponding material properties.

Parametric study tree. Weathered rock zone represents the depth to weathered rock, and the acronyms and abbreviations are presented in Table A.2 of Appendix A.
For near-surface rock conditions (lower branch in Figure 3), the depth to weathered rock, WRZ, is placed at the surface underlain by reference rock. In this branch, 630 unique site profiles are generated, with 3 seed WRZ VS profiles, 30 VS profile realizations, and 7 WRZ models. The elements of this branch are similar to those of the upper branch except the truncation of profiles at the uniformly distributed depth bins. All simulations are performed using DEEPSOIL V7.0 (Hashash et al., 2020) in the DesignSafe (Rathje et al., 2017) environment as detailed in Appendix A.4.
In total, the parametric analyses consist of 1,218,945 NL, EL, and linear 1D SRA (for a total of 3,656,835) conducted using 148,050 unique generic site profiles and 247 motions (Appendix A.1) randomly assigned to the site profiles.
Improvements relative to prior parametric study plan
The parametric study described in the previous subsection improves upon that used previously (HEA19) in the following aspects:
Uniformly distributed ZSoil bins are adopted over the depth range of 5–1000 m, resulting in significantly more parameter combinations in VS30-Z3.0 space (Figure 2(a)) and VS30-Tnat space (Figure 2(b)).
Latin hypercube sampling (Figure A.7) is introduced into the work by Toro (1995) statistical randomization technique in lieu of MCS to produce randomized VS profiles whose log-mean is almost equal to or close to the seed VS profile (Figure 4(b)).
VS reversals (Figure 4(b)) are permitted through an interlayer correlation coefficient (ρ) value of 0.8, which is calculated from collected VS profiles as given in the work by Ilhan (2020) and is compatible with the value used in the work by Pehlivan et al. (2015).
A VS gradient model, which gradually increases from the bottom of the soil horizon to the top of bedrock and is included in WRZ column of Figure 3, is added to represent gradient VS profiles. An example of such VS profile appended to a 55 m deep soil profile is shown in Figure 5, with the other WRZ models used in this study. The linear site response simulations conducted using these profiles are utilized to provide a new set of 760–3000 reference adjustment factors (F760).
Two new WRZ models (red symbols in Figure A.5) with depths of 3 and 5 m for the near-surface rock branch are added to represent WRZ over bedrock conditions (Figure A.6).
A soil strength-VS relationship from Agaiby and Mayne (2015) is used to more reasonably capture strength profiles at shallow depth (further information in Appendix A.3).

VS realization of RR representative seed VS profile with logarithmic mean, 95% confidence interval of the logarithmic mean of VS realizations, and the representative seed VS profiles with respect to depth of 1000 m from (a) Harmon et al. (2019a) and (b) New Study. RR: Residual material from metamorphic and igneous rock.

New weathered rock zone model given under IEA title along with models proposed in the work by Harmon et al. (2019a) and Darragh et al. (2015), denoted as HEA19 and DEA15, respectively.
Improvements relative to prior amplification functions
The HEA19 site amplification models are summarized in Table 1. This article updates the HEA19 functions as follows:
After observing the constant linear amplification at low-to-mid VS30 sites for short periods (Figure 6), the functional form of the VS30-based model (i.e. L1 model) was revised to predict a flat region (L1 in Figure 7(a)). This change was made because the linear RS amplification is approximately flat for ∼120 m/s ≤ VS30 ≤ 500 m/s.
The functional form for the first mode of amplification near the natural period (Tnat) and higher modes now includes a combination of the Ricker wavelet and Gaussian functions, with the latter added to represent second-mode peaks.
A new model was developed in which the primary conditioning variable was Tnat for RS and fnat for FAS. This first-mode period/frequency can be evaluated from a VS profile or from horizontal-to-vertical spectral ratio (HVSR) data.
Tabular values of FAS amplification are replaced with regression equations having the same form as those for RS amplification.
Standard deviation (σ) relationships that describe the dispersion of simulation results relative to the median models are developed. The σ model in the work by Hashash et al. (2020a), HEA20, is revised to better represent the dispersion of residuals for NL amplification model.
The functional forms of linear RS amplification models in the work by Harmon et al., (2019a)
R * is identical to the Equation 6. NL: nonlinear;

(a) Linear RS ln(amplification) together with the binned mean of linear RS ln(amplification) as function of VS30 and Tnat for TOSC of 0.1 s, and (b) linear FAS ln(amplification) along with the binned mean of linear FAS ln(amplification) for VS30 and Fnat for f of 10.0 Hz. The Fnat bins are the reciprocal of Tnat bins. RS: response spectrum; FAS: Fourier amplitude spectrum.

Comparison of estimations from L1 and L1 (HEA19) model calculated using Harmon et al. (2019b) coefficients along with linear response spectrum ln(amplification) and its binned mean ± 1σ for (a) 0.1 s and (b) 1.0 s. The residuals of L1 and L1 (HEA19) models are given for (c) 0.1 s and (d) 1.0 s.
Details of these changes are presented in the following subsections.
RS amplification
The amplification is formulated in natural log units as the sum of linear (Flin) and NL (Fnl) components (Seyhan and Stewart, 2014):
The linear component has two model terms, the second of which is optional,
An alternative form of the linear component is conditioned only on Tnat (similar to Hassani and Atkinson, 2016),
The NL function,
List of all RS and FAS site amplification models
K1 and N1 components are regressed only for RS amplification.
f(Tnat) and f(fnat) are used for RS and FAS amplification models, respectively.
NL: nonlinear; SA: spectral acceleration; PGA: Peak ground acceleration.
VS30 scaling
VS30 is commonly used as an independent variable in site amplification functions (e.g. Borcherdt, 1994; Parker et al., 2019). The VS30-based L1 model (in which VS30 is the sole site parameter) is given as,
where c is the slope of the model component producing linearly varying ln(amplification) with ln(VS30) between corner velocities of V1 and V2, Vb and Vu are the model coefficients that vary with period. Constant amplification is modeled for VS30 less than V1, replacing the decay function (i.e. decreasing amplification as VS30 decreases) in HEA19, and constant amplification (constrained by simulations) is also modeled at VS30 greater than V2. For VS30 greater than Vu, the amplification decreases linearly to 0.0 between Vu and 3000 m/s. Equation 4 is modified from the FV term (representing VS30 scaling) given by Stewart et al. (2020) (denoted as SEA20) but with Vref (760 m/s) in SEA20 replaced with Vb in Equation 4.
Figure 6(a) shows trends of simulated amplification results for 0.1 s RS as a function of VS30. The figure shows binned means for all results and for sites with selected values of Tnat; the Tnat results are addressed in the next subsection. The overall results in Figure 6(a) show that the amplification is constant with respect to VS30 for 150 m/s ≤ VS30 ≤ 750 m/s as represented by the first term in Equation 4 (this behavior is also observed for bins of Tnat ≤ ∼1.1 s, but not for some higher periods). Notably, this VS30-independent amplification is not observed in empirical data (Parker et al., 2019). However, the focus of this article is on fitting the simulated data and not on VS30 models that would be recommended for application. For higher velocities, the overall site amplification reduces linearly with ln(VS30) and is represented by the second and fourth terms in Equation 4.
Figure 7(a) and (b) compares the L1 estimates from Equation 4 and from the model of HEA19 for TOSC (oscillator periods) of 0.1 and 1.0 s. Even though these models are similar for TOSC of 1.0 s (Figure 7(b)), the HEA19 L1 model significantly deviates from the L1 model of this study at TOSC ≤ 0.1 s due to the decay in ln(amplification) for low VS30 sites. At TOSC = 0.1 s, the L1 and L1 (HEA19) models are biased positive and negative, respectively, relative to the simulated amplifications at VS30 < 200 m/s (Figure 7(c)), and both models are unable to capture the peak amplification around VS30 = 1000 m/s. The observable bias in L1 estimations is anticipated since it does not include Tnat term in its functional form and is, therefore, unable to capture influences of site resonances, which have the effect of reducing amplification at low VS30 sites at illustrated in Figure 6(a). This is addressed in subsequent sections by enhancing the model through the use of the Tnat parameter, both in the VS30-scaling model (as depicted in Figure 6) and by the introduction of peaks in the amplification function (resonance effects). The coefficients of the L1 model are presented in Figure B.1.
Tnat effects
The trend of site amplification with VS30 is Tnat-dependent, as can be seen in Figure 6(a). In the case of short-period RS, as Tnat increases, the amplification decreases, which is caused by increased material damping from deeper profiles. However, we argue that amplification effects related to Tnat should principally be directed toward the modeling of peaks related to site resonances and with the decay in amplitudes that reflect cumulative damping effects that depend on soil depth. As described in the previous section, the 1D simulations were performed using two types of profiles: (1) impedance and (2) gradient sites. The former is expected to produce a peaked response due to resonance effects caused by the existence of a sharp impedance contrast between the WRZ, or reference rock of VS = 3000 m/s (Hashash et al., 2014), and the overlying soil material.
Justification for the addition of the G component of the model in Equation 5 is provided by plotting residuals of the VS30-scaling model (L1) versus

(a) The binned residuals of the L1 response spectrum model computed as the difference between linear ln(amplification) and L1 estimations as a function of TOSC/Tnat for TOSC of 0.01, 0.1, 0.3, 0.6, 1.0, 3.0, and 5.0 s, and (b) the residuals of the L1 as a function of TOSC/Tnat for TOSC of 0.3 s along with f(Tnat) and f(Tnat)-HEA19 model estimations.
For the modeling of Tnat effects on site amplification, a framework is adopted from HEA19, in which the amplification from a VS30-only model (Equation 4) is modified using a period-dependent equation. HEA19 used a Ricker wavelet centered near Tnat with ramp functions at shorter and longer periods. That approach is updated here by adding a second-mode peak that is assumed to occur at 0.31Tnat and is described by a Gaussian function. The model is formulated as follows:
where c5, c6, and c7 are period-dependent coefficients, and R is the Ricker wavelet term to capture the first-mode peak amplification at TOSC/Tnat≈ 0.81 together with two troughs next to it:
and the second-mode peak is captured by Gaussian term denoted as G:
where α1 and α2 are period-dependent regression coefficients, and β1 and β2 are given as:
The G component is only applied for TOSC < 3.0 s since the second mode of peak amplification seems to disappear for TOSC > 3.0 s (Figure 8(a)).
Figure 9(a)–(d) compares computed amplifications for the L5 model in this study and HEA19 and the residuals of L5 models, respectively, for TOSC of 0.1 and 1.0 s. The linear amplification data are plotted as dispersed points in log(VS30) space since each randomized profile terminates at different depths beyond 30.0 m (Figure 3), resulting in profiles with the same VS30 but different Tnat values, and thus different amplifications. As expected, this condition is observed in L5 estimations. The only appreciable differences between the two generations of L5 models are for 100 m/s ≤ VS30 ≤ 200 m/s; the updated model provides residuals closer to zero. The amplification behavior in this VS30 region is strongly influenced by profile damping, which is captured much better when Tnat is included as an independent variable. For VS30 > 200 m/s, simulation results and model fits are similar. The coefficients for f(VS30) and f(Tnat)L5 are illustrated in Figures B.1 and B.2, respectively.

Comparison of estimations from L5 and L5 (HEA19) model calculated using Harmon et al. (2019b) coefficients (HEA19) along with linear response spectrum ln(amplification) and means of ln(amplification) ± 1σ for (a) 0.1 s and (c) 1.0 s. The models’ residuals with their means ± 1σ are given for (c) 0.1 s and (d) 1.0 s.
Model with Tnat as primary independent variable
As noted previously, an alternative to the L5 model (Equations 4 and 5) in which Tnat is used as the only independent variable (Equation 3) is provided. Denoted as L6, the form of the model was selected based on inspections of amplification plots (relative to reference rock) as shown in Figure 10(b), in which the mean of amplification for soil over bedrock sites and WRZ over bedrock site is plotted with respect to log(TOSC/Tnat). Two principle trends are observed:
Soil sites: These sites characteristically have a region with a series of peaks and troughs associated with TOSC ∼ Tnat and TOSC < Tnat (i.e. first and higher modes of the site) along with an amplification decay region at short periods (TOSC << Tnat) due to soil-damping effects. These trends are similar to those observed in the mean of L1 model residuals (Figure 10(a)).
WRZ sites: Amplification decreases with increasing log(TOSC/Tnat), which is distinct behavior relative to that at soil sites.

Comparison of (a) L1 model residuals for all sites and (b) binned mean of linear amplification at soil and weathered rock zone sites in TOSC/Tnat space for TOSC of 0.001, 0.01, 0.1, 1.0, 3.0, and 10.0 s.
Given the different trends, separate relationships for amplification of soil and WRZ sites were developed. The soil model, denoted f(Tnat)L6, Soil, is formulated as:
This model is an updated version of f(Tnat)L5 using a second-order polynomial (i.e. the component with c6,i coefficients) to better capture the decay in amplification due to soil-damping effects and the shape of amplification at very short periods (e.g. for TOSC of 0.001 s in Figure 10(b)). The resulting fit to the amplification data and the model’s residuals are shown in Figure 11(a)–(d), respectively, and these plots show an excellent fit to the data.

L6 model (both for soil weathered rock zone site amp) and linear response spectrum ln(amplification) along with its binned mean ± 1σ in evenly distributed log(TOSC/Tnat) space for TOSC of (a) 0.1 s and (b) 0.1 s. The residuals are given for TOSC of (c) 0.1 s and (d) 1.0 s.
For WRZ sites, the amplification function is given as,
where t5 and t6 are period-dependent coefficients. Figure 11 also illustrates this model and its residuals, which are shown to be very close to zero. The coefficients of L6 models are plotted in Figures B.3 and B.4. The WRZ model (Equation 10) should be used for profiles that lie within the VS30 and Tnat parametric ranges of WRZ profiles in this study, which are VS30 > 1000 m/s and Tnat < 0.1 s (Figure 2). Otherwise, the soil model (Equation 9) would apply.
NL and total amplification
HEA19 accounts for NL amplification (Fnl) using the difference between NL and L SRA and omits the EL analyses due to its tendency to overdamp high-frequency components (especially for frequencies (f) > 2.0 Hz; Kim et al., 2016). However, NL analyses produce lower spectral ratios (ratio of spectral acceleration (SAr) at ground surface to that of input motion) than those from EL because of phase incoherence introduced by NL for f > 25.0 Hz (Rathje and Kottke, 2011). Thus, the envelope of NL and EL SRA is adopted such that the maximum of the surface response is taken across all TOSC to avoid reductions in NL amplification at large frequencies. This indicates that two different sets of amplification results can be considered, as follows:
NL amplification: (a) The difference between NL and L SRA, and (b) the difference between the envelope of NL and EL simulations, which is denoted as envelope, and L SRA
Total amplification: (a) NL and (b) enveloped results
where Ir represents the outcropping rock motions (VS = 3000 m/s) in one of two possible forms—SAr for a given TOSC or PGA at rock (PGAr) (these models are denoted as N1 and N2, respectively) and f2 is a coefficient that associates the degree of nonlinearity with the stiffness of the site and is given as (modified from Chiou and Youngs, 2008),
where f3 represents the transition ground motion level from linear to NL site amp, and Ir is an intensity measure that represents the strength of the ground motion for reference rock conditions (VS = 3000 m/s). Vc is the velocity at which no amplification occurs relative to VS = 3000 m/s and comes from the L1 model regressions.
Figure 12(a) and (c) shows the NL-L amplification data as functions of VS30 and PGAr and fits from Equations 13 and 14. Some oscillations with respect to PGAr occur in the data due to different numbers of data points for different PGAr intervals. The fitted model from this study is compared with that from HEA19b, which generally provides similar results, although more nonlinearity is now predicted for VS30 (m/s) and PGAr (g) pairs less than (110, 0.35), (175, 0.45), and (270, 0.6) for 0.1 s. The N2 model is applied to the Envelope-L data set as given in Figure 12(b) and (d), producing larger amplification (less nonlinearity) relative to NL-L analyses. Using N2 coefficients regressed through the Envelope-L data provides future users with the option of working with larger amplification values. The residuals and the coefficients of N2 relationships are presented from Figures B.8 to B.11.

The estimations from N2 and N2 (HEA19) models for response spectrum NL site ln(amplification) from (a and c) the difference between NL and L simulations (NL-L analyses) and from (b and d) the difference between the envelope of NL and equivalent-linear (EL) simulations and L analyses (Envelope-L analyses) along with corresponding simulated amplification data (i.e. binned mean of amplification data) as functions of VS30 and PGAr for oscillator periods of 0.3 and 1.0 s. NL: nonlinear.
The total amplification estimates from L5 + N2 (this study) and L5 + N2 (HEA19) are assessed in Figure 13 for TOSC of 0.1 and 1.0 s. The enhancements of L5 + N2 (this study) over L5 + N2 (HEA19) are limited such that the binned mean residuals from the new model are slightly closer to zero at 200 m/s < VS30 < 400 m/s and 120 m/s < VS30 < 150 m/s for 0.1 and 1.0 s, respectively. In addition to L5 + N functions, K models defined in Table 2 and the N function using NL-L and Envelope-L amplification are discussed in Appendix B.1.

Comparison of estimations from L5 + N2 and L5 + N2 (HEA19) computed using Harmon et al. (2019b) coefficients along with total response spectrum ln(amplification) from nonlinear simulations and its binned mean ± 1σ for (a) 0.1 s and (b) 1.0 s. In addition, the models’ residuals are presented for (c) 0.1 s and (d) 1.0 s.
FAS amplification
The natural log of the FAS amplification (ES) is defined as the sum of modular linear (Elin) and NL amplification (Enl) terms in a similar manner to RS amplification,
The same functional forms are used for FAS amplification components as described previously for RS amplifications, based on the similarity of the data sets (Figure 6 and Figure B.21 for linear and NL, respectively). For the Tnat-effect models, L5 is re-cast using the independent variable of natural frequency (
where R and G terms are analogous to the Equations 6 and 7, respectively, and β1 and β2 in the R and G terms of Equation 14, respectively, are given as,
The coefficients of the f(VS30) part of the L5 model are given in Figure B.25.
Figure 14 compares L5 and tabulated values of HEA19 as functions of VS30 and ZSoil for frequencies of 10 and 1.0 Hz. L5 seems to reasonably capture the overall linear FAS amplification, although the residuals of L5 exhibit fluctuations across the VS30 space indicating the model’s inability to capture the complex trends in the FAS amplifications. We considered adding additional terms to Equation 14 to bring the binned means of residuals closer to zero, but ultimately decided against this to avoid over-complicating the functional form and introducing coefficients that might be poorly constrained. There are two issues related to the use of the HEA19 model: (1) the ZSoil information might not be available for sites of interest, and (2) the tabulated values of VS30 and ZSoil do not seem to cover the data space accurately. Thus, a continuous L5 model conditioned on VS30 and Tnat is preferred over HEA19. The attributes of the Fnat-only relationship (L6) and NL and total FAS amplification models are discussed in Appendix B.2.

Comparison of estimations from L5 and L5 (HEA19) tabulated values as functions of VS30 and ZSoil, and along with linear Fourier amplitude spectrum ln(amplification) and its binned mean ± 1σ for (a) 10.0 Hz and (b) 1.0 Hz. In addition, the models’ residuals are presented for frequency (f) of (c) 10.0 Hz and (d) 1.0 Hz.
Reference site correction factors from 760 to 3000 m/s condition
In the development of ergodic site amplification models for practical application, both empirical and simulation-based results are considered (Stewart et al., 2020). The empirical models are able to predict relative levels of amplification within the range over which the independent variable occurs at ground motion recording sites. The primary independent variable that is considered is VS30, and its range of observed values in CENA is about 200 to 2000 m/s (Parker et al., 2019). This presents practical difficulties in predicting site response relative to the 3000 m/s reference condition. To overcome this problem, empirical amplification studies have evaluated amplification relative to the NEHRP B-C boundary (760 m/s), which is comfortably within the data range and is sufficiently stiff that complexities associated with NL site response are unlikely to be significant. Within this framework, amplification relative to VS = 3000 m/s is taken as the log sum of amplification relative to 760 m/s and amplification for sites with VS30 = 760 m/s relative to the reference condition VS = 3000 m/s (denoted as F760),
where ln(amplification)3000 and ln(amplification)760 are the natural logarithm of linear RS amplification relative to 3000 and 760 m/s, respectively. To be clear, ln(amplification)3000 has the same physical meaning as
The NSHM (Petersen et al., 2020) uses models for F760 that are derived from a series of simulations using two groups of soil profiles, one with strong impedance contrasts and another with relatively gradual increases of stiffness with depth (Stewart et al., 2020). One of the sets of simulation results that was considered in formulating those recommendations was for impedance profiles involving underlying hard-rock or overlying weathered rock or soil materials by HEA19. The gradient profiles that were considered were from other sources (Darragh et al., 2015; Frankel et al., 1996). In this study, consistent sets of F760 factors were derived for linear RS site amplification at impedance and gradient profiles with 700 m/s ≤ VS30 ≤ 800 m/s (Figure 15). The levels of diminutive parameter
where κ0,rock is the κ0 for the underlying rock half-space (0.006 s), and H and z are the profile thickness and depth, respectively.

F760 factors for impedance and gradient conditions along with those from and Stewart et al. (2020) and F760-impedance from HEA19 for linear response spectrum ln(amplification) (a and b). F760 factors compute from linear Fourier amplitude spectrum ln(amplification) are given in (a) and (c).
Figure 15(a) and (b) compares F760 results from this study with those used in the NSHM for impedance and gradient profiles. At impedance sites, larger F760 values are obtained for TOSC ≤ 0.4 s from this study and from HEA19 than those provided in SEA20, which is attributed to 83% of the simulations in this study utilizing profiles with κ0 < 0.01 s. A similar trend is observed for the gradient sites, and here, approximately 70% of the profiles have κ0 < 0.01 s. For mid-to-long periods (TOSC > 0.4 s), all F760 factors seem to converge due to limited soil-damping effects in this period range.
F760 obtained from the mean of the linear FAS amplification at sites with 700 m/s ≤ VS30 ≤ 800 m/s is presented in Figure 15(c) and (d). The FAS amplifications at short periods (high frequencies) decrease with decreasing period more strongly than is seen in RS amplifications, which is an expected consequence of oscillator response characteristics.
Residuals and dispersion analyses
To check the adequacy of the models developed in this work, residuals were computed as the differences of the natural log of computed amplifications (for each considered profile and combination of other dynamic soil properties) and model median prediction. In this section, the residuals are evaluated to check for bias and dispersion characteristics.
Figure 16 shows residuals for three major classes of models for linear simulations plotted as functions of VS30 and TOSC/Tnat: VS30-scaling L1, combined VS30-scaling and Tnat L5, and Tnat-only L6. For TOSC of 0.1 s, the averages of L5 and L6 models’ residuals in equally log-spaced VS30 and TOSC/Tnat bins exhibit analogous trends and are aligned with the zero line for 100 m/s < VS30 < 1000 m/s and for TOSC/Tnat < 1.0. On the contrary, residuals of the VS30-only L1 model when averaged in TOSC/Tnat bins (Figure 16(c)) diverge from zero, highlighting the importance of Tnat in influencing simulated site responses at short periods. Comparing long-period residuals of the L5 and L6 models binned with respect to TOSC/Tnat, averaged L6 residuals are greater than zero for VS30 < 300 m/s (Figure 16(b)) whereas L5 residuals do not, which indicates the value of including VS30 into the model form.

The residuals of L1 (VS30), L5 (VS30 and Tnat), L6-Soil (Tnat), and L6-weathered rock zone (Tnat) models along with their binned mean∓σ in VS30 and TOSC/Tnat spaces for TOSC of 0.1 s (a and c) and 1.0 s (b and d).
Next, the dispersion of the residuals is evaluated. Probabilistic seismic hazard analysis (PSHA) considers two types of dispersion in their formulation (McGuire, 2004; Rodriguez et al., 2014; Stafford, 2015): (1) aleatory variability, defined as the inherent randomness of a phenomenon, which cannot be reduced with additional data, and (2) epistemic uncertainty that results from the lack of knowledge and can be diminished through supplementary information. The dispersion of residuals is considered to be a combination of aleatory and epistemic because the VS profile uncertainty would be dramatically reduced if a site-specific profile was available (hence, it is epistemic), whereas the remaining variability related to motion-to-motion variability and variability in other dynamic properties is arguably mainly aleatory. Since it is a combination of uncertainty and variability, the residual dispersion is subsequently referred to simply as uncertainty.
The uncertainty in this study is represented by standard deviation (σ) relationships, which are computed from the dispersion of residuals. Figure 17(a) shows σ for linear RS L1, L5, L5 (HEA19), and L6 models as a function of TOSC. Figure 17(b) similarly shows σ for total RS amplification functions L5 + N2 and L5 + N2 (HEA19). The comparison of the model residuals indicates that
The L5 function given as the summation of f(VS30) in Equation 4 and f(Tnat)L5 in Equation 5 results in an ∼11% reduction in σ relative to the L5 model from HEA19.
Values of σ for f(Tnat)L6-Soil are similar to those for the L5 model except that the former is slightly larger than the latter for 0.2 < TOSC < ∼0.9 s. This indicates that a combined VS30 and Tnat model (L5) performed better than one conditioned on Tnat alone (L6).
L5 + N2 produces lower σ values relative to L5 + N2 (HEA19) for TOSC ≥ 1.0 s. This is a result of improvements in the f(Tnat) term.

Standard deviation (σ) of (a) linear L1 and L5 (HEA19) calculated using Harmon et al. (2019b) coefficients, L5 and L6 models, and (b) total L5 + N2 and L5 + N2 (HEA19) models as function of TOSC.
Figure 18(a) and (b) presents the σ of L1 and L5 residuals, respectively. For the Flin model, the trends in σ are described as,
where u1 and u2 are period-dependent coefficients; Vl1 is the limiting velocity below which σ is constant; and Vl2 is the limiting velocity beyond which σ is equal to u2. The proposed σ relationships for the L1 and L5 FAS models are presented in Figure B.37. For median L6 RS and FAS soil amplification models, the same function in Equation 17 is utilized, but Vl1, Vl2, and VS30 are replaced with T1, T2, and Tnat, respectively (Figure 18(d)). Moreover, a power relationship (

Standard deviation (σ) model estimations for (a) L1, (b) L5, (d) L6 RS (Soil), and (e) L6 RS (weathered rock zone) median site amplification functions along with σ of all these models’ residuals, and (c) σ of N2 RS model residuals as a function of VS30 and PGA at rock and estimations from newly proposed σ model and HEA20 for TOSC of 0.1 s. RS: response spectrum.
For the NL components of amplification, a prior model Hashash et al. (2020b) (HEA20) is updated due to the following concerns:
HEA20 significantly underestimates the σ from the residuals of N2 model as functions of VS30 and PGAr such that σ values from HEA20 for VS30 ≤ 650 m/s are significantly lower that those from simulated models’ residuals. (Figure 18(c)).
HEA20 produces unreasonably low σ (around 0.3) for low VS30 sites (175 m/s) and high PGA values (>0.5 s), where larger σ values (on the order of ∼0.65) from the simulations are observed (Figure 18(c)).
Accordingly, a new σ function is suggested for the NL model,
where σf is a period-dependent coefficient, and σf2 is given as:
The new σFnl function differs from HEA20 in terms of using Vn1 and Vn2 period-dependent coefficients instead of constant values of 300 and 1000 m/s. Figure 18(c) compares the
The two sources of uncertainty are assumed to be independent and can be combined as given in Equation 20 to produce σ for the total amplification (
The influence of

SEA20 linear model ∓1σFlin for (a) VS30 = 185 m/s, (d) VS30 = 250 m/s, SEA20 linear + HEA20 NL model ∓1σFS where σFlin from SEA20, and σFlin from HEA20 and from this study for (b) VS30 = 185 m/s and PGAr = 0.65 g, (c) VS30 = 185 m/s and PGAr = 0.95 g, (e) VS30 = 250 m/s and PGAr = 0.65 g, and (f) VS30 = 250 m/s and PGAr = 0.95 g.
Model limitations and application
The site amplification functions described in this article have limitations that should be considered in potential forward applications. These include
Assumptions of 1D site response: The simulated amplification data do not consider multidimensional wave propagation effects that can occur in sedimentary basins.
Representative VS profiles: Even though more than 800 profiles were compiled in HEA19 to produce the representative VS profiles for each geologic unit in CENA, limited profile data exist for depths (Z) > 100 m. To extend the profiles to larger depths and faster velocities corresponding to weathered and reference rock conditions, VS is extrapolated to account for the effects of increasing overburden pressure using
Strain level: Results from NL (and EL) SRA with calculated maximum strains greater than 1.0% were excluded during model development. This strain limitation was applied because we are not confident in the estimated shear strengths that were used in model development and wish to exclude results that are strongly influenced by such estimates (i.e. sites where large strains develop). Similar criterion can be given using the strain index, Iy (PGVr/VS30) used to predict the peak strain in these analyses (Kim et al., 2016). Figure 20 shows the dependence of Iy on the maximum strains computed in NL analyses; the 1% strain limit applied for NL analyses corresponds to Iy < 0.1%–0.2%, indicating that this Iy range can be accepted as the upper bound of model applicability.
Site profile: The use of all VS30-conditioned RS and FAS models is limited to sites with VS30 ≥ 180 m/s since profiles with VS30 < 180 m/s might contain a very soft layer resulting in a large impedance contrast where strain concentrations are expected. Such conditions can produce a “base isolation” behavior that is considered to be unrealistic. Moreover, the depths of site profiles in this study are limited to 1000 m, and thus, the proposed functions likely do not apply for deeper profiles such as in the CP region (Boyd et al., 2022).

Strain index, Iy, given as a function of maximum strain occurred in each nonlinear analyses along with Iy applicability limit as <0.1%–0.2%.
The L1 function presented above is modular, and the fundamental parameter to utilize these relationships is VS30. An alternative single-parameter model conditioned solely on Tnat (L6), which reduces uncertainties, is also provided. However, our recommendation is not to use an Tnat-only model but a combined VS30 and Tnat model (L5), which has improved capabilities for capturing site response over a wide frequency range. For NL models, there is no preference between the models based on PGAr and SAr, but modular L# + N# is recommended over K models. We provide the multiple models for NL response to enable users to consider model-based epistemic uncertainties if they choose. The RS relationships are applicable for 0.001 s ≤ TOSC ≤ 10.0 s. For the FAS relationships, this range is defined as 0.1 Hz ≤ f ≤ 33.0 Hz since the maximum frequency content (fmax) is limited to 50.0 Hz, and large scatter of the NL and total FAS amplification data leads to difficulties in the implementation of models for fOSC > 33.3 Hz (Ilhan, 2020). Both the RS and FAS models developed here provide amplification relative to VS = 3000 m/s.
The σ relationships share the applicable period / frequency bounds described above. The modular terms (e.g. linear f(Tnat)L5 or NL N2) or Tnat-based L6 from this study can be combined with linear amplification models from literature (e.g. the ergodic linear SEA20 relationship) as described in the work by Hashash et al. (2021).
Conclusion
This article develops a series of RS and FAS site amplification models for sites in CENA using a database of 3,656,835 1D simulations (1,218,945 each of linear, EL, and NL). The simulation database composed of 148,050 unique generic site profiles is approximately two times larger than the prior data set and incorporates a more comprehensive range of site conditions for model development. The VS30-scaling L1 relationship deviates from prior VS30-dependent models—whereas prior models had a decay in short-period amplification as VS30 decreases for sites with VS30 < ∼600 m/s, the current models provide flat amplification in this range. The prior L5 (VS30 and Tnat-dependent) RS models are modified to provide higher-mode amplification at sites with multiple VS impedance contrasts through a Gaussian term. This improvement reduces model uncertainties modestly (∼11%).
A new Tnat-conditioned L6 RS function is provided to capture resonance effects that are commonly observed in the simulated amplification results. The L6 model produces lower uncertainties than VS30-scaling model (L1). The uncertainties from L6 are similar to those from a combined VS30 and Tnat model (L5), which indicates that the influence of Tnat on simulated site amplification is strong and VS30 values add relatively little additional information. However, it is known from prior empirical studies that VS30-based models are effective over a wide frequency range, which leads to our recommendation to use coupled VS30 and Tnat models when practical. For the common case in which empirical models conditioned on VS30 are used to estimate linear amplification relative to 760 m/s sites, the Tnat-component of the L5 model can be used along with new F760 models in applications. The resemblance between the linear RS and FAS amplification is utilized to implement all RS functions for the modeling of FAS amplification. Standard deviation relationships for combined aleatory and epistemic uncertainties for median RS and FAS models are also proposed. Model limitations are described in the previous section.
Supplemental Material
sj-docx-1-eqs-10.1177_87552930231215723 – Supplemental material for Simulated site amplification for Central and Eastern North America: Data set development and amplification models
Supplemental material, sj-docx-1-eqs-10.1177_87552930231215723 for Simulated site amplification for Central and Eastern North America: Data set development and amplification models by Okan Ilhan, Youssef MA Hashash, Jonathan P Stewart, Ellen M Rathje, Sissy Nikolaou and Kenneth W Campbell in Earthquake Spectra
Supplemental Material
sj-xlsx-2-eqs-10.1177_87552930231215723 – Supplemental material for Simulated site amplification for Central and Eastern North America: Data set development and amplification models
Supplemental material, sj-xlsx-2-eqs-10.1177_87552930231215723 for Simulated site amplification for Central and Eastern North America: Data set development and amplification models by Okan Ilhan, Youssef MA Hashash, Jonathan P Stewart, Ellen M Rathje, Sissy Nikolaou and Kenneth W Campbell in Earthquake Spectra
Supplemental Material
sj-xlsx-3-eqs-10.1177_87552930231215723 – Supplemental material for Simulated site amplification for Central and Eastern North America: Data set development and amplification models
Supplemental material, sj-xlsx-3-eqs-10.1177_87552930231215723 for Simulated site amplification for Central and Eastern North America: Data set development and amplification models by Okan Ilhan, Youssef MA Hashash, Jonathan P Stewart, Ellen M Rathje, Sissy Nikolaou and Kenneth W Campbell in Earthquake Spectra
Supplemental Material
sj-xlsx-4-eqs-10.1177_87552930231215723 – Supplemental material for Simulated site amplification for Central and Eastern North America: Data set development and amplification models
Supplemental material, sj-xlsx-4-eqs-10.1177_87552930231215723 for Simulated site amplification for Central and Eastern North America: Data set development and amplification models by Okan Ilhan, Youssef MA Hashash, Jonathan P Stewart, Ellen M Rathje, Sissy Nikolaou and Kenneth W Campbell in Earthquake Spectra
Supplemental Material
sj-xlsx-5-eqs-10.1177_87552930231215723 – Supplemental material for Simulated site amplification for Central and Eastern North America: Data set development and amplification models
Supplemental material, sj-xlsx-5-eqs-10.1177_87552930231215723 for Simulated site amplification for Central and Eastern North America: Data set development and amplification models by Okan Ilhan, Youssef MA Hashash, Jonathan P Stewart, Ellen M Rathje, Sissy Nikolaou and Kenneth W Campbell in Earthquake Spectra
Supplemental Material
sj-xlsx-6-eqs-10.1177_87552930231215723 – Supplemental material for Simulated site amplification for Central and Eastern North America: Data set development and amplification models
Supplemental material, sj-xlsx-6-eqs-10.1177_87552930231215723 for Simulated site amplification for Central and Eastern North America: Data set development and amplification models by Okan Ilhan, Youssef MA Hashash, Jonathan P Stewart, Ellen M Rathje, Sissy Nikolaou and Kenneth W Campbell in Earthquake Spectra
Footnotes
Acknowledgements
The authors would like to thank Guangchao Xing of University of Illinois at Urbana-Champaign (UIUC) for the compilation of DEEPSOIL V7.0 computational core for Linux OS and Hao Chen of UIUC for the development of R codes for producing conventional models’ estimations. The scholarship granted by the Turkish Ministry of National Education for this Ph.D. study is also greatly acknowledged.
Disclaimer
The opinions, recommendations, findings, and conclusions in this article are of the authors and do not necessarily reflect the views or policies of the National Institute of Standards and Technology or the US Government.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
