Abstract
Highlights
For postmenopausal Dutch women who have been referred to the FLS, direct access to CT2S may be cost-effective compared with DXA for age groups 70s and 80s, when considering the ICER threshold of the Netherlands. This study positions CT2S as a potential novel osteoporosis-screening tool for secondary fracture prevention in the clinical setting.
A shorter screening interval of 2 y increases the effectiveness of both screening strategies, but the ICER of CT2S compared with DXA also increased substantially, which made CT2S no longer cost-effective for the 70s age group; however, it remains cost-effective for individuals in their 80s.
Annual zoledronic acid treatment with better adherence may contribute to a lower cost-effectiveness ratio when comparing CT2S to DXA screening and the treat all strategies for all age groups.
In the past decade, specialized fracture and osteoporosis clinics, called fracture liaison service (FLS), run by nurse practitioners and supervised by physicians, have been introduced in several countries, including the Netherlands. 1 Typically, FLS use dual-energy X-ray absorptiometry (DXA) screening for persons with previous fracture history to measure the areal bone mineral density (aBMD) and then offer treatment advice, if necessary, to the referring practitioner. 1 This radiological examination directly gives a measure of the bone mineral content, which is compared with epidemiological data to evaluate its deviation from the average healthy population. However, the predictive performance of DXA in terms of fracture risk prediction is not perfect,2–4 depending on the characteristics of the individual patient. For this reason, various authors have proposed an alternative measurement technique, called biomechanical computed tomography analysis (BCT), also known as a digital twin, using a low-dose CT scan and a patient-specific computer-based biophysics model to predict the fracture force of the femur.4–6 Computed Tomography To Strength (CT2S) is a particular implementation of the BCT, developed in collaboration between the University of Sheffield and the University of Bologna.3,6,7
CT2S is run as an online service (https://ct2s.insigneo.org/ct2s/) hosted at the University of Sheffield. The user (usually clinician) would make a job request via the Web site. The user then uploads anonymized patient CT scans through the Web site’s secure portal. The CT scans are downloaded by the CT2S operator. The femur shape is segmented in 3 dimensions and meshed using finite elements. The local material property of the femur is estimated based on the CT attenuation. This fully personalized model of the femur is then used to simulate various sideways fall scenarios including all feasible ranges of loading directions to determine the failure load of the bone (i.e., bone strength) and the weakest orientation. This information is then used to calculate the absolute risk of fracture at present, or time 0 (ARF0), which is the ratio between the number of falls (that led to fracture) over the total number of falls. The results are then summarized in a simple PDF file, which is sent back to the user who requested the job. 8
Evaluation of the CT2S approach showed better discrimination between the fractured and nonfractured groups in a pair-matched elderly cohort of 98 women, who are prone to osteoporosis. The area under the receiver operating characteristic curve (AUC) was 0.85 using CT2S compared with 0.75 using the aBMD-DXA as the predictor.3,6,7 This study was conducted in a single National Health Service (NHS) center at Sheffield in the United Kingdom. Similar results have also been shown in studies conducted in other countries, 9 using cohorts of different age, race, and sex. It is worth mentioning that other groups have used BCT to also analyze vertebral fracture. 9 However, such analysis has not been done using CT2S. Nevertheless, the question remains as to whether the higher costs associated with measuring the CT2S biomarker would be expected to translate into meaningful health benefits and whether those benefits would be expected to be cost-effective in osteoporosis screening in the Netherlands.
According to the Health Insurance Act (Zorgverzekeringswet), all residents of the Netherlands are legally obliged to take out standard health insurance. 10 The cost of examinations and medications prescribed by the general practitioner or medical specialist is covered by the standard health insurance (e.g., DXA screening). 11 Each year, the government updates the national health care scheme, and insurance companies are mandated to incorporate changes within their offered benefits. Health policy decision makers would be interested in acknowledging whether the novel osteoporosis screening tools are potentially cost-effective relative to DXA for particular populations in order to consider further development, investments, and reimbursement policies.
Early health technology assessment (HTA) is a methodology used to evaluate the potential value and feasibility of emerging innovations at their early stages. 12 Therefore, early HTA is necessary for providing initial insight by revealing the cost and health quality outcome of different osteoporosis-screening strategies. In this study, we adopted early HTA to assess the potential value of CT2S to be used as a novel osteoporosis screening tool. We used a 2-dimensional simulation to estimate data on which a cost-utility analysis (CUA) was conducted for 4 strategies: CT2S, the standard of care represented by aBMD (hereafter referred to as “DXA”), 13 treat all patients without screening, and no screening in postmenopausal women referred for assessment to the FLS.
Methods
Study Perspective
The following cost-effectiveness analysis was conducted from a health care system perspective, as it considered the costs and consequences incurred by the health care system.
Screening Scenarios and Strategies
The following scenarios were simulated: 3 age groups (60s, 70s, and 80 y), 2 screening intervals (5 y as the base case and 2 y in scenario analysis), 2 types of treatment (weekly oral alendronate as the base case and once-yearly intravenous infusion of zoledronic acid in scenario analysis), and 2 discount rates (4.0% and 1.5% for costs and effectiveness as the base case according to the Dutch guideline for economic evaluations in health care 14 and 3% rate for benefits as well as costs in scenario analysis). Each scenario consists of a combination of age groups, screening intervals, treatments, and discount rates, with 4 strategies: CT2S screening, DXA screening, treat all, and no screening (Figure 1).

Diagram of the screening model.
Screening Interval
In the literature, a variety of osteoporosis-screening intervals have been suggested. An Australian clinical guideline for osteoporosis prevention and treatment suggested that follow-up screening should not be scheduled fewer than 2 y apart. 15 A US study showed that repeating BMD measurement after 8 y adds little marginal value in terms of predicting fractures beyond the initial testing. 16 Another study from the University of Missouri–Columbia suggested that the rescreening interval should be 5 y for moderate osteopenia and 1 y for advanced osteopenia. 17 There is no clear guideline as to whether the DXA scan should be performed only once or regularly in FLS. However, the FLS model from a UK study indicated that FLS advises primary care clinicians to arrange follow-up DXA monitoring after completion of 5 y of treatment. 18 In addition, a longitudinal cohort study by White VanGompel et al. 19 showed that approximately 40% of low-risk women and 60% of high-risk or osteoporotic women were rescreened within 5 y.
Women who received drug treatment after their initial DXA scan had a higher likelihood of undergoing short-interval repeat DXA scans. 19 The frequency and type of follow-up screening for patients in FLS programs may vary depending on individual patient factors and should be determined by a health care provider in consultation with the patient. In this study, the recommended procedures in the literature were followed, and rescreening was simulated for all patients within the FLS, using a 5-y interval as a base case and a 2-y interval as an alternate case.
Screening Test Characteristics
The receiver-operating characteristic (ROC) curves for DXA and CT2S were obtained from the Sheffield cohort (98 subjects, 49 osteoporotic hip fractures, and 49 controls). 6 Detailed descriptions of the CT2S pipeline methods and results on the Sheffield cohort can be found in previous studies.3,6,7 A brief description is provided here.
The Sheffield cohort is a retrospective cohort of 98 postmenopausal women, divided equally into 2 groups: a fracture group and a control group. The fracture group (n = 49) consisted of women who had been diagnosed with low-energy trauma fractures in the proximal femur (mean age 75 y). The control group (n = 49) consisted of women who were pair-matched for age, height, and weight. All patients received a DXA scan (Hologic Discovery scanner, Hologic Inc, Bedford, MA, USA) and a bilateral proximal femur CT scan (LightSpeed 64 VCT, GE Medical Systems, Milwaukee, WI, USA).
From the individual CT scans, the proximal femur of each patient was segmented by a trained mechanical engineer using ITK-Snap 2.0.0 (University of Pennsylvania). The segmented bone was fitted with 10-node tetrahedral finite elements using an averaged mesh size of 3 mm (ANSYS software). The element-based material property was estimated from the CT attenuation using Bonemat v3.0.20,21 External load was applied to the personal-specific femur model in a range of possible sideways fall directions in ANSYS. The predicted maximum strain in the model was used to calculate bone strength for each individual. Subject-specific fall dynamics and hip impact mechanics were then incorporated into the model via ARF0 to obtain the final classification of fracture status and ROC curve.
To ensure a fair comparison, we chose biomarker thresholds to maximize the overall accuracy for both screening strategies (T score = −1.41 for DXA and ARF0 = 37.4% for CT2S). These sensitivities and specificities were used to determine the true-positive, false-positive, true-negative, and false-negative probabilities in the simulation model (Table 1). A percentile bootstrap method was used to construct 95% confidence intervals (CIs) for the sensitivity and specificity values. 50 The stratification accuracy of DXA and CT2S in identifying osteoporotic hip fractures has been used as the approximation of the osteoporosis-screening accuracy. In the screening arm, the model assumed that individuals identified as osteoporotic (true or false positives) would be treated with alendronate or zoledronic acid. We also modeled rescreening after a fracture. 51
Key Input Parameters in the Model
CI, confidence interval; CT2S, Computed Tomography To Strength; DXA, dual-energy X-ray absorptiometry; GHO, Global Health Observatory; RR, relative risk.
Sensitivity and specificity for DXA are from the results for the Sheffield cohort reported above.
The value was varied by ±20% to create the CI.
The percentage of patients staying on alendronate treatment was calculated based on the first- and fifth-year data declines in a linear manner; the same applies to obtaining the fifth- and sixth-year persistence data for zoledronic acid.
Standard deviation was assumed to be 20% of the mean.
The increment in the osteoporosis prevalence of the general population was used to generate the moderate estimation of the osteoporosis prevalence for the 60s and 80s age groups within the fracture liaison service.
Cost values were adjusted for the inflation rate in the Netherlands to year 2021.
The detailed calculation of the CT2S service cost can be seen in the Supplementary Appendix, Table 3.
Study Population
This study was focused on the Dutch postmenopausal women (from 60 y old) who have been referred to the FLS with a recent clinical fracture.
Structure of the Individual-Level Model
We used a discrete-time, discrete-state, individual-level (microsimulation) model to simulate osteoporosis disease trajectories to estimate lifetime costs and quality-adjusted life-years (QALYs) for each simulated individual under the various screening scenarios and strategies. The model considered 9 discrete health states: no osteoporosis, osteoporosis without experiencing any new fracture in our simulation, 3 main types of fractures (hip, vertebral, and wrist), the corresponding postfracture state and, finally, death (Figure 2). The model counted time from referral to an FLS until either death or age 100 y in discrete, 1-y time steps (cycles). According to the Dutch guideline for economic evaluations in health care, annual discount rates of 4.0% and 1.5% have been used for costs and effectiveness, respectively. 14 The model was constructed in TreeAge Pro 2021 R1 (TreeAge Pro Inc., Williamston, MA, USA).

Structure of the microsimulation model.
Validation of the Individual-Level Model
Internal and external validations for the model were conducted based on guidelines from ISPOR. 52 For internal validation, validation output generated from the model was used to compare with real, observed values from among the data used for creating the model. The prevalence of osteoporosis at different age groups were compared to assess the degree to which the model reproduced the observed data. For external validation, simulated validation outputs were compared with observed data from sources that were not used in the creation of the model: life expectancy and lifetime probability of fracture in women older than 50 and 80 y. The performed goodness-of-fit analysis was performed by fitting a linear relationship between modeled and observed data. Model validity was assessed via the squared linear correlation coefficient (R 2). These analyses were performed using SPSS Statistics version 25. Detailed validation approaches are elaborated on in the Appendix (sections “Interval Validation” and “External Validation”).
Two-Dimensional Simulation
We used a 2-dimensional simulation. 53 The 2 dimensions were defined in terms of parameter-level iterations and individual-level iterations. The parameter-level iterations captured input parameter uncertainty for parameters that were relevant for groups of persons, while the individual-level iterations (microsimulations) captured patient-level stochasticity. For each scenario consisting of a combination of starting age, screening interval, and CT2S cost, 10,000 parameter-level iterations were ran. For each parameter-level iteration, model input parameters were sampled from distributions. With each set of input parameters, 1,000 iterations of the individual-level model (microsimulations) were simulated as described above. Each sampled individual then entered the microsimulation model 4 times, once for each of the strategies. Lifetime costs and QALYs for each strategy were then averaged across individual-level iterations, and grand averages were computed as averages of the individual-level averages across parameter-level iterations. Distributions of the model outputs − average lifetime costs and QALYs among parameter-level iterations provided an estimate of the combined uncertainty related to input parameter uncertainty and patient stochasticity. Despite the lack of reported CIs or standard deviations for the health utility, osteoporosis prevalence, mortality rate, and annual fracture rate for the Dutch general population, we incorporated these values into our parameter-level sampling by estimating beta distributions. The standard deviation was estimated to be 20% of the mean (Table 1). Moreover, the screening interval and discount rates were considered to be fixed values.
Cost-Utility Analysis
Using the simulated data, we performed CUAs. For each starting age, screening interval, treatment, and discount rates scenario, the grand averages for the 4 strategies (CT2S, DXA, treat all, and no screening) were recorded. Incremental cost-effectiveness ratios (ICERs) were calculated as the difference in lifetime costs divided by the difference in QALYs between pairs of strategies. A strategy was considered cost-effective if the ICER was below the lower-bound ICER threshold set in the Netherlands (€20,000/QALY gained). 54 If a strategy had a lower estimated lifetime cost yet higher QALYs than its comparator, it was considered to dominate the comparator.
To assess the uncertainty in the CUAs, the net monetary benefit (NMB) was calculated for each parameter-level iteration and each strategy as λ× QALYij – Cij, where λ is the willingness-to-pay (WTP) threshold expressed as the average cost of generating 1 additional QALY in the Netherlands, i is the index of the parameter-level iteration, j is the index of the strategy (0 for no screening, 1 for DXA, 2 for CT2S, and 3 for treat all). QALYij and Cij represent the QALYs and average lifetime cost estimated for the i-th iteration of the j-th strategy, respectively. To illustrate the combined parameter-level and person-level uncertainty, we constructed cost-effectiveness acceptability curves (CEACs), where the value of λ was varied, and at each value, the proportions of parameter-level iterations for which each strategy had the highest NMB was determined. The cost-effectiveness acceptability frontier (CEAF) was also constructed for comparison.
One-Way Sensitivity Analysis
We performed deterministic 1-way sensitivity analysis (DSAs) to study the impact of changes of each input parameter on the simulation results when all other input parameters were sampled as per usual. Screening sensitivity and specificity, fracture cost, fracture risks, discount rates, treatment efficacy, medication cost, excess mortality, fracture disutilities, screening interval, and screening cost were included in the 1-way sensitivity analyses. In each DSA, most of the given parameter varied from 80% to 120% of its base-case value. 55 To reflect more uncertainty of the CT2S test characteristics, we expanded the sensitivity analyses of the CT2S test characteristics (both sensitivity and specificity) by varying the value from 70% to 130% and incorporating more finely grained intervals (10%). The discount rate of effectiveness and cost were varied from 0% to 6%. The screening interval varied from 3 to 7 y. The gradient in average lifetime cost and the QALYs across the range of values used for the DSA for no screening, DXA, CT2S, and treat all were estimated for each input parameter.
Model Input Parameter Data
Medication and efficacy
The bisphosphonate alendronate is the most frequently prescribed medication for osteoporosis in the Netherlands, 56 weekly oral alendronate has been adopted in our base-case analysis, together with the combination of vitamin D, calcium, and lifestyle recommendation. 57 A once-yearly intravenous infusion of zoledronic acid has become a popular alternative to oral alendronate for treating osteoporosis in postmenopausal women. This treatment has better adherence and similar effectiveness in reducing the risks of different types of fractures.26,58 Therefore, as part of our scenario analyses, annual zoledronic acid treatment was also considered. The relative risk (RR) of osteoporotic fracture with alendronate and zoledronic acid treatment are presented in Table 1.25,26,59
Treatment duration and adherence
Based on the NHS guidance in the United Kingdom, it may take 6 to 12 mo for alendronate to be fully effective in terms of bone protection. 60 Since the cycle length of our simulation was 1 y, we assumed no treatment efficacy if the treatment was discontinued within the first year. Several studies have recommended applying a “drug holiday” after 5 y of continuous alendronate treatment, which is in accordance with evidence to show that the residual effect of the alendronate will be sustained for up to 5 y.61,62 Therefore, we assumed that the maximum duration of continuous alendronate prescription was 5 y with a residual effect that declined linearly to 0 over a 5-y period after treatment was discontinued. 63 Two studies of alendronate treatment in postmenopausal women with osteoporosis showed that 69% of people remained on treatment after 1 y, 22 whereas 18.2% received the full 5 y of treatment. 23 We assumed the percentage of people staying on treatment declined in a linear manner and calculated the discontinuation rate of alendronate treatment from years 1 to 5.
A study of the effect of zoledronic acid treatment showed that there was no significant difference in treatment efficacy between patients with 6 versus 9 y of treatment. 64 Since the residual effect of zoledronic acid could be up to 3 y, 58 we assumed that the maximum duration of continuous alendronate treatment was 6 y and that the residual effect of zoledronic acid would decline in a linear manner to 0 over a 3-y period after treatment was discontinued. Only 1 study of zoledronic acid reported the yearly percentage of patient stay on treatment from years 1 to 4. 24 We made the assumption that the proportion of patients who remained on the treatment would decrease in a linear fashion for years 5 and 6. For both alendronate and zoledronic acid treatment, retreatment was modeled to be provided if a fracture occurred or the rescreening result was positive. Treatment was also provided to women who initially screened negative but subsequently screened positive.
Osteoporosis prevalence and incidence
Osteoporosis-related studies revealed that the prevalence of osteoporosis in the Netherlands is 1.9 per 1,000 for men and 16.1 per 1,000 for women and increases with age.57,65 However, specific osteoporosis prevalence data for elderly women across age groups were not available for the Dutch population. Therefore, data from a Swedish population study was used. 27 Since the current study is focused on Dutch postmenopausal women referred to the FLS, the Swedish data were supplemented with the reported osteoporosis prevalence of the people referred to the FLS in the Netherlands. 28 Due to the fact that the osteoporosis prevalence of the 5 FLSs study was not reported by sex, the model’s osteoporosis prevalence estimate (40.7%) was obtained for women within the 70s age group from the FLS with the highest proportion of female persons (88.2%), which also had an average age of 69 y. 28 Since data from the Netherlands FLS lacked detailed osteoporosis prevalence for the 60s and 80s age group for women, the increment in the osteoporosis prevalence of the general population was used, and the osteoporosis prevalence was estimated to be 25.7% and 55.2% for each of the age groups, respectively (Table 1). The incidence of osteoporosis based on the difference of osteoporosis prevalence plus the mortality rate for the specific age group was estimated. 66 We considered the differences in the prevalence of osteoporosis as 5-y cumulative osteoporosis incidence to calculate the 1-y incidence rate and transition probability of developing osteoporosis. 29 Detailed calculations of osteoporosis transition probability are elaborated upon in the Appendix (section “Transition Probability of Developing Osteoporosis”).
Fracture rates
The age-specific annual hip, vertebral, and wrist fracture rates were obtained for the general population from the Dutch-specific osteoporosis reports by the International Osteoporosis Foundation. 67 We calculated the annual osteoporotic fracture incidence rate using the annual incident rate of fracture for the general population times Melton’s osteoporosis attributed rates 33 and then divided the result by the prevalence of osteoporosis for that age band. Subsequently, the annual incidence rate was translated to transition probability using equation 2 shown in the Appendix. 29 Since all people in the FLS had a previous history of fracture, the fracture rates for people experiencing “first fracture” in our simulation were adjusted by multiplying the RRs of any secondary fracture for people with a previous fracture history (1.86 according to Kanis’s study). 32 Simulated people in our model could suffer more than 1 fracture, and the risk of sustaining a subsequent fracture was therefore set higher than the initial fracture risk. The RRs of secondary hip, vertebral, and wrist fractures were therefore obtained as 2.9, 2.52, and 1.69, respectively.30,31 The corresponding transition probabilities of fractures per cycle were calculated based on the annual fracture rates. 29 Detailed age-specific annual fracture rates and the calculations of the osteoporotic fracture probabilities are elaborated in the Appendix (section “Fracture Rates for Dutch General Population and Osteoporotic Fracture Rates Calculations”).
Mortality rate
Consistent with the previous economic evaluations of secondary fracture prevention intervention, 68 age-specific, general population mortality risks for Dutch women were derived from the Global Health Observatory data repository published by the World Health Organization as our baseline mortality rates. 36 Mortality risk of osteoporotic individuals were higher than the general population (RR = 1.19); this value has been applied to the patients having osteoporosis but without experiencing “first fracture” in our simulation. 37 The relative mortality risk of people with and without hip fractures within the general population is 2.87 and 1.78 for the first and subsequent years, respectively. 38 Several studies have shown that the excess mortality rate after vertebral fracture is similar to that of hip fracture.40,69,70 Therefore, in the model, the relative mortality risk after vertebral and hip fracture was assumed to be the same. Excess mortality rate after wrist fracture was estimated to be 1.43. 39 Since comorbidities could also be a contributing factor to excess mortality, the proportion of excess mortality following fractures attributable to the fractures themselves was assumed to be 25%40,41; therefore, the excess mortality of hip, vertebral, and wrist fractures has been adjusted accordingly to avoid overcounting. Besides, consistent with the previous economic evaluations of osteoporosis intervention, 71 the model considers only the additional mortality caused by hip or vertebral fractures in patients who have experienced both nonhip/nonvertebral fractures and hip/vertebral fractures. In the case of patients who have multiple hip or vertebral fractures, or both types of fractures, only 1 excess mortality value was included in the model.
Costs
The following cost items were considered in our model: direct cost of fractures, annual medication cost, screening cost, and nursing home costs after hip fracture. In the first year postfracture, the direct costs associated with hip, vertebral, and wrist fracture in the Netherlands were reported to be €21,477, €2,740, and €1,573, respectively.42–44 A study revealed a substantial rise in the use of generic alendronate among the Dutch population since its availability in 2005, with up to 62% of patients opting for this medication by 2011. 72 Furthermore, the Dutch health insurance companies tend to reimburse the more affordable generic form of the drug. Therefore, our study incorporated the annual cost of generic alendronate in the Netherlands. 45 When a hip fracture occurred, patients either entered into a nursing home or the community for recovery, and the corresponding transition probabilities were applied.34,35 The annual cost of nursing home residence (€63,549) was calculated using the subtraction of the total cost of a person with a hip fracture confined to a nursing home and the first-year direct cost of hip fracture. 46 The DXA scan price in the Netherlands is €96.74. 43 The cost of the CT2S screening was calculated with the following components: CT scan cost, computing cost, storage cost of raw data, licence cost, and personnel cost. The average cost of CT2S screening was estimated to be €274 (Table 3, Appendix). All costs were adjusted for the inflation rate in the Netherlands to the 2022 value. 73 We assumed everybody received sufficient calcium and vitamin D, so the costs of supplements were not included in the model.
Health-state utility values
The health utilities values were obtained from a previous application of the EQ-5D multidimensional health index to a sample from the general population in the Netherlands, responses on the 5 domains of the index were mapped to utility values using the European VAS value set scoring algorithm. 49 The baseline health-state utility values for patients with prior fractures were estimated by multiplying the health-state utility of the general population by 0.85, a health utility multiplier for subsequent years (>2 y) after a prior vertebral fracture. This is consistent with the previous economic evaluations of fracture liaison services patients. 68 Since fractures and postfracture recovery will be associated with a decline in quality of life, we applied a health utility multiplier of the subsequent years after fractures in our simulation. 49 In addition, the health utility of 0.4 was used for those people entering nursing homes after a hip fracture. 48
Results
Internal and External Validation Results
For internal validation, the modeled and observed prevalence of osteoporosis were compared. The squared correlation coefficient (R 2) was 0.998. For external validation, the R 2 was jointly calculated for life expectancy and fracture risk and the value was 0.988. The collective results are shown in Tables 1 and 2 in the Supplementary Appendix.
Base-Case Analysis
Grand averages of lifetime costs and QALYs, as well as cost-utility analyses results, are presented in Table 2. In the base-case analysis, the screening interval was set to 5 y, with discount rates of 4.0% for costs and 1.5% for effectiveness (according to the Dutch guideline for economic evaluations in health care). The treatment strategy involved weekly oral alendronate treatment. The ICERs of CT2S versus DXA for the age groups of 60s and 70s were €41,200 and €14,083, respectively, per QALY gained. For the 80s age group, CT2S was dominant (an increase in QALYs at lower lifetime cost) compared with DXA screening. The ICERs of CT2S versus treat all strategy for the age groups of 60s and 80s were €1,727 and €1,583, respectively. For the 70s age group, CT2S was dominant compared with treat all. In all simulated populations, CT2S was cost-saving (more effective and less costly) compared with the no screening scenario. According to the cost-effectiveness in practice published by the Dutch National Health Care Institute, the WTP threshold in the Netherlands varies from €20,000 to €80,000. 54 In this study, we conservatively used €20,000 to evaluate the cost-effectiveness of the screening strategies across all populations and scenarios. The results indicated that the ICER of CT2S versus DXA for the 70s and 80s age groups are less than the lower-bound ICER threshold (€20,000) in the Netherlands. It is worth mentioning that the QALY outcomes of the treat all strategy without screening are always lower than those with screening. Treatment adherence is considered in our model, and the lifetime horizon simulation also simulates the progression of patients developing osteoporosis. Even though the treat all strategy treats all patients referred to the FLS at the beginning, the rescreening process can identify those patients who developed osteoporosis after treatment discontinuation. The targeted treatment will affect the final outcomes of QALY and cost for each individual patient referred to the FLS. However, we observed that the gap in QALY outcomes between the treat all strategy and CT2S or DXA screening diminishes as the simulation group ages. This is largely due to the increase in osteoporosis prevalence in the older age group and shorter simulated lifetime.
Lifetime Health Care Costs, QALYs, Averaged across Simulation Iterations and Incremental Cost-Effectiveness Ratio of CT2S Compared with DXA, Treat All, and No Screening for the Various Age Groups under the Base-Case Scenario: Weekly Oral Alendronate Treatment, Screening Interval = 5 y, Discount Rates Are 4.0% and 1.5% for Costs and Effectiveness a
CT2S, Computed Tomography To Strength; DXA, dual-energy X-ray absorptiometry; ICER, incremental cost-effectiveness ratio; QALY, quality-adjusted life-year.
Dominant = CT2S more QALYs, lower costs than DXA or treat all. Cost-saving = CT2S more QALY and lower costs than no screening.
Using the lower bound of the willingness-to-pay threshold in the Netherlands (€20,000), these values are considered cost-effective for CT2S compared against DXA or treat all.
One-Way Sensitivity Analysis
Table 3 reports the result of 1-way sensitivity analysis in women in their 70s using the base-case scenario: with a screening interval of 5 y, discount rates at 4.0% and 1.5% for costs, and effectiveness with weekly oral alendronate treatment. In all the sensitivity analyses, CT2S was cost-saving compared with the no screening scenario. Furthermore, CT2S dominated the treat all strategy in most of the sensitivity analyses, except when CT2S sensitivity was at −30%. In that case, the ICER of CT2S versus treat all was €3,984.
One-Way Sensitivity Analyses on the Incremental Cost-Effectiveness Ratio of CT2S versus No Screening, versus DXA, and CT2S versus Treat All for the 70s Age Group under the Base-Case Scenario: with Weekly Oral Alendronate Treatment and 5-y Screening Interval, Discount Rates of 4.0% and 1.5% for Costs and Effectiveness a
CT2S, Computed Tomography To Strength; DXA, dual-energy X-ray absorptiometry; ICER, incremental cost-effectiveness ratio.
Dominant = CT2S more QALYs, lower costs than DXA or treat all. Dominated by DXA = CT2S lower QALYs, more costs than DXA. Cost-saving = CT2S more QALY and lower costs than no screening.
Note that the absolute value of sensitivity exceeded 1 when the CT2S screening sensitivity was increased by 30%.
Note that the absolute value of specificity exceeded 1 when the CT2S screening specificity was increased by 30%.
When comparing CT2S with DXA, the CT2S sensitivity had a marked impact on the ICER outcome. When CT2S sensitivity increased by 10%, the ICER of CT2S versus DXA was €2,870. When CT2S sensitivity increased by 20% and 30%, CT2S dominated DXA. On the contrary, when CT2S sensitivity decreased by 10%, the ICER of CT2S versus DXA was €24,751. When CT2S sensitivity decreased by 20% and 30%, DXA dominated CT2S. This is expected as the sensitivity of CT2S decreases (by 20% or more), its ability to identify osteoporosis patients reduces, making it less superior compared with DXA. Similarly, when DXA sensitivity decreased by 20%, the CT2S method became dominant compared with DXA. Conversely, when the sensitivity of DXA increased by 20%, the ICER of CT2S versus DXA increased to €112,711.
The results were also strongly affected by other parameters such as screening interval and the cost of CT2S. The ICER of CT2S compared with DXA more than doubled when the screening interval was equal to 3 y. The ICER of CT2S versus DXA decreased to €2,222 when the CT2S cost decreased by 20%. Moreover, the ICER of CT2S versus DXA increased to €27,109 if the medication efficacy decreased by 20%. The decrease in medication persistence resulted in lower QALY and higher cost and changed the ICER of CT2S versus DXA to €24,029. Cost per QALY increased to €30,077 when the discount rate increased to 6% and decreased to €4,586 when the discount rate decreased to 0%, respectively. In addition, when the specificity of CT2S varied from −30% to +30%, the ICER continued to increase.
Alternative Scenario Analyses
As an extension to the base-case scenario, we extended our analysis to also consider 1) an alternative discount rate (3% rate for benefits as well as costs used in other countries), 2) alternative treatment of once-yearly intravenous infusion of zoledronic acid, and 3) an alternative screening interval of 2 y in the scenario analyses. The results for these alternative scenarios were presented in Table 4.
ICER of CT2S Screening Compared with DXA, Treat All, and No Screening with 3 Alternative Scenarios: Discount Rate = 3% for Costs as Well as QALYs, Annual Zoledronic Acid Treatment, and 2-y Screening Interval Scenarios a
CT2S, Computed Tomography To Strength; DXA, dual-energy X-ray absorptiometry; ICER, incremental cost-effectiveness ratio; QALY, quality-adjusted life-year.
For each alternative scenario, the other parameters were kept the same as the base-case scenario. Dominant = CT2S more QALYs, lower costs than DXA or treat all. Cost-saving = CT2S more QALY and lower costs than no screening.
Using the lower bound of the willingness-to-pay threshold in the Netherlands (€20,00), these values are considered cost-effective for CT2S compared against DXA, treat all, or no screening.
In the 3% discount rate scenario (screening interval = 5 y, weekly oral alendronate treatment), the ICER of CT2S versus DXA in the 60s age group was €53,875, which exceeded the WTP threshold of the Netherlands (€20,000). For the 70s age group, the ICER of CT2S versus DXA was €15,300 per QALY gained. CT2S dominated DXA screening in the 80s age group. In all simulated populations, CT2S dominated the treat all scenario and was cost-saving compared with no screening. It is worth mentioning that the treat all strategy dominated DXA screening in the 80s age group (an increase in QALYs at lower lifetime cost as shown in the table).
In the scenario with annual zoledronic acid treatment (screening interval = 5 y, discount rates are 4.0% and 1.5% for costs and effectiveness), the ICERs of CT2S versus DXA for the age groups of 60s and 70s were €20,667 and 6,000, respectively, which were substantially lower than those in the base-case analyses. However, the ICER for the 60s age groups was still higher than the WTP threshold of the Netherlands. CT2S was dominant compared with DXA screening in the 80s age group. In all simulated populations, CT2S again dominated the treat all scenario and was cost-saving compared with no screening.
In the scenario with a 2-y screening interval (weekly oral alendronate treatment, discount rates are 4.0% and 1.5% for costs and effectiveness), similar to the base-case analysis, CT2S was cost-saving compared with the no screening scenario in age groups 70s and 80s. The QALYs for both screening strategies were increased, and the ICER of CT2S compared with DXA also increased substantially compared with the 5-y interval. The ICER of CT2S compared with DXA was higher than the ICER threshold in the Netherlands for the age groups 60s and 70s (€100,333 and 55,571, respectively). For the 80s age group, the ICER of CT2S versus DXA was €15,750, which is different from that of the base-case scenario in a way that CT2S no longer dominated DXA. CT2S dominated the treat all scenario in the 70s and 80s age groups. For the 60s age group, the ICER of CT2S versus treat all was €8,879.
Probabilistic Sensitivity Analyses
Figure 3 shows the CEACs and CEAFs for DXA, CT2S, no screening, and treat all with 2 age groups (70s and 80s) and 2 treatment strategies (weekly oral alendronate and annual zoledronic acid treatment). The analysis considered a screening interval of 5 y, with discount rates of 4.0% and 1.5% for costs and effectiveness (used in the base case). CEAFs are highlighted as a bold dashed line in dark gray. For women in their 70s, at the threshold of €20,000 per QALY gained, CT2S screening was the most cost-effective strategy in 45% of the simulations with weekly oral alendronate treatment, compared with 41% for DXA, 13% for treat all, and 1% for no screening. When the patient received annual zoledronic acid treatment, at the same WTP threshold, CT2S was the most cost-effective strategy in 56% of the simulations, followed by DXA (37%), treat all (7%), and no screening (close to 0%). For women in their 80s with weekly oral alendronate treatment, at the threshold of €20,000, CT2S was cost-effective in 43% of the simulations compared with 36% for treat all, 21% for DXA, and 0% for no screening. CT2S was the most cost-effective screening strategy at any WTP threshold when patient received annual zoledronic acid treatment for the 80s age groups.

Cost-effectiveness acceptability curve and cost-effectiveness acceptability frontier for the 70s age group with weekly oral alendronate treatment (a) and zoledronic acid treatment (b). Cost-effectiveness acceptability curve for the 80s age group with weekly oral alendronate treatment (c) and zoledronic acid treatment (d). The other parameters are the same as the base case: screening interval = 5 y, discount rates are 4.0% and 1.5% for costs and effectiveness.
Discussion
In this study, we applied early HTA to evaluate the cost-effectiveness of CT2S, DXA, treat all, and no screening strategies for osteoporosis screening and treatment in the Dutch postmenopausal women referred to FLS. These strategies were evaluated across various age groups, screening intervals, treatments, and discount rate scenarios. The aim of this research was to investigate the potential extra benefit, in terms of QALYs, and extra cost of a novel screening method, CT2S, in order to evaluate its potential economic attractiveness if adopted broadly among the FLS population in the Netherlands. To our knowledge, this was the first cost-effectiveness study for a fracture risk assessment tool using CT-based finite element model in osteoporosis screening. The findings suggest that compared with the current standard clinical approach DXA screening, CT2S has the potential to be cost-effective in women referred to Dutch FLS, especially for those within the 70s and 80s age groups. This finding of cost-effectiveness was also observed with a shorter osteoporosis screening interval of 2 y and an alternative treatment of annual zoledronic acid treatment with better adherence.
Our analysis has several strengths. The model structure allowed for realistic simulation of patients’ osteoporosis case trajectories and inherently accounted for the competing risk of death. We conducted internal and external validation and showed that modeled validation output, prevalence of osteoporosis by age, lifetime risk of fracture, and undiscounted, un–quality-adjusted life expectancy matched observed data quite well. In recent years, several cost-effectiveness studies have been conducted for osteoporosis screening (DXA and ultrasound) and treatment strategies.66,74–77 However, compared to Markov cohort models, we adopted a 2-dimensional, discrete-state simulation methodology that offered 2 main benefits: first, the large number of total iterations (parameter-level number multiplied by individual-level number) produced stable grand averaged lifetime costs and QALYs, on which we performed our CUAs. Second, the 2-dimensional structure allowed us to assess the effect on the model output of both the uncertainty surrounding input parameter point estimates and patient-level stochasticity, which was expressed by CEACs.
To our knowledge, this was the first study in which DXA was not treated as the gold standard for osteoporosis screening (100% accurate) since CT2S has been shown to have better discrimination between women with and without fractures. The current study addresses some of the limitations of the previous work. For example, the rescreening process was not considered in Mueller and Kingkaew’s model,74,76 and Li’s model only considered rescreening with a 5-y screening interval. 66 Our study considered both 5- and 2-y rescreening scenarios. In addition, Li’s model applied rescreening only to people with previously negative results. However, the study by White VanGompel et al. 19 showed that approximately 60% high-risk or osteoporotic women were rescreened within 5 y. Therefore, it was useful to simulate rescreening for all persons within FLS in the current study.
The classification accuracy of DXA and CT2S in identifying osteoporotic hip fractures has been used as the approximation of the osteoporosis screening accuracy in this study. Since osteoporotic fracture is one of the serious disease consequences of osteoporosis, and osteoporosis cannot be reversed without medication, it was reasonable to assume that the predictive accuracy cannot exceed the stratification accuracy of osteoporotic fractures for the same osteoporosis screening tool. If we treat the “identification of osteoporosis” was considered as finding those with a high risk of getting an osteoporotic fracture in the future, then we could treat the accuracy of the “identification of osteoporosis” as the predictive accuracy of getting an osteoporotic fracture in the coming 10 or 20 y. Therefore, if we use the stratification accuracy of osteoporotic fractures as an approximation to the identification of osteoporosis and prove it to be cost-effective, we could elucidate that CT2S has potential advantages in the identification of osteoporosis compared with DXA.
The performance of CT2S, particularly its ability to accurately identify patients with osteoporosis (sensitivity), has a substantial impact on the ICER outcome, as indicated by the results of our 1-way sensitivity analyses. Previous studies in the group have validated the CT2S approach using cadaver bones, which resulted in a 7% to 15% standard error for failure strength and strain estimation when compared against experimental results. 6 Part of this was attributed to the inter- and intraobserver error during the femur segmentation process, which was quantified to be within 5%. The ARF0 (CT2S with added subject-specific fall dynamics and hip impact mechanics) was also verified with respect to all numerical approximations and achieved an overall error tolerance of 1.92 percentage points with an uncertainty of 4 percentage points due to model inputs. 6 The CT2S input data used in this study have been carefully considered by taking results derived from the most appropriate boundary conditions to minimize potential error propagation. 7 The range of values considered in the 1-way sensitivity analysis here therefore represents the absolute worst-case scenario when CT2S screening sensitivity was reduced by 20% and 30%, which are higher than the error range reported in previous sensitivity analyses. Future uncertainty quantification of CT2S input parameters will provide a more comprehensive understanding of its performance and enhance the robustness of cost-effectiveness analyses. Such studies could also be extended to include the application toward predicting other types of osteoporotic fractures such as vertebral and wrist fractures. As early HTA analyses do not yield definitive recommendations for the development or adoption of innovations, their purpose is to contribute to ongoing discussions regarding investments, financial accountability, and reimbursement policies. 12 In this context, our observations can contribute to the discourse surrounding the development, implementation, and reimbursement of CT2S as a potential novel osteoporosis screening tool for secondary fracture prevention.
In the current study, fixed screening intervals were adopted for both DXA and CT2S screening (5 and 2 y). In clinical practice, the screening interval depends on the presence of risk factors and age.78,79 However, our screening interval scenarios would be expected to encompass the likely range of screening intervals that might be employed in practice. In addition, we assumed all people within the FLS were willing to cooperate and perform rescreening in a fixed screening interval. This assumption is reasonable given that women chose to attend FLS at the onset but could limit the generalization of the CUA results to osteoporosis screening in non-FLS clinics.
The current model considered 3 main fractures (hip, vertebral and wrist) in our model. Other types of fractures were not considered due to the lack of relevant clinical data. This might lead to the underestimation of the potential lifetime cost savings and increase in quality-adjusted life expectancy from the prevention of other types of osteoporotic fracture. However, considering other types of fractures would serve to lower ICER values, which would strengthen the current findings of cost-effectiveness. Since the CT2S service to date has been used only in clinical research and has not been commercialized, there was no profit margin added to the cost of CT2S. However, our estimation of CT2S screening cost is conservative and involved setting the mean of a gamma distribution from which a cost was selected for each parameter-level iteration. These distributions covered a range of possible CTS costs. Furthermore, the automation of the CT2S image preprocessing step using a open-source convolutional neural network (such as U-Net) may further reduce the cost.
This study has several other limitations. Since the probability of nursing home admission after hip fracture in the Dutch population is available only for the 80s age group, 35 the admission probabilities for age groups 60s and 70s were obtained from a Norway study. 34 In addition, while the prevalence of osteoporosis among women is nearly 10 times higher than that among men in the Netherlands, 57 it would be highly valuable to also investigate the cost-effectiveness of CT2S screening for men. Moreover, since the health utilities, osteoporosis prevalence, mortality rate, and annual fracture rates for the Dutch population are not reported with confidence intervals or standard deviations, distributions were created for these parameters by assuming a standard deviation of 20% of the mean to account for the parameter uncertainty. These data can be updated when relevant information becomes available in the future. The current model assumes that treatment recommendations based on DXA screening rely solely on the T-score, whereas other factors such as smoking, glucocorticoid use, and alcohol intake can be combined with the T-score in other fracture risk predictors such as the FRAX score. 80 Unfortunately, it was not possible to include the FRAX model in this study due to a lack of required clinical data.
According to statistical data from 2020 and the report by the International Osteoporosis Foundation, there were 618 hospitals in the Netherlands in 2020, with more than 50% of these hospitals having FLS programs.81,82 However, at the national level, there are only 256 CT scanners available. 83 This limitation may affect the future adoption of CT2S in clinical settings. Furthermore, in terms of the radiation effects of CT scans on patients, according to the study by Viceconti et al., 4 the CT2S service recommends a whole femur CT scan protocol that results in an effective radiation dose ranging from 1.3 to 3.2 mSv for females. However, a recent study also suggested that even more aggressive reductions can be implemented without compromising the predictive accuracy. 84 The 2018 study also demonstrated that the reduction in the risk of death due to complications related to hip fractures through CT2S is nearly 10 times greater than the additional risk of death resulting from the additional radiation exposure for elderly women. 4 Therefore, the radiation effects were not incorporated into the health utility of the patient in our model. Finally, this study focuses on the cost-effectiveness of CT2S osteoporosis screening in Dutch postmenopausal women and may not be directly applicable to other countries and health care systems without recalibrating some of the input parameters.
In conclusion, this study demonstrated how early HTA may be applied for a potential novel osteoporosis-screening tool for secondary fracture prevention in the Netherlands. The findings suggest that CT2S has the potential to be cost-effective in women referred to Dutch FLS at ages 70 and 80, compared with DXA screening, as the ICER fell below the lower-bound ICER threshold in the Netherlands. The finding of cost-effectiveness was also observed with a shorter osteoporosis-screening interval of 2 y and alternative treatment (annual zoledronic acid treatment) with better adherence. These findings suggest that CT2S could be used as a potential osteoporosis-screening tool in the clinical setting in the Netherlands. Perhaps more importantly, it raises the prospect of how simulation applications enabled by high-performance computing might contribute to enhanced health care quality and potential cost savings.
Supplemental Material
sj-docx-1-mpp-10.1177_23814683231202993 – Supplemental material for Cost-Effectiveness Analysis of CT-Based Finite Element Modeling for Osteoporosis Screening in Secondary Fracture Prevention: An Early Health Technology Assessment in the Netherlands
Supplemental material, sj-docx-1-mpp-10.1177_23814683231202993 for Cost-Effectiveness Analysis of CT-Based Finite Element Modeling for Osteoporosis Screening in Secondary Fracture Prevention: An Early Health Technology Assessment in the Netherlands by Jieyi Li, Marco Viceconti, Xinshan Li, Pinaki Bhattacharya, David M. J. Naimark and Anwar Osseyran in MDM Policy & Practice
Footnotes
Acknowledgements
We also would like to thank Antonino Amedeo La Mattina, PhD, who provided the results of the screening performance experiment; Marco Verdicchio, PhD, who provided support for the screening cost calculation; Axel Berg, PhD, who provided general support and critical reading of this manuscript; and Rob Goedhart, PhD, who provided critical reading of this manuscript.
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was also partially supported by the European Commission H2020 program through the CompBioMed Centre of Excellence (grant No. 823712).
Data Availability
The data sets used and/or analyzed during the current study are available from the corresponding author on reasonable request. The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Financial support for this study was provided in part by Atos through the HPC, AI and Quantum Life Sciences Centre of Excellence (CEPP); as well as by SURF, the collaborative organisation for IT in Dutch education and research. The funding agreement ensured the authors’ independence in designing the study, interpreting the data, writing, and publishing the report.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
