Abstract
Background
Periprosthetic bone loss is considered to be a potentially contributing factor in aseptic loosening of acetabular hip components, but no studies have shown this association. The lack of association might be caused by insufficient image quality because of metal artifacts and challenges in measuring bone density (BMD) in complex anatomic structures which might be overcome using dual-energy computed tomography (DECT).
Purpose
To test inter- and intra-observer agreement and reliability of in-house segmentation software measuring BMD adjacent to acetabular cup and to compare measurements performed with single-energy CT (SECT) and DECT in cemented and cementless cups.
Material and Methods
Twenty-four acetabular cups inserted in porcine hip specimens were scanned with SECT and DECT. Bone density was measured in a three-dimensional volume adjacent to the cup. Double measurements were performed.
Results
BMD derived from SECT was approximately four times higher than that of DECT. In both scan modes, intraclass correlation coefficient (ICC) was >0.90 with no differences between repeated measurements, except for uncemented cups where a statistically significant difference of 11 mg/cm3 was found with DECT. DECT showed narrower limits of agreement than SECT. Inter-observer analysis showed small differences.
Conclusion
BMD can be estimated with high intra- and inter-observer reliability with SECT and DECT around acetabular cups using custom software. The intra- and inter-observer agreement of DECT is superior to that of SECT and better in the cementless concept. Good intra- and inter-observer reliability can be obtained in both cemented and cementless cups using the segmentation software. SECT and DECT cannot be used interchangeably.
Keywords
Introduction
Aseptic loosening of orthopedic implants is the most common cause of failure after total hip arthroplasty (THA) (1,2) and periprosthetic bone loss is considered a predictor of aseptic loosening (3,4). To our knowledge, no studies have shown this association, but a recent study found osteoporotic patients with low systemic bone mineral density (BMD) to be at higher risk of cup migration compared with patients with normal BMD (5). Bone loss may occur around THA because of stress shielding and may be located around both the cup and stem (4,6). Most attention has been paid to the femoral side (4), but cementless press-fitted cups loading the acetabular rim may also cause stress shielding of the central iliac bone (7). Bone loss occurs in all compartments adjacent to the cup (8,9), but in cementless THA it occurs most often in the supra-acetabular ileum (10) and often also in the medial wall in both cemented and cementless cups (11). In cemented cup systems, bone loss seems to be less substantial (12). Bone loss must be considerable, i.e. 30–50%, before it can be detected on radiographs (13) and furthermore the sensitivity of radiographs is low which might explain the lack of correlation between bone loss and loosening of the prosthesis.
Dual X-ray absorptiometry (DXA) is widely used for bone density measurements in osteoporosis, but it is of limited value in the presence of metal because it acquires two-dimensional (2D) images only and it cannot obtain reliable measurements in the presence of bone cement (14). Bone density changes adjacent to acetabular hip implants have been studied with computed tomography (CT) (12,15–18), but CT is challenged in visualizing the interface between bone and implant due to metal artifacts (14), thus none of the studies have focused solely on the interface. Dual-energy CT (DECT) can theoretically remove or reduce beam hardening artifacts (19–21) and has shown promising results in bone density evaluation adjacent to spinal implants (22,23) and better delineation of the acetabular bone-implant interface in THA (24). In general, measuring BMD using CT in the acetabular region is also difficult because of the complex three-dimensional (3D) anatomic structure of the bone.
The primary objective of the present study was to test the intra- and inter-observer agreement and reliability of an in-house 3D segmentation software solution that measures BMD in close proximity to the acetabular cup with the use of volumetric CT images. The secondary objective was to compare BMD measurements performed with single-energy CT (SECT) and DECT in cemented and cementless prosthetic acetabular cups. If measurements can be performed with sufficiently high intra- and inter-observer reliability, the method might contribute to further research in bone loss and potentially impact the choice of acetabular cup in future patients.
Material and Methods
Twenty-four fresh frozen female porcine hemi-pelvic specimens were included. The animals were skeletally mature sows with closed triradiate cartilage. The exact age of the animals was unknown, but usually they are slaughtered when they reach 2–4 years. One orthopedic surgeon with more than ten years of experience in hip replacement surgery inserted 12 cementless and 12 cemented cups in the specimens. The specimens were stored at −25℃ between surgery and the imaging procedure.
Surgical procedure
The specimens were partly thawed at room temperature for approximately 3 h prior to surgery to ensure that surgery was practically feasible yet avoiding the possibility of decomposition of the specimens. Surgery was performed with Stryker components and cement (Stryker Corp., Kalamazoo, MI, USA). In the cementless concept 48-mm Trident® hemispherical acetabular shells were used in combination with Trident® X3® polyethylene liners. The acetabulum was reamed line-to-line or under-reamed according to the surgeon’s judgment. In the cemented concept, we used 48-mm Contemporary® Hooded cups. The acetabulum was over-reamed by 2 mm according to the manufacturer’s recommendations. Antibiotic Simplex® Bone Cement was used. In both concepts, an Exeter stem was placed in the cup for SECT and DECT examination. In the cementless concept, a 32-mm steel head was used and in the cemented concept a 28-mm head was used. The divergence was due to the relatively small porcine acetabulum, i.e. approximately 50 mm. As the cemented concept demands over-reaming by 2 mm, 48-mm cups were used. However, this cup size can only contain a 28-mm head. The stem was not fitted in the femoral bone because the femoral side was beyond the scope of the current study.
CT scanning
Acquisition parameters for SECT and DECT.
ASIR, adaptive statistical iterative reconstruction; CTDIvol, volumetric CT dose index.

Example of an axial DECT slice with the hemispherical volume defined by the pixels included in the intersection between the circles (red) and the borders of the free-hand drawn area (blue). Underneath the specimen, five ROIs (yellow) are positioned in the calibration phantom rods.
Quantitative image analysis
We used a custom in-house 3D Fiji plugin to perform the image analysis. Fiji is a platform for biological image analysis based on ImageJ (27). Using the plugin we could draw free-hand ROIs slice by slice and position circular ROIs in the phantom calibration rods (Fig. 1). The plugin stacked the ROIs and segmented a cup-shaped 3D ROI based on a user-defined diameter and thickness, and the ROI was divided into four quadrants (Fig. 2). In this study, a combination of 50-mm diameter and 5-mm thickness was used. The plugin created a comma-separated file with the pixel values of the ROIs. A Python script (Python Software Foundation, Beaverton, OR, US) was then used to calculate the mean pixel value of each ROI using a lower pixel value threshold of –300 Hounsfield units to exclude air deposits. Image analysis was performed by a radiographer with more than 20 years of experience in CT and image analysis (BM). In the cementless concept, the analysis was repeated by a second observer (KK) with no radiologic experience in order to assess inter-observer agreement and reliability.
Example of a ROI divided into quadrants superimposed onto axial and coronal slices in SECT (left) and DECT (right). Beam hardening artifacts are present in both scan modes, but appear slightly more prominent in SECT (arrow).
Statistical analysis
According to Kress et al. (16), we used a minimal relevant difference of 5%. Based on a sample size calculation using an estimated SD of 15, a difference of 15 mg/cm3 (i.e. 5%) could be detected by the inclusion of ten specimens in each group. All variables were continuous and summarized by mean, number of observations, and 95% confidence intervals (CI). The differences between repeated BMD measurements were estimated with mixed effects regression modeling, using measurement as fixed effect. The absolute agreement between the methods and between the observers was analyzed with Bland–Altman plots (28), including mean difference and limits of agreement. The intra- and inter-observer reliability was assessed by intraclass correlation coefficients (ICC) based on two-way random effects models (29,30). The statistical significance of the difference between SECT- and DECT-based BMD measurements was assessed by comparing the 95% CIs. Repeatability coefficients (RC) were calculated for the repeated measurements according to Bartlett and Frost (30). RC is an estimate below which the absolute difference would be expected to lie with 95% certainty (30–32). To compare RC and the minimal clinically relevant difference, the RCs were converted to percentages by dividing RC with the mean BMD of measurement 1. All analyses were performed using STATA/SE 14.0 (StataCorp. LP, College Station, TX, USA). The study is reported in accordance with the Guidelines for Reporting Reliability and Agreement Studies (GRRAS) (33).
Results
Intra-observer analysis. Differences between repeated BMD measurements derived from SECT and DECT (units of mg K2HPO4/cm3).

Bland–Altman-plots of repeated acetabular density measurements in the cementless cup derived from SECT (left) and DECT (right). Horizontal lines indicate limits of agreement and the mean difference between the measurements. n = 10.

Bland–Altman-plots of repeated acetabular density measurements in the cemented cup derived from SECT (left) and DECT (right). Horizontal lines indicate limits of agreement and the mean difference between the measurements. n = 12.
Repeatability coefficients and corresponding 95% CIs in units of mg K2PO4/cm3 for repeated BMD measurements performed with SECT and DECT. The absolute difference between the measurements would be expected to lie below RC with 95% certainty.
Inter-observer analysis. Differences between average BMD measurements in cementless cups for observers 1 and 2 in SECT and DECT (units of mg K2HPO4/cm3). n = 10.

Box plot with medians, quartiles, and ranges comparing BMD in units of mg K2PO4/cm3 for two observers (BM and KK) in the cementless concept. n = 10. Dots are outliers.

Bland–Altman plots of density measurements from observers 1 and 2 derived from SECT (left) and DECT (right). Horizontal lines indicate limits of agreement and the mean difference between the observers. n = 10.
Discussion
In this experimental ex-vivo study with porcine specimens, we found that BMD estimates derived from SECT were approximately four times higher than those of DECT using segmentation software. The difference is supposedly caused by differences in the estimation of BMD, i.e. the pixel values in the low-density rods of the calibration phantom were comparable between SECT and DECT, while the pixel values at higher densities differed. Because the BMD estimation is basically a regression maneuver, the difference will cause SECT to result in higher BMD than DECT (34).
In the cemented concept, we found a statistically significant difference between the repeated measurements in DECT and a borderline significant difference in SECT. The difference is probably caused by the relatively low number of specimens. In all cases, the reliability expressed by ICC was good and no statistically significant difference was found between SECT and DECT.
In both scan modes, the RCs were larger than the minimal clinically relevant difference of 5%, i.e. 8% in the cementless concept and 11% in the cemented concept.
Measurements in the subdivided acetabular ROIs returned small differences between the repeated measurements and followed the same tendency between the scan modes. However, the results of this analysis must be considered explorative, because such sub-group analyses would supposedly require more specimens as reflected in the variability of the P values.
The study has some limitations. The statistically significant difference between the two observers may be caused by the difference in experience between the observers potentially leading to a consensus issue. The choice of observers was made to create a worst-case scenario in order not to overestimate the reliability and agreement of the method. Even in this worst-case scenario the absolute differences were small and most likely not clinically relevant, but experienced observers should perform all analyses in clinical studies. Further steps should be taken to ensure consensus if the method is to be used with multiple observers.
Because the study solely focused on agreement between repeated measurements and not accuracy of the BMD estimates, the external validity of the study is limited. The implants were inserted in dead porcine specimens. Thus, no biochemical reactions from the implants could occur and the cement did not infiltrate the bone tissue as much as it would do in living subjects. Furthermore, the animals were young compared with hip patients who are most often elderly with degenerative changes and lower BMD. The anatomy of porcine hips differs somewhat from human anatomy and even though the specimens were kept at low temperature there were small air deposits in the cancellous bone. We also considered positioning the specimens in a water tank to mimic the absorption of human tissue, but it was not possible to obtain stable positioning without movement during the scan procedure. Thus, more image noise would be expected in a patient compared to the experimental setup. However, the image quality of both scan modes would supposedly decrease similarly from this. Finally, the cemented cups could not contain stem heads larger than 28 mm compared with the commonly used 32-mm heads. This may in principle have caused fewer artifacts in the cemented concept.
In conclusion, the present study suggests that the intra-observer agreement of acetabular BMD measurements performed with DECT is better than that of SECT, while the intra-observer reliability is equally high for both scan modes and cup types. The inter-observer agreement was better in DECT while the inter-observer reliability was equally high in SECT and DECT. However, due to the difference in experience between the observers the results on inter-observer reliability and agreement must be interpreted with caution. Thus, DECT may be beneficial as a research tool for longitudinal studies of bone loss around acetabular cups. Further experimental in vivo studies are needed before the software can be used in longitudinal clinical studies of bone loss.
Footnotes
Acknowledgements
We would like to thank specialist radiographer Peter Traise, Bathurst, Australia who worked at our department for his assistance during the scans and data acquisition procedure.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
