Sage Journals: Discover world-class research

Abstract

Study Design

Retrospective observational study.

Objectives

The prediction of curve progression in patients with adolescent idiopathic scoliosis (AIS) remains an unresolved area in orthopedic surgery. To make a rapid meaningful prediction, easily accessible multi-dimensional data at the patient’s first consultation should be used. Current studies use clinical growth parameters and numerical values extracted from radiographs to compile a predictive model, leaving out the radiographs themselves. Such practice inevitably wastes a lot of information. Thus, this study aims to create a neural network that can predict AIS progression among patients with curves indicated for bracing by integrating both one-dimensional (1D) clinical and two-dimensional (2D) radiological data collected at the patient’s first visit in a fully automated manner.

Methods

A total of 513 idiopathic scoliosis patients indicated for and managed with bracing orthosis were recruited. After exclusion, 463 patients were included in deep learning analysis. Processed first-visit growth parameters and posteroanterior radiographs were used as training inputs, and the curve progression outcomes obtained at follow-up were used as binary training outputs. The CapsuleNet architecture was modified and trained accordingly to make a prediction.

Results

The final model achieved 90% sensitivity with an overall accuracy of 73.9% in the prediction of AIS in-brace curve progression by using first-visit multi-dimensional data, outperforming conventional convolutional neural networks.

Conclusions

This first-ever multidimensional-input model shows promise in serving as a screening tool for AIS in-brace curve progression. The incorporation of such a model into the routine AIS diagnostic pipeline can assist orthopedic clinicians in personalizing the most appropriate management for each patient.

Keywords

adolescent idiopathic scoliosis, prediction, screening, neural network, deep learning

Introduction

Scoliosis is a three-dimensional spinal deformity. AIS is the most common form, accounting for about 70% of all cases, and is usually diagnosed during puberty.^1-4 Throughout puberty, the rapid increase in linear growth may exacerbate the structural deformity, contributing to the progressive nature of AIS. A curved spine can detrimentally affect the patient’s daily activities, quality of life and even cardiopulmonary function.⁵ Treatment options depend on the severity of disease, ranging from observation alone to bracing and surgery. The choice is also highly influenced by the patient’s remaining growth potential; therefore, AIS warrants early diagnosis and appropriate, timely treatment to prevent progression. However, the etiology of AIS progression remains unclear owing to the lack of consistency between studies⁶; this is especially the case among patients indicated for and managed by bracing orthosis, where individuals already share certain characteristics such as large curves and skeletal immaturity at presentation. Therefore, only patients managed with bracing orthosis were recruited to participate in this study to examine the possibility of predicting curve progression within the bracing population.

Determining the risk of AIS progression is a complex task that should integrate both clinical and radiological components; however, most of the current curve progression prediction tools still rely on 1D numerical data alone and fail to make use of 2D radiographs which contain spatial information valuable to prediction accuracy. In addition to the relevant clinical parameters, such as sex, chronological age, date of menarche and height,^7-11 current AIS progression estimation focuses on various numerical or categorical parameters extracted from radiographs, including the Cobb angle for measuring the angle of spinal curvature,^12,13 skeletal maturity approximated particularly by the distal ulna and radius classification,^10,11,14 curve pattern^7,15 and supine flexibility.^15-18 Researchers in the field have created various composite predictive models using these 1D clinical data and extracted radiological parameters,^19-21 and one model even used circulating markers of AIS, such as microRNA-145-5p, achieving a sensitivity of 91.7% and specificity of 79.8%.²⁰ However, current prediction tools have a significant limitation in that they exclude radiographs from the prediction algorithms, sacrificing a large amount of subtle spatial information within the spine that could be invaluable to AIS progression prediction.

Recent advancements in computer vision and deep learning artificial intelligence (AI) algorithms have been employed for medical research in various specialties.²² For deep learning applied specifically in AIS progression prediction, there is one study that employed state-of-the-art deep learning techniques to predict curve progression using radiographs alone and showed comparable results to other composite models that use clinical parameters.²³ Another team performed three-dimensional (3D) reconstruction of the spine from bi-planar radiographs and used physics equations to create an impressive mechanistic neural network to characterize stress components along the 3D spine and added it to a bone growth model to predict the curve status of AIS patients at any given time-point.²⁴ These exciting recent studies show that current machine learning technology enables the utilization of raw radiographs and even their reconstructions to predict AIS curve progression.

Therefore, both 1D clinical parameters and 2D whole-spine radiographs should be used simultaneously in order to optimally utilize routinely collected data to make a better-informed prediction of AIS curve progression while minimizing costs. The objective of this study is to develop an AI neural network that can predict the probability of in-brace curve progression of AIS patients at their first visit to the orthopedic clinic by utilizing both routinely collected 2D radiological images and 1D clinical data. To the best of our knowledge, this is the first ever attempt in literature to directly utilize whole-spine radiographs, instead of using extracted parameters, combined with growth parameters to make a prediction for AIS progression. The results of this study should provide new insights into predicting in-brace curve progression risk within the bracing population, where progression prediction is more challenging. This paper is composed of several sections including the recruitment and inclusion of patients, the X-ray and bracing protocol and the processing of radiographs and clinical parameters for training, the neural network design, performance of the generated models, discussion of results, model design, limitations, and future directions.

Materials and Methods

Patient Recruitment and Demographics of the Study Population

We recruited a total of 513 patients from two tertiary scoliosis referral centers between December 2016 and November 2019. Written informed consent was obtained from all participants and their parents or legal guardians before examinations and measurements were performed. This study was approved by the Institutional Review Board of the University of Hong Kong/Hospital Authority Hong Kong West Cluster (IRB Reference Number: UW 15-596, 7 December 2016). Study inclusion criteria were (i) clinical diagnosis of AIS within the recruitment period, (ii) indicated for and managed by bracing orthosis upon diagnosis, (iii) Cobb angle ≥ 20°, and (iv) a minimum of one follow-up at least three months after the first visit. Consequently, 43 patients were excluded due to lack of follow-up, and 7 patients were excluded because they were younger than ten years old and were therefore diagnosed with juvenile idiopathic scoliosis instead, leaving 463 patients for neural network training and analysis (Figure 1).

Figure 1.

Patients were recruited for this study based on their diagnosis of idiopathic scoliosis, measured Cobb angle ≥ 20°, bracing orthosis management regime and one or more follow-ups at least three months after the patient’s first diagnosis. Most patients fulfilled these inclusion criteria; however, 7 patients were younger than 10 years old and 43 patients defaulted on follow-up before the three-month time-point. A total of 50 patients were hence excluded from this study.

Table 1 summarizes the demographics of the participants and the clinical data taken. There were 383 (82.7%) females and 80 (17.3%) males with ages ranging from 10 to 16 at time of recruitment. 232 (50.1%) suffered from in-brace curve progression whereas 231 (49.9%) did not progress. In addition to sex and age, clinical parameters including weight, sitting height, standing height and arm span were measured and recorded upon physical examination at the corresponding specialist clinics. Bi-planar X-rays and left-hand radiographs were taken at each visit. Growth potential parameters like Risser sign and distal radius and ulna classification were determined from the acquired radiographs and were recorded accordingly.

Table 1.

Baseline Demographics and Clinical Parameters at Time of Recruitment.

Parameter	n (%)
Sex
Male	80 (17.3)
Female	383 (82.7)
Missing	0 (.0)
Age at first visit (years)
10-12	157 (33.9)
13-14	237 (51.2)
15-16	67 (14.5)
>16	2 (.4)
Missing	0 (.0)
Standing Height (cm)
≤120	2 (.4)
121-140	16 (3.5)
141-160	316 (68.2)
>160	100 (21.6)
Missing	29 (6.3)
Sitting height (cm)
≤60	6 (1.3)
61-80	128 (27.7)
>80	296 (63.9)
Missing	33 (7.1)
Weight (kg)
≤40	199 (43.0)
41-60	225 (48.6)
61-80	6 (1.3)
>80	1 (.2)
Missing	32 (6.9)
Arm span (cm)
≤120	2 (.4)
121-140	19 (4.2)
141-160	271 (58.5)
161-180	128 (27.6)
>180	5 (1.1)
Missing	38 (8.2)
Risser sign
0	189 (40.9)
1	75 (16.2)
2	76 (16.4)
3	45 (9.7)
4	15 (3.2)
5	1 (.2)
Missing	62 (13.4)
Distal radius maturity
4	1 (.2)
5	7 (1.5)
6	72 (15.6)
7	100 (21.6)
8	116 (25.1)
9	26 (5.6)
10	1 (.2)
Missing	140 (30.2)
Ulna maturity
3	1 (.2)
4	10 (2.2)
5	98 (21.2)
6	130 (28.1)
7	61 (13.2)
8	21 (4.5)
9	1 (.2)
Missing	141 (30.4)

Table 2 lists the initial type of the major curve and its Cobb angle for the participants included in the analysis. At the first visit, 111 (24.0%) participants had a single curve, 283 (61.1%) had a double curve, and 69 (14.9%) had a triple curve. Most participants (52.3%) presented with a double curve with a lumbar major curve.

Table 2.

Distribution of Curve Types and Corresponding Cobb Angles at Time of Recruitment.

Single curve
Degrees (°)	Thoracic	Lumbar
20-29	15	79
30-39	5	11
≥40	0	1
Double curve
Degrees (°)	Thoracic	Lumbar
20-29	37	151
30-39	4	75
≥40	0	16
Triple curve
Degrees (°)	Thoracic	Lumbar
20-29	15	22
30-39	4	18
≥40	1	9

Bracing Protocol and Compliance

The two tertiary referral centers practice stringent measures to manage AIS and advocate bracing treatment. AIS patients are generally offered bracing for curves between 20° and 40°, less than 1 year post-menarche for girls, and Risser 0-2. However, some AIS patients with curves near to or over 40° and Risser >2 are offered bracing treatment to minimize the risk of any curve progression during growth.

Custom-moulded thoracolumbar braces were prescribed, and participants were advised to wear them for at least 20 hours per day and engage in recreational and physical activities during their off-brace periods. Nonetheless, bracing for at least 16 hours per day was considered acceptable. Their average daily hours of compliance were recorded at each follow-up visit and were subsequently included as one of the clinical parameters analyzed in this study. Table 3 stratifies the compliance ranges according to the presence of curve progression among the participants.

Table 3.

AIS Patients’ Compliance to Bracing Stratified by Presence of In-Brace Curve Progression.

	Compliance (hours per day)
Progression	≥16 hrs	<16 hrs	Missing	Total
Yes	69	135	28	232
No	94	101	36	231
Total	163	236	64	463

X-ray Protocol

All radiographs used in this study were acquired by bi-planar stereoradiography (EOS® imaging system, Paris, France). This imaging system collects simultaneous full body or upper body whole-spine bi-planar radiographs of the participants. Baseline full body radiographs were taken at the first visit for every patient. For each follow-up visit, patients were required to take two sets of radiographs, one with bracing orthosis and one without, for treatment efficacy assessment purposes. To achieve a consistent comparison with the pre-brace first-visit radiographs, only the follow-up radiographs taken without bracing orthosis were used in this study. Only the posteroanterior X-ray images were used for training.

X-ray Image Preprocessing for Model Training

Several steps were employed to prepare the radiographical images for neural network training.

First, the brightness and contrast of all X-ray images were normalized with the Contrast Limited Adaptive Histogram Equalization (CLAHE) image processing algorithm with OpenCV.

Second, three different regions of interest (ROI) were extracted from the original X-ray images: (i) whole-spine, (ii) upper end vertebra of the major curvature, and (iii) lower end vertebra of the major curvature, as demonstrated in Figure 2. To obtain the cropping borders for the ROIs, AlignProCARE²⁵ was employed to automatically generate two sets of coordinates (Figure 3). The first set consisted of 72 pairs of endplate coordinates from C7 to L5 level and the second set consisted of endplate coordinates that represent the first and last end vertebrae of all detected curvatures. These two sets of coordinates were labelled on top of the radiographs, as demonstrated in Figure 3, for all participants and verified manually. All coordinates generated were accurate, hence no manual correction was necessary. Subsequently, the very first and very last endplate coordinates were used to generate the whole-spine ROIs, spanning from C7 to the pelvic bone. For the upper and lower end vertebrae ROIs, endplate coordinates of the major curvature were used.

Figure 2.

Three regions of interest (ROI) were selected and processed for training from each original radiograph. The leftmost image is the original coronal view radiograph. The original radiograph was passed through the CLAHE algorithm to enhance image brightness and contrast. The three crop boxes outlined by red dotted lines illustrate the three crops generated for model training. Crop 1 selects the ROI for the whole-spine region according to coordinates exported by AlignProCARE. Crops 2 and 3 select the ROIs for the upper and lower end vertebra of the major curvature. The ROIs were subsequently resized to 256 × 256 for Crop 1 and 50 × 50 for Crops 2 and 3. The Crop 1 ROI was additionally multiplied by a Gaussian mask to amplify the signal of the vertebrae.

Figure 3.

AlignProCARE automatically generates 36 pairs of vertebral end plate coordinates represented by the red crosses in this coronal view radiograph. The four pairs of blue hollow circles are the upper and lower vertebral end plates of the detected spinal curvature. In other words, there are two curvatures detected in this given patient.

Third, because of the large amount of extra-axial information on the radiograph, for example the lung fields, the heart and the upper limbs, the endplate coordinates of all vertebral bodies generated by AlignProCARE were used to segment the spine and to create a Gaussian mask that enhances its intensity in the whole-spine crop images.

Lastly, the whole-spine ROIs were resized to 256 × 256 square images and the end vertebra ROIs were resized to 50 × 50 square images. The ROIs were saved as single channel grayscale images in JPG format which then served as 2D inputs of the modified CapsuleNet model.
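
As an illustration of how a cropping border could be derived from exported endplate coordinates, the following minimal Python sketch computes a bounding box around a set of points and clamps it to the image. The function name `crop_box`, the `margin` parameter and the sample coordinates are hypothetical; they are not taken from AlignProCARE or from the pipeline used in this study.

```python
def crop_box(points, margin, width, height):
    """Return (left, top, right, bottom) enclosing all points plus a margin.

    points: iterable of (x, y) pixel coordinates (hypothetical endplate points).
    margin: extra pixels added on every side (an assumed, tunable value).
    width, height: image size, used to clamp the box to the image.
    """
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    left = max(0, min(xs) - margin)
    top = max(0, min(ys) - margin)
    right = min(width, max(xs) + margin)
    bottom = min(height, max(ys) + margin)
    return left, top, right, bottom


# Made-up coordinates standing in for the first and last endplates of a spine.
endplates = [(40, 10), (60, 12), (35, 200), (70, 205)]
box = crop_box(endplates, margin=10, width=100, height=220)
```

The resulting box would then be passed to whichever image library performs the crop and the resize to 256 × 256 or 50 × 50.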

Clinical Parameters Preprocessing for Model Training

The clinical parameters used in this study include sex, age, weight, sitting height, standing height, arm span, Risser sign, distal radius and ulna classification for skeletal maturity assessment,¹⁰ and bracing compliance. Categorical data were represented as 1D unit vectors, whereas numerical data were normalized to a number between 0 and 1.

One of the biggest potential challenges of this study was missing data. The numerical parameters of this study were highly specific to adolescent growth, which is time-dependent, and thus could not be re-recorded appropriately if missed. Nonetheless, data imputation was not indicated, and missing values were handled by replacing them with -1 after normalization. This manipulation aimed to help the neural network recognize missing entries as values well outside the normal range of 0 to 1.

All data was then concatenated into a single 1D vector with the same order for all patients; the specific order was randomly decided. These processed vectors then served as the 1D input of the modified CapsuleNet model.
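
The encoding described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the category lists, the value ranges in `RANGES`, the field order and the treatment of a missing categorical value (every slot set to -1) are hypothetical choices, and only a subset of the clinical parameters is shown.

```python
MISSING = -1.0  # sentinel written in place of any missing value

# Hypothetical category lists and value ranges, chosen for illustration only.
SEX = ("M", "F")
RISSER = (0, 1, 2, 3, 4, 5)
RANGES = {"age": (8.0, 18.0), "standing_height": (100.0, 200.0)}


def one_hot(value, categories):
    """Unit vector for a categorical value; every slot is MISSING if absent."""
    if value is None:
        return [MISSING] * len(categories)
    return [1.0 if value == c else 0.0 for c in categories]


def min_max(value, low, high):
    """Scale a value inside [low, high] to [0, 1]; MISSING if absent."""
    if value is None:
        return MISSING
    return (value - low) / (high - low)


def build_vector(patient):
    """Concatenate encoded fields in one fixed order shared by all patients."""
    vector = []
    vector += one_hot(patient.get("sex"), SEX)
    vector.append(min_max(patient.get("age"), *RANGES["age"]))
    vector.append(min_max(patient.get("standing_height"),
                          *RANGES["standing_height"]))
    vector += one_hot(patient.get("risser"), RISSER)
    return vector


# A patient whose standing height was not recorded.
example = build_vector(
    {"sex": "F", "age": 13.0, "standing_height": None, "risser": 0}
)
```

Because every patient is encoded in the same field order, each position of the vector always carries the same parameter, which is what allows the network to learn from it.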

Definition and Calculation of AIS Curve Progression for Model Training

The presence of AIS curve progression was defined by a minimum increase of 5° in major curve Cobb angle²⁶ from the time of diagnosis to any follow-up within the follow-up timeframe, which was at least three months in duration. Participants with less than 5° increase in major curve Cobb angle and those with curve correction within their follow-up timeframe were thus labeled as non-progression. Paired examples of each class are illustrated in Figure 4. The initial and final Cobb angles were calculated from the major curve endplate coordinates automatically generated by AlignProCARE, after which the presence of progression could be determined directly. The progression status of each patient was labeled as [0, 1] for progression or [1, 0] for non-progression. This binary vector was the expected output of the modified CapsuleNet model.
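
The labeling rule can be illustrated with a short Python sketch. It assumes that each endplate is given as a left and a right (x, y) point and that the Cobb angle is the acute angle between the two endplate lines; the function names and sample coordinates are hypothetical, and the exact computation used with the AlignProCARE coordinates is not specified in this paper.

```python
import math


def endplate_tilt(left, right):
    """Tilt of an endplate line in degrees, from its left and right points."""
    return math.degrees(math.atan2(right[1] - left[1], right[0] - left[0]))


def cobb_angle(upper_endplate, lower_endplate):
    """Acute angle in degrees between the two end-vertebra endplate lines."""
    diff = abs(endplate_tilt(*upper_endplate) - endplate_tilt(*lower_endplate))
    return diff if diff <= 90.0 else 180.0 - diff


def progression_label(initial_cobb, follow_up_cobbs, threshold=5.0):
    """[0, 1] if any follow-up exceeds baseline by >= threshold, else [1, 0]."""
    progressed = any(c - initial_cobb >= threshold for c in follow_up_cobbs)
    return [0, 1] if progressed else [1, 0]


# Made-up endplate points: ((left_x, left_y), (right_x, right_y)).
upper = ((0.0, 0.0), (10.0, 2.0))
lower = ((0.0, 50.0), (10.0, 47.0))
baseline = cobb_angle(upper, lower)

label_progressed = progression_label(25.0, [27.0, 30.5])
label_stable = progression_label(25.0, [22.0, 29.0])
```

Using `any` reflects the definition above, in which an increase of at least 5° at any follow-up counts as progression.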

Figure 4.

A. An example of curve improvement with bracing orthosis management between the first and most recent clinic consultation. B. An opposite example showing curve progression despite bracing management. Note that the radiographs were obtained with the bracing orthosis taken off.

Prediction Model and Pipeline

CapsuleNet, developed by Sabour et al.,²⁷ is believed to be suitable for efficiently extracting orientation-related features within the radiographs, including vertebral rotations and rib asymmetry, in addition to other general spatial features. The capsule design, dynamic routing and equivariance property enable CapsuleNet to extract orientation information more accurately, and it can thus classify 2D spinal radiographs more accurately than conventional convolutional neural network (CNN) architectures.

Using CapsuleNet as the backbone, a new model was developed that takes preprocessed posteroanterior radiographs and clinical growth data as inputs to automatically predict the likelihood of AIS progression. The modified CapsuleNet of this study (Figure 5) had a four-channel multi-dimensional input and a binary output. The three 2D-input channels consisted of one 256 × 256 input for the whole-spine image and two 50 × 50 inputs for the end vertebra images. The end vertebra images were included to put more emphasis on the major curvature. The remaining 1D-input channel received the vector of clinical growth parameters.

Figure 5.

The modified CapsuleNet model adopted in this study. The major feature is the concatenation of the 1D inputs and 2D inputs before passing into the backbone of CapsuleNet. The 1D inputs consist of the clinical parameters whereas the 2D inputs consist of the three ROIs generated per radiograph. The 1D inputs are passed to a Dense layer first and repeated into 2D square matrices to match the dimensions of the 2D ROIs. With the same dimensions, the once 1D parameters are concatenated with the 2D ROIs. The concatenated tensor is first passed to a Conv2D layer for preliminary feature extraction, then into the PrimaryCaps layer where squashing takes place. The two 50 × 50 layers are concatenated before passing to the PrimaryCaps layer. The outputs of the PrimaryCaps layers are then all concatenated and passed to one capsule layer, ProgressionCaps, for routing. The length of the capsules is then calculated and presented as the main output of this model, which represents the probability of AIS progression.

The 1D input was first passed into a fully connected layer to extract features and expanded to 256 and 50 elements. The expanded vectors were then repeated to form 256 × 256 and 50 × 50 squares to match the dimensions of the whole-spine images and the end vertebra images respectively. Then, with matching dimensions, the images and the expanded clinical parameter layers were concatenated and organized into three main branches, one for each 2D ROI input. Each branch was followed by a convolutional layer with batch normalization and rectified linear unit activation. L2 regularization was also added to minimize overfitting. The two end vertebra branches were then concatenated into one and passed to an additional convolutional layer. Then, the two remaining branches were passed into separate Primary Capsule layers where corresponding features were extracted. The two sets of Primary Capsule outputs were then concatenated and passed into one Capsule Layer. Lastly, the votes from the routing-by-agreement mechanism were summed and normalized by a squash function to output a likelihood vector.²⁷ The output vector of CapsuleNet was binarized into [0, 1] or [1, 0] for AIS progression prediction.
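
The early merge of 1D and 2D inputs can be illustrated with a small pure-Python sketch, using a 4 × 4 toy image in place of the 256 × 256 and 50 × 50 ROIs. It is not the trained model: the weights are random placeholders, the ReLU activation on the fully connected layer and the row-wise direction of repetition are assumptions, and all function names are hypothetical.

```python
import random


def dense_relu(x, weights, bias):
    """Fully connected layer followed by ReLU (the activation is assumed)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, bias)]


def tile_to_square(vector):
    """Repeat a length-n vector n times to form an n x n plane."""
    return [list(vector) for _ in vector]


def concat_channels(image, plane):
    """Stack a grayscale image and a plane into an H x W x 2 nested list."""
    return [[[pixel, value] for pixel, value in zip(image_row, plane_row)]
            for image_row, plane_row in zip(image, plane)]


SIZE = 4  # stand-in for 256 (whole spine) or 50 (end vertebra)
clinical = [0.0, 1.0, 0.5]  # toy 1D clinical vector
rng = random.Random(0)  # placeholder weights; a real layer would learn these
weights = [[rng.uniform(-1.0, 1.0) for _ in clinical] for _ in range(SIZE)]
bias = [0.0] * SIZE

image = [[0.5] * SIZE for _ in range(SIZE)]  # toy grayscale ROI
expanded = dense_relu(clinical, weights, bias)  # length SIZE
fused = concat_channels(image, tile_to_square(expanded))  # SIZE x SIZE x 2
```

After this step every pixel location carries both the image intensity and a clinical feature, so the convolutional and capsule layers that follow can combine the two sources from the first layer onward.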

To train the model, the dataset was split into a training set and a test set at a four-to-one ratio, giving a training size of 371 and a test size of 92. All 2D input images were randomly augmented by (1) lateral flip, (2) rotation between -30° and +30° and (3) zoom between .8 and 1.2. The modified CapsuleNet model was trained for 100 epochs at an initial learning rate of 1e-3, decaying at a rate of .9 per epoch. The Adam optimizer was employed with a categorical cross-entropy loss function.
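
The training configuration can be sketched in plain Python as follows. This is an illustration under assumptions rather than the training script used in the study: the decay is taken to be multiplicative per epoch, the flip probability of 0.5 and the random seeds are arbitrary, and the function names are hypothetical.

```python
import random


def split_indices(n_patients, test_fraction=0.2, seed=0):
    """Shuffle patient indices and split them into (train, test) lists."""
    indices = list(range(n_patients))
    random.Random(seed).shuffle(indices)
    n_test = int(n_patients * test_fraction)
    return indices[n_test:], indices[:n_test]


def learning_rate(epoch, initial=1e-3, decay=0.9):
    """Exponential decay: the rate is multiplied by `decay` once per epoch."""
    return initial * decay ** epoch


def sample_augmentation(rng):
    """Draw one random set of augmentation parameters for an image."""
    return {
        "flip": rng.random() < 0.5,  # lateral flip (probability is assumed)
        "rotation_deg": rng.uniform(-30.0, 30.0),
        "zoom": rng.uniform(0.8, 1.2),
    }


train_idx, test_idx = split_indices(463)
schedule = [learning_rate(epoch) for epoch in range(100)]
augmentation = sample_augmentation(random.Random(0))
```

With 463 patients and a test fraction of 0.2, this split reproduces the reported sizes of 371 and 92.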

Evaluation Metrics

The development of the modified CapsuleNet model aims to screen for AIS patients with high risk of in-brace curve progression. In addition to high sensitivity and specificity, a reliable healthcare screening test should also have a good negative predictive value to confidently discharge patients without the condition screened. Nonetheless, for a comprehensive evaluation of model performance, the following metrics were generated:

\text{Sensitivity} = \frac{\text{True Positive}}{\text{True Positive} + \text{False Negative}}

(1)

\text{Specificity} = \frac{\text{True Negative}}{\text{True Negative} + \text{False Positive}}

(2)

\text{Positive Predictive Value (Precision)} = \frac{\text{True Positive}}{\text{True Positive} + \text{False Positive}}

(3)

\text{Negative Predictive Value} = \frac{\text{True Negative}}{\text{True Negative} + \text{False Negative}}

(4)

\text{Accuracy} = \frac{\text{True Positive} + \text{True Negative}}{\text{True Positive} + \text{True Negative} + \text{False Positive} + \text{False Negative}}

(5)

\text{F1 score} = \frac{2 \times \text{True Positive}}{2 \times \text{True Positive} + \text{False Positive} + \text{False Negative}}

(6)
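
Equations 1 to 6 translate directly into code. The following minimal Python sketch computes all six metrics from the four cells of a confusion matrix; the function name is hypothetical, and the counts in the usage lines are those reported for the test set in Table 4.

```python
def screening_metrics(tp, fp, fn, tn):
    """Compute the six metrics of equations 1 to 6 from confusion counts."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Test-set counts from Table 4: 45 true positives, 19 false positives,
# 5 false negatives and 23 true negatives.
metrics = screening_metrics(tp=45, fp=19, fn=5, tn=23)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```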

Results

In the in-brace test population, 45 (90%) were correctly predicted among the 50 progressed patients, all of whom had at least a 5° increase in the major curve Cobb angle. For the 42 non-progression patients, 23 (54.8%) were correctly predicted and the remaining 19 (45.2%) were incorrectly predicted as “will progress”. In terms of the predictive power of the model, among the 64 predicted positive results, 45 (70.3%) were true positives and 19 (29.7%) were false positives. For the 28 predicted negative results, 23 (82.1%) were true negatives and 5 (17.9%) were false negatives. These numbers are represented in a confusion matrix in Table 4 and a list of performance metrics is given in Table 5.

Table 4.

Confusion Matrix of the Testing Dataset Predicted by the Modified CapsuleNet Model.

	Test positive	Test negative	Total
Predicted positive	45	19	64
Predicted negative	5	23	28
Total	50	42	92

Table 5.

Performance Metrics of the Modified CapsuleNet Model, ResNet Model and DenseNet Model.

	ModifiedCapsuleNet	ResNet	DenseNet
Sensitivity/Recall	90.0%	69.6%	95.6%
Specificity	54.8%	60.4%	18.7%
Positive Predictive Value (Precision)	70.3%	62.7%	53.0%
Negative Predictive Value	82.1%	67.4%	81.8%
Accuracy	73.9%	64.9%	56.4%
F1 score	79.0%	66.0%	68.2%
ROC AUC	.76	.69	.53

The modified CapsuleNet achieved an overall accuracy of 73.9% with a high sensitivity of 90% and a negative predictive value (NPV) of 82.1%. ResNet and DenseNet models were also trained to compare the modified CapsuleNet architecture with conventional CNNs, and their performance metrics are juxtaposed in Table 5. The receiver operating characteristic (ROC) curves of the respective models are plotted in Figure 6. The ROC curve of the modified CapsuleNet has a larger area under the curve (.76) than those of ResNet (.69) and DenseNet (.53).

Figure 6.

The ROC curve of the modified CapsuleNet model (blue line) has the largest area under the curve, outperforming the ResNet model (red line) and the DenseNet model (green line). This indicates that the modified CapsuleNet model is able to achieve a higher sensitivity with a smaller compromise in specificity.

Table 6 presents a contingency table that examines the relationship between AIS progression and bracing compliance among participants with recorded compliance with a cut-off at 16 bracing hours per day. A chi-square test was conducted to test the independence of these two variables, which yielded a P-value of .0035. Table 7 compares performance metrics between the model trained with bracing compliance added as an input, and one trained without.

Table 6.

Contingency Table for Presence of In-Brace Curve Progression Observed Against Compliance to Bracing After Excluding Those With Missing Compliance Data.

	Compliance (hours per day)
Progression	≥16 hrs	<16 hrs	Total
Yes	69	135	204
No	94	101	195
Total	163	236	399
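
The chi-square test of independence on Table 6 can be reproduced with the Python standard library alone. The sketch below assumes a Pearson chi-square test without continuity correction; the paper does not state whether a correction was applied, but the uncorrected statistic gives a P-value close to the reported .0035. The function name is hypothetical.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square test (1 degree of freedom) for the table [[a, b], [c, d]].

    Returns (statistic, p_value). No continuity correction is applied.
    """
    n = a + b + c + d
    statistic = n * (a * d - b * c) ** 2 / (
        (a + b) * (c + d) * (a + c) * (b + d)
    )
    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


# Rows: progression yes/no. Columns: compliance >=16 hrs / <16 hrs (Table 6).
chi2, p = chi_square_2x2(69, 135, 94, 101)
print(f"chi-square = {chi2:.3f}, P = {p:.4f}")
```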

Table 7.

Performance Metrics of the Modified CapsuleNet Model Trained With and Without Adding Bracing Compliance to the 1D Model Input Vector.

	Modified CapsuleNet trained with compliance	Modified CapsuleNet trained without compliance
Sensitivity/Recall	90.0%	80.0%
Specificity	54.8%	66.7%
Positive Predictive Value	70.3%	74.1%
Negative Predictive Value	82.1%	73.7%
Accuracy	73.9%	73.9%
F1 score	79.0%	76.9%
ROC AUC	.76	.74

Discussion

The prediction performance metrics of the generated model in this study show promising potential applications. The best metrics are sensitivity (90%) and NPV (82.1%), which outperformed a radiographs-only model.²³ As all recruited participants had already undergone treatment with bracing orthosis upon diagnosis, these results suggest that this model could be a great screening tool to confidently identify individuals without the risk of in-brace curve progression, who can be managed with bracing orthosis until their spine is fully developed. The straightforward pipeline also allows this model to be easily applied to supplement daily clinical practice in AIS management.

In addition to the conventional growth predictors, bracing compliance was included in the 1D input for model training. Patient compliance to bracing treatment is essential to successfully arrest curve progression.²⁸ However, in this study population, bracing compliance was poor, with only 163 (35.2%) participants wearing the braces for the accepted minimum of 16 hours per day, and 236 (50.9%) participants wearing the braces for less than 16 hours per day, as shown in Table 3. To determine whether including bracing compliance in model training was necessary, a brief analysis, excluding those for whom compliance hours were not recorded, was conducted by performing a chi-square test. The test yielded a P-value of .0035, indicating that curves were highly unlikely to progress independently of bracing compliance in this population. As a result, compliance data was added to the 1D inputs for model training. As shown in Table 7, adding compliance raised sensitivity (80.0% to 90.0%), NPV (73.7% to 82.1%), F1 score and ROC AUC, while overall accuracy was unchanged at 73.9% and specificity fell from 66.7% to 54.8%. Because the model is intended as a screening tool, for which sensitivity and NPV matter most, the discussion below focuses on the model trained with bracing compliance included.

Moreover, the specific metrics have meaningful clinical implications. Firstly, the model’s high sensitivity of 90% and PPV of 70.3% mean that it has a low chance of missing a diagnosis of in-brace curve progression, and that the management indicated has a good chance of benefiting patients who may still progress despite bracing. Therefore, in addition to being offered immediate bracing, almost all patients with a predicted risk of in-brace curve progression can be reviewed more frequently or at least flagged for close observation.²⁹ More rigorous lifestyle modifications, including exercises and physiotherapy, can also be encouraged,³⁰ and bracing compliance should be ensured in patients at risk of progression. Specialists can also pay more attention accordingly when performing physical examinations and radiographic interpretations, and can decide to monitor progression more closely only for those at risk, which can ease the load on busy clinics and reduce unnecessary healthcare costs. Additionally, it is economically favorable to offer bracing treatment at a higher frequency, as such an arrangement can reduce the rate of costly surgeries and potential complications^29,31 by slowing down curve progression. Thus, from a health economics perspective, it is still feasible to commence bracing management on a patient if the model predicts a positive in-brace progression risk despite its relatively low specificity of 54.8%. Of course, the possible psychosocial and physical impacts of bracing should still be explored with the patients.³² Integrating the model’s prediction with the clinician’s expertise and experience can maximize the overall accuracy of AIS progression prediction in clinical practice and can limit the influence of false positives on the patients, ultimately minimizing unnecessary stress and lifestyle adjustments for patients and families as well as preventing unnecessary treatment and resource consumption.

Secondly, the relatively high NPV of 82.1% indicates that as long as the model labels a patient as “non-progression” at the first visit, there is a high probability that the patient will not undergo in-brace curve progression if given bracing. Consequently, healthcare resources can be redistributed to those more likely at risk and reassurance can be given to the patient and their family. With the help of the model, all these decisions can be made with the patients and families during the first visit to the clinic.

The integration of this model into routine consultations is straightforward and shows promise in assisting clinicians in formulating a management plan for AIS patients by automating the prediction of the probability of in-brace curve progression. With the utilization of AlignProCARE in this study, the ROIs for the model can be automatically generated, making the entire prediction pipeline fully automated. Recently, there have been several genetic bioinformatics studies that explored the genetic variants associated with AIS manifestation.³³ Although several genes have been validated in the literature, the incorporation of an expensive and time-consuming genetic study in a routine diagnosis pipeline may not be the most cost-effective and convenient arrangement. Another study proposed a prediction pipeline involving 3D structural parameters, like plane of spinal torsion and disk wedging, which require 3D reconstruction of biplanar radiographs.¹⁹ Not all clinics have access to an EOS system to take biplanar X-rays or possess the computational power to perform 3D reconstruction and analysis quickly. Although the posteroanterior radiographs of the participants in this study were collected with the EOS system, which offers high resolution and great readability, this does not undermine the potential scalability of this model, as only single-plane posteroanterior radiographs are required for prediction, and these could be obtained by regular X-ray imaging systems. Nevertheless, a future study may be conducted with the modified CapsuleNet model to further evaluate the possibility of using conventional 2D radiographs as the 2D input.

Regarding the modified CapsuleNet model design used in this study, only the version using a hybrid input of 2D radiographs and 1D clinical parameters reached this level of accuracy, sensitivity and negative predictive value. With either input alone, the overall accuracy was 54% (2D data only) and 57% (1D data only). In addition, several structural modifications were made to the CapsuleNet backbone to improve the performance metrics. Neural networks with multiple input channels of the original and processed images have been shown to improve the segmentation accuracy of the target organ and tumor in computed tomography images compared to those with single inputs only.³⁴ Moreover, CapsuleNet performs better on images of lower complexity,³⁵ but a typical frontal spinal radiograph is visually complicated, and simply downsizing the whole-spine ROI would severely degrade its spatial resolution and information content. Therefore, the end vertebral bodies of the major curvature, which are visually much less complex, were cropped into ROIs and used as supplementary 2D inputs together with the whole-spine ROI. For the handling of multi-dimensional inputs, instead of merging the inputs of different dimensions at the end by concatenation and adding fully connected layers thereafter,³⁶ this model mapped the 1D clinical data into 2D matrices matching the corresponding 2D radiograph input shapes (50 × 50 and 256 × 256) and concatenated them with the radiographs before passing them into the capsule layers. This early merge of the hybrid data improved the accuracy of the model, as more features could be extracted by the early layers of the network. All these modifications in the design of the network architecture contributed to the final model presented in this study.
The fact that our model is able to predict AIS progression with acceptable overall accuracy within the bracing population implies that features associated with in-brace progression are present in both the 1D clinical growth parameters and the 2D radiographs, and that the integration of these inputs is essential to making a reliable prediction.
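The early-merge idea described above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the study's implementation: the text does not specify how the 1D values are laid out in the 2D matrices, so the sketch assumes each clinical value is tiled into a constant plane, and the names `tile_clinical` and `early_fuse`, the three clinical values and the random images are all hypothetical.

```python
import numpy as np


def tile_clinical(clinical, height, width):
    """Broadcast each scalar clinical value into a constant (height, width) plane.

    Assumption: constant tiling; the actual 1D-to-2D mapping may differ.
    """
    clinical = np.asarray(clinical, dtype=np.float32)
    planes = np.broadcast_to(clinical[:, None, None], (clinical.size, height, width))
    return planes.copy()  # copy so the result is a writable, independent array


def early_fuse(image, clinical):
    """Concatenate an image (H, W) or (C, H, W) with tiled clinical planes on the channel axis."""
    image = np.asarray(image, dtype=np.float32)
    if image.ndim == 2:
        image = image[None]  # add a channel axis
    _, height, width = image.shape
    return np.concatenate([image, tile_clinical(clinical, height, width)], axis=0)


rng = np.random.default_rng(0)
whole_spine = rng.random((256, 256))   # stand-in for the whole-spine ROI
end_vertebra = rng.random((50, 50))    # stand-in for an end-vertebra ROI
clinical = [0.42, 0.10, 0.77]          # hypothetical normalized clinical values

fused_spine = early_fuse(whole_spine, clinical)        # shape (4, 256, 256)
fused_vertebra = early_fuse(end_vertebra, clinical)    # shape (4, 50, 50)
```

Because the clinical planes share the spatial shape of the image, the first layers of the network see both sources at every pixel location, in contrast to late fusion, where the 1D data only meet the image features after the feature extractor.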

The training outcome indicates that the modified CapsuleNet architecture outperformed conventional CNNs such as ResNet and DenseNet. Instead of the scalar-output feature detectors and max-pooling used in conventional neural networks, CapsuleNet implements vector-output capsules and routing-by-agreement to increase the efficiency and precision with which the model learns the entities in each specific region of the image.²⁷ This advantage makes CapsuleNet a strong candidate for detecting fine spatial information, specifically orientation information in this case, more accurately and with better generalizability.
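For readers unfamiliar with vector-output capsules, the following is a minimal NumPy sketch of the squash non-linearity and routing-by-agreement as described by Sabour et al.²⁷ It illustrates the generic mechanism only and is not the modified architecture used in this study; the tensor sizes, the three routing iterations and the random prediction vectors are assumptions made for the example.

```python
import numpy as np


def squash(s, axis=-1, eps=1e-9):
    """Shrink a vector's length into [0, 1) while preserving its direction."""
    sq_norm = np.sum(s * s, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)


def route_by_agreement(u_hat, iterations=3):
    """Dynamic routing between a lower and a higher capsule layer.

    u_hat: array of shape (n_in, n_out, dim), the prediction of each lower
           capsule for each higher capsule.
    Returns the higher-capsule outputs v (n_out, dim) and the coupling
    coefficients c (n_in, n_out) used in the final iteration.
    """
    n_in, n_out, _ = u_hat.shape
    b = np.zeros((n_in, n_out))                     # routing logits
    for _ in range(iterations):
        c = softmax(b, axis=1)                      # each lower capsule distributes its vote
        s = np.einsum("ij,ijd->jd", c, u_hat)       # weighted sum of predictions
        v = squash(s)                               # vector output of higher capsules
        b = b + np.einsum("ijd,jd->ij", u_hat, v)   # raise logits where prediction and output agree
    return v, c


rng = np.random.default_rng(0)
u_hat = rng.normal(size=(6, 2, 4))   # 6 lower capsules, 2 higher capsules, 4-D poses
v, c = route_by_agreement(u_hat)
```

The length of each output vector can be read as the probability that the entity is present, while its direction encodes pose properties such as orientation, which is the information that max-pooling tends to discard.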

This study has several limitations. The first was the relatively small sample size. By comparison, a recent study using a modified version of CapsuleNet to detect COVID-19 infection from chest X-rays used a total of 1342 images.³⁷ The second was the amount of missing 1D clinical data. Data imputation is not appropriate in this case as the clinical data appears to be missing at random.³⁸ Since the recruitment of AIS patients commenced in 2016, the retrospective nature of the study did not allow the investigators to acquire a complete set of clinical data.

For the current modified CapsuleNet model to function as a reliable screening tool with enhanced sensitivity and, especially, specificity, several improvements to the current study should be implemented. As immediate next steps, more patients with AIS need to be prospectively recruited to generate a larger and more complete dataset that accurately records at least the conventionally acknowledged risk factors for AIS progression⁸ and skeletal maturity indicators, along with the radiographs. With a more comprehensive dataset, the feasibility of this automated multi-dimensional approach in predicting AIS progression can be further evaluated in a prospective clinical trial.

Conclusions

This study used a modified version of the CapsuleNet model to create an AI model that predicts in-brace AIS progression risk with a high sensitivity of 90%. The results demonstrate the complementary nature of 2D radiographs and 1D clinical data in developing a reliable model for predicting progression risk. Using such an AI model in clinical practice as a preliminary screening tool would aid the personalized management of AIS patients indicated for bracing. Patients without in-brace progression risk can be reassured and managed with bracing and other lifestyle modifications, while patients with suspected in-brace progression risk can receive first-line bracing treatment and be monitored more closely, with more frequent brace reviews, to suppress progression and minimize the need for costly interventions. Thus, the results from the model offer additional insights for clinicians and assist in better personalizing the management of AIS patients indicated for bracing.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by Mid-stream research fund (MRP/038/20X) and HKU impact case development fund.

ORCID iDs

Kenneth Chu

Teng Zhang

References

1. McAlister, Shackelford. Classification of spinal curvatures. Radiol Clin North Am. 1975;13(1):93-112.

2. Goldstein, Waugh. Classification and terminology of scoliosis. Clin Orthop Relat Res. 1973;93:10-22. doi:10.1097/00003086-197306000-00003.

3. Riseborough, Wynne-Davies. A genetic survey of idiopathic scoliosis in Boston, Massachusetts. J Bone Joint Surg Am. 1973;55(5):974-982.

4. Miller. Cause and natural history of adolescent idiopathic scoliosis. Orthop Clin North Am. 1999;30(3):343-352. doi:10.1016/s0030-5898(05)70091-2.

5. Huh, Eun, Kim, Jung, Choi, Kim. Cardiopulmonary function and scoliosis severity in idiopathic scoliosis children. Korean J Pediatr. 2015;58(6):218-223. [published Online First: 20150622]. doi:10.3345/kjp.2015.58.6.218.

6. Bunnell. Selective screening for scoliosis. Clin Orthop Relat Res. 2005;434:40-45. doi:10.1097/01.blo.0000163242.92733.66.

7. Hwang, Lee, Cho. Progression of trunk imbalance in adolescent idiopathic scoliosis with a thoracolumbar/lumbar curve: is it predictable at the initial visit? J Neurosurg Pediatr. 2017;20(5):450-455. doi:10.3171/2017.6.PEDS17313.

8. Noshchenko, Hoffecker, Lindley, et al. Predictors of spine deformity progression in adolescent idiopathic scoliosis: A systematic review with meta-analysis. World J Orthop. 2015;6(7):537-558. doi:10.5312/wjo.v6.i7.537.

9. Cheung PWH, Cheung JPY. Does the use of Sanders staging and distal radius and ulna classification avoid mismatches in growth assessment with Risser staging alone? Clin Orthop Relat Res. 2021;479(11):2516-2530. doi:10.1097/CORR.0000000000001817.

10. Cheung JPY, Cheung PWH, Samartzis, Luk. APSS-ASJ Best Clinical Research Award: Predictability of Curve Progression in Adolescent Idiopathic Scoliosis Using the Distal Radius and Ulna Classification. Asian Spine J. 2018;12(2):202-213. doi:10.4184/asj.2018.12.2.202.

11. Cheung JPY, Cheung PWH, Samartzis, Luk. Curve progression in adolescent idiopathic scoliosis does not match skeletal growth. Clin Orthop Relat Res. 2018;476(2):429-436. doi:10.1007/s11999.0000000000000027.

12. Cobb. Outline for the study of scoliosis. Instr Course Lect AAOS. 1948;5:261-275.

13. Cassar-Pullicino, Eisenstein. Imaging in scoliosis: what, why and how? Clin Radiol. 2002;57(7):543-562.

14. Johnson, Flynn, Anari, et al. Risk of scoliosis progression in nonoperatively treated adolescent idiopathic scoliosis based on skeletal maturity. J Pediatr Orthop. 2021;41(9):543-548. doi:10.1097/BPO.0000000000001929.

15. Wong LPK, Cheung PWH, Cheung JPY. Curve type, flexibility, correction, and rotation are predictors of curve progression in patients with adolescent idiopathic scoliosis undergoing conservative treatment: a systematic review. Bone Joint J. 2022;104-B(4):424-432. doi:10.1302/0301-620X.104B4.BJJ-2021-1677.R1.

16. Cheung JPY, Cheung PWH. Supine flexibility predicts curve progression for patients with adolescent idiopathic scoliosis undergoing underarm bracing. Bone Joint J. 2020;102-B(2):254-260. doi:10.1302/0301-620X.102B2.BJJ-2019-0916.R1.

17. Wong LPK, Cheung PWH, Cheung JPY. Supine correction index as a predictor for brace outcome in adolescent idiopathic scoliosis. Bone Joint J. 2022;104-B(4):495-503. doi:10.1302/0301-620X.104B4.BJJ-2021-1220.R1.

18. Cheung JPY, Cheung PWH, Yeng, Chan LCK. Does curve regression occur during underarm bracing in patients with adolescent idiopathic scoliosis? Clin Orthop Relat Res. 2020;478(2):334-345. doi:10.1097/CORR.0000000000000989.

19. Nault, Beausejour, Roy-Beaudry, et al. A predictive model of progression for adolescent idiopathic scoliosis based on 3D spine parameters at first visit. Spine (Phila Pa 1976). 2020;45(9):605-611. doi:10.1097/BRS.0000000000003316.

20. Zhang, Cheuk, et al. A validated composite model to predict risk of curve progression in adolescent idiopathic scoliosis. EClinicalMedicine. 2020;18:100236. doi:10.1016/j.eclinm.2019.12.006.

21. Tajdari, Maqsood, Saha, Sarwark, Liu. Artificial intelligence data-driven 3D model for AIS. Stud Health Technol Inform. 2021;280:141. doi:10.3233/SHTI210453.

22. Esteva, Chou, Yeung, et al. Deep learning-enabled medical computer vision. NPJ Digit Med. 2021;4(1):5. doi:10.1038/s41746-020-00376-2.

23. Wang, Zhang, Cheung KM-C, Shea GK-H. Application of deep learning upon spinal radiographs to predict progression in adolescent idiopathic scoliosis at first clinic visit. EClinicalMedicine. 2021;42:101220.

24. Tajdari, Pawar, et al. Image-based modelling for Adolescent Idiopathic Scoliosis: Mechanistic machine learning analysis and prediction. Computer Methods in Applied Mechanics and Engineering. 2021;374:113590. doi:10.1016/j.cma.2020.113590.

25. Aimed. AlignPro. Secondary AlignPro 2020. aimed.hku.hk/alignprocare.

26. Soucacos, Zacharis, Gelalis, et al. Assessment of curve progression in idiopathic scoliosis. Eur Spine J. 1998;7(4):270-277. doi:10.1007/s005860050074.

27. Sabour, Frosst, Hinton. Dynamic routing between capsules. Advances in Neural Information Processing Systems. 2017;30:456.

28. Nachemson, Peterson. Effectiveness of treatment with a brace in girls who have adolescent idiopathic scoliosis. A prospective, controlled study based on data from the Brace Study of the Scoliosis Research Society. J Bone Joint Surg Am. 1995;77(6):815-822. doi:10.2106/00004623-199506000-00001.

29. Asher, Burton. Adolescent idiopathic scoliosis: natural history and long term treatment effects. Scoliosis. 2006;1(1):2. doi:10.1186/1748-7161-1-2.

30. Fusco, Zaina, Atanasio, Romano, Negrini. Physical exercises in the treatment of adolescent idiopathic scoliosis: an updated systematic review. Physiother Theory Pract. 2011;27(1):80-114. doi:10.3109/09593985.2010.533342.

31. Al-Mohrej, Aldakhil, Al-Rabiah. Surgical treatment of adolescent idiopathic scoliosis: Complications. Ann Med Surg (Lond). 2020;52:19-23. doi:10.1016/j.amsu.2020.02.004.

32. Piantoni, Tello, Remondino, et al. Quality of life and patient satisfaction in bracing treatment of adolescent idiopathic scoliosis. Scoliosis Spinal Disord. 2018;13:26. doi:10.1186/s13013-018-0172-0.

33. Xia, et al. A genetic predictive model estimating the risk of developing adolescent idiopathic scoliosis. Curr Genomics. 2019;20(4):246-251. doi:10.2174/1389202920666190730132411.

34. Aghamohammadi, Ranjbarzadeh, Naiemi, Mogharrebi, Dorosti, Bendechache. TPCNN: Two-path convolutional neural network for tumor and liver segmentation in CT images using a novel encoding approach. Expert Systems with Applications. 2021;183:115406. doi:10.1016/j.eswa.2021.115406.

35. Patrick, Adekoya, Mighty. Capsule networks–a survey. Journal of King Saud University-Computer and Information Sciences. 2022;34(1):1295-1310.

36. Yuan, Jiang, Huang. Hybrid-DNNs: Hybrid deep neural networks for mixed inputs. arXiv preprint arXiv:2005.08419. 2020.

37. Toraman, Alakus, Turkoglu. Convolutional capsnet: A novel artificial neural network approach to detect COVID-19 disease from X-ray images using capsule networks. Chaos, Solitons & Fractals. 2020;140:110122.

38. Jakobsen, Gluud, Wetterslev, Winkel. When and how should multiple imputation be used for handling missing data in randomised clinical trials–a practical guide with flowcharts. BMC Medical Research Methodology. 2017;17(1):1-10.

Predicting Progression in Adolescent Idiopathic Scoliosis at the First Visit by Integrating 2D Imaging and 1D Clinical Information
