Abstract
Background:
Magnetic resonance imaging (MRI) assessments are considered important for predicting patellar dislocation. Risk factors for patellar dislocation have not yet been clearly identified.
Purpose:
To identify patellofemoral instability anatomic risk factors using MRI and enhance the area under the curve (AUC) through optimized machine learning (ML) methods.
Study Design:
Case-control study; Level of evidence, 3.
Methods:
An age- and sex-matched control group of 121 patients was selected. The assessed patellofemoral morphological parameters included trochlear depth, sulcus angle, trochlear facet asymmetry, lateral trochlear inclination, Wiberg index, lateral patellar facet angle, Wiberg angle, patellar tilt, and trochlear medialization. Differences between groups were analyzed based on the MRI parameters. In addition, ML models were created and optimized using these measured parameters to investigate their diagnostic potential. Logistic regression analysis (LRA), support vector machine (SVM), and light gradient boosting machine (LGBM) were employed as machine learning techniques.
Results:
Sex differences were found in trochlear depth, trochlear groove medialization, lateral patellar facet angle, and Wiberg angle, whereas no sex differences were observed in the other measured parameters. Significant differences between the control and dislocation groups were found for all patellofemoral morphological parameters, except for trochlear facet asymmetry. A significant difference was observed in the patellar height parameter between the 2 groups. Among the measured patellofemoral parameters, patellar tilt showed the highest AUC (0.8). Of the optimized ML models, the LGBM demonstrated the highest AUC at 0.873. When the number of variables used in the SVM model, which employed the fewest variables, was applied to both LGBM and LRA, the SVM model achieved the highest AUC at 0.858.
Conclusion:
Patellar tilt and trochlear depth exhibited the strongest correlation with patellar dislocation. The optimized ML techniques showed improved AUC values compared with those of existing models. However, while a higher AUC can be achieved by using more variables, this approach has proven to be inefficient. Therefore, for practical clinical applications, it is important to focus on using the minimum number of variables in optimized ML models.
Keywords
Patellar instability is one of the most common causes of anterior knee pain in young adult patients.1,14 Several studies have demonstrated a correlation between anatomic features of the patellofemoral joint and patellar instability, which can lead to anterior knee pain and eventually patellofemoral joint arthritis.2,4,5,9,10,23 Trochlear dysplasia and tibial tubercle-trochlear groove distance are particularly well-known risk factors for patellar instability. 4 Once a patellar dislocation occurs, recurrent patellar dislocation is very common.8,12 Recurrent patellar dislocation often leads to the gradual development of focal cartilage lesions in the patellofemoral joint, increasing the risk for developing patellofemoral joint arthritis at a younger age. 2
Predicting the likelihood of patellar instability or recurrence is important, and previous studies have identified various combined risk factors for this condition.19,20,31 However, the biomechanics of the patellofemoral joint, which affect patellar stability, involve complex interactions among numerous contributing factors.13,24,31 Furthermore, research on anatomic factors affecting well-known factors, such as trochlear dysplasia and tibial tubercle-trochlear groove distance, remains limited. 13
Recently, machine learning (ML) algorithms have been increasingly applied in orthopaedics to extract essential information from large datasets, particularly radiologic images.25,29 These applications of ML demonstrated remarkable performance in diagnosis, decision-making, and predicting critical events. 29 ML algorithms, in particular, are effective at analyzing complex anatomic structures such as the patellofemoral joint. 21
Recently, Kwak et al 18 identified risk factors for acute patellar lateral dislocation using magnetic resonance imaging (MRI) measurements and achieved clinically relevant performance with an interpretable, model-based, optimized ML method in pediatric patients. 18 Therefore, applying ML algorithms to radiologic images is a potentially useful approach for analyzing patellofemoral anatomic structures that affect patellar instability.
This study aimed to develop reproducible ML to assess patellofemoral morphology and anatomic factors associated with recurrent patellar dislocation using MRI and to identify patients at risk of recurrent patellar dislocation through optimized ML methods. Moreover, we aimed to develop and compare 3 optimized ML models to determine which method achieves the highest area under the curve (AUC) with the fewest variables for predicting patellar dislocation in adults. We hypothesized that applying optimized ML would effectively predict the risk of recurrent patellar dislocation in young adult patients with fewer variables.
Methods
Study Population
This study was approved by the institutional review board (IRB) of our institution (4-2022-0366). Informed consent requirements were waived owing to the retrospective design. After IRB approval, we collected and reviewed MRI data from 194 patients >20 years who were diagnosed with acute patellar dislocation between January 2010 and April 2022. Patients who visited the outpatient clinic because of knee pain after minor trauma, such as a sprain, and had acute patellar dislocation confirmed on MRI were included in the study. Patients were excluded if they had any of the following: a history of knee dislocation or traumatic injury; previous knee surgery; fractures; tumorous conditions affecting knee morphology; or unsuitable MRI scan slices for measurement. After applying these criteria, 124 patients were included in this study. A total of 200 control patients were selected using stratified random sampling with proportional allocation from our MRI database. Strata were divided based on sex and age, with sex classified as male or female, and age group categories: 20-29, 30-39, 40-49, 50-59, and 60-69 years. Each stratum was then randomly selected using proportional allocation. The same exclusion criteria applied to patients with patellar dislocation were also applied to patients in the control group. Ultimately, data from 121 control patients were collected and compared.
Measurements of Patellofemoral and Patellar Height Anatomy
MRI was performed on a 1.5- or 3-T MR system (Achieva or AchievaTX Ellition, Philips Healthcare, Best; MR 750, GE Healthcare; Trio, Siemens Healthcare) using a dedicated multiphased array knee coil. The patients were transferred to the MRI unit and examined in the supine position.
The patellofemoral joint was assessed using sagittal and axial views, while patellar height was measured exclusively using the sagittal view. For sagittal measurements, we used the sagittal slice that displayed the maximum patellar length. For axial measurements, we selected the slice where the cartilage covered the entire trochlear area, including both the trochlea and the anterior aspect of the femoral condyle. In addition, the axial slice showing the most posterior aspect of the femoral condyles was employed to establish the posterior baseline and landmarks for condylar height measurements. To assess interobserver variability, the MRI scans were remeasured by 2 observers >4 weeks after the initial measurements. The patellofemoral morphological parameters assessed included trochlear depth,1,2,5,10 sulcus angle,1,2,4,5,10 trochlear facet asymmetry, 2 lateral trochlear inclination,1,2,5 trochlear medialization, 5 Wiberg index,4,5 lateral patellar facet angle, 10 Wiberg angle, 5 patellar tilt,5,9,10,14 and the Insall-Salvati ratio (IS).4,10 The measurement methodologies for each parameter are summarized in Table 1 and Figure 1.
Description of Morphological Parameters a
IS, Insall-Salvati ratio.

Schematic representation of the (A) trochlear depth, (B) sulcus angle, (C) trochlear facet asymmetry, (D) lateral trochlear inclination, (E) trochlear groove medialization, (F) Wiberg index, (G) lateral patellar facet angle, (H) Wiberg angle, (I) patellar tilt, and (J) IS. IS, Insall-Salvati ratio.
Statistical Analysis
Statistical analysis was conducted using R software Version 3.6.3 (R Foundation). The Student t test was employed to compare parameters between the control and dislocation groups, while the chi-square test was used to compare the proportions of pathology between these groups. To assess both inter- and intraobserver reliability, the intraclass correlation coefficient was calculated. Receiver-operating characteristic (ROC) curves were employed to determine the cutoff value with the highest sum of sensitivity and specificity. The performance of the ROC curves was assessed by the AUC, with higher AUC values categorized as follows: excellent (0.90-1), good (0.80-0.90), fair (0.70-0.80), poor (0.60-0.70), and failure to discriminate (0.50-0.60).
Machine Learning
Using the measured parameters, we developed ML models to investigate their diagnostic potential. We employed logistic regression analysis (LRA), support vector machine (SVM), and light gradient boosting machine (LGBM) as ML techniques. To identify parameter combinations that maximize the AUC, we used a genetic algorithm, treating each parameter as a binary gene. K-fold cross-validation was used to calculate the AUC for each iteration of the genetic algorithm. We selected the combination with the fewest parameters among the 3 models that showed optimal results. Subsequently, we assessed the performance of the remaining 2 models with the same number of parameters and compared the performance of all 3 models based on these results.
Results
Measurement Morphology Results
Radiologic measurements showed excellent reliability, with interobserver reliability ranging from 0.82 to 0.93 and intraobserver reliability ranging from 0.87 to 0.98. Of the 245 total patients, 128 were women and 117 were men.
Sex differences were observed in several measurements. The mean trochlear depth was 3.3 ± 1.6 mm in the female group and 3.7 ± 1.9 mm in the male group. The mean trochlear groove medialization was 40.4 ± 3.4 and 45 ± 3.5 in the female and male groups, respectively (P < .05). The mean facet angle was 24.1°± 4.2° in the female group and 22.2°± 4.6° in the male group (P < .05). The mean Wiberg angle was 134.8°± 8.6° in the female group and 140.4°± 8.1° in the male group (P < .05). However, no sex differences were found in other variables (Table 2).
Comparison of Morphological Parameters Between Female and Male Participants
Significant differences were observed between the control and dislocation groups in 9 of the measured patellofemoral parameters, except for trochlear facet asymmetry. The mean trochlear depth was 2.7 ± 1.8 mm in the dislocation group and 4.3 ± 1.3 mm in the control group (P < .05). Using a threshold of 3.7 mm, 77% of the dislocation group and 23% of the control group had a pathologic trochlear depth. The mean sulcus angles in the dislocation and control groups were 160.5°± 11.9° and 152.4°± 7.8°, respectively (P < .05). The mean trochlear facet asymmetry was 59.7% ± 17.7% in the dislocation group and 62.6% ± 12.8% in the control group, with no significant difference between the groups. In the dislocation and control groups, the mean lateral trochlear inclination angles were 10.7°± 6.1° and 14.4°± 4.7°, respectively (P < .05). The mean trochlear groove medialization was 41.8 ± 4.2 in the dislocation group and 43.4 ± 3.9 in the control group (P < .05) (Tables 3 and 4).
Comparison of Morphological Parameters Between Control and Dislocation Participants
Anatomic Patellofemoral Instability Risk Factors a
AUC, area under the curve; IS, Insall-Salvati ratio; OR, odds ratio.
The mean Wiberg index was 0.51 ± 0.09 in the dislocation group and 0.47 ± 0.06 in the control group (P < .05). The mean lateral patellar facet angle was 22.4°± 5.1° in the dislocation group and 24°± 3.6° in the control group (P < .05). In the dislocation and control groups, the mean Wiberg angles were 136.2°± 10.1° and 138.6°± 7.2°, respectively (P < .05). The mean patellar tilt was 19.2°± 11.1° in the dislocation group and 9.8°± 6.2° in the control group (P < .05). Using a threshold of 12.9°, 77% of the dislocation group and 27% of the control group exhibited a pathologic patellar tilt. In addition, the mean patellar height, measured by the IS ratio, was higher in the dislocation group than in the control group (1.15 ± 0.22 vs 1.10 ± 0.17, respectively; P < .05) (Tables 3 and 4).
Assessment of Diagnostic Tests Considering Anatomic Factors of Patellar Instability
Among the instability factors, the highest AUC values were observed for patellar tilt (0.8), trochlear depth (0.79), and sulcus angle (0.74). Conversely, the lowest AUC values were observed for the Wiberg angle (0.58), lateral patellar facet angle (0.59), and trochlear facet asymmetry (0.59) (Table 4). Using all these factors, we developed optimized ML models, including LRA, SVM, and LGBM. The AUCs for these models were 0.849, 0.832, and 0.859, respectively (Table 5).
Comparison of AUC and Optimized AUC Across Different ML Models a
AUC, area under the curve; LGBM, light gradient boosting machine; LRA, logistic regression analysis; ML, machine learning; SVM, support vector machine.
Using a genetic algorithm to optimize the AUC, we observed a notable improvement across all 3 ML models compared with their preoptimized versions (Table 5). After optimization, the parameter selection was as follows: LRA with 7, SVM with 3, and LGBM with 8 parameters. Trochlear depth, facet angle, and patellar tilt were consistently identified across all 3 models. Among these, the LGBM model demonstrated the highest optimized AUC. Notably, the SVM model achieved the highest AUC when using only 3 out of 10 parameters. Further analysis was conducted on the LGBM and LRA models using 3 parameters. Among the various combinations of 3 parameters, the combinations yielding the highest results were identified. In the LRA model, the combination of trochlear depth, patellar tilt, and IS ratio resulted in the highest AUC of 0.848. Similarly, in the LGBM model, the combination of trochlear depth, Wiberg index, and patellar tilt yielded the highest AUC of 0.848.
Discussion
The most important finding of this study is that integrating anatomic analysis using MRI with optimized ML techniques significantly improved the AUC value. Specifically, when comparing the 3 ML methods, the optimized approach enhanced the AUC while using fewer variables: LRA reduced the number of variables by 3, SVM by 7, and LGBM by 2 compared with standard ML methods. Notably, the optimized SVM achieved an AUC of 0.858 using only 3 variables, while the optimized LGBM reached an AUC of 0.873, with 8 variables, indicating only a small difference.
To further validate this, we applied the ML models using only 3 variables to LRA and LGBM, but SVM still outperformed the others with the highest AUC. Therefore, the optimized SVM method stands out as the most efficient approach for predicting patellar dislocation, making it suitable for clinical application.
Optimized ML research starts with the accurate measurement of morphological parameters from MRI scans of the patellofemoral joint. A meta-analysis of previous studies found that measuring knee patellar height and groove angle from MRI images is reasonably reliable. 27 However, the reliability of other measurements lacked sufficient evidence. 27 Our study contributes to this field by demonstrating reliable intra- and interobserver consistency in anatomic MRI measurements. Previous studies suggested that variations in trochlear dysplasia may stem from measurements taken at different locations.
The trochlea of the patellofemoral joint begins to engage when the knee is flexed at approximately 26 20°. Consequently, it is highly likely that the femoral trochlea will not be visible on the axial slice taken through the largest axis of the patella, particularly when the knee is fully extended.7,30 The high reliability observed in our study may be attributed to taking the measurements through the superimposition of multiple slices.18,33
Our analysis of sex differences in the patellofemoral joint revealed significant differences in trochlear depth, trochlear groove medialization, Wiberg angle, and facet angle, while no other parameter was significant. This finding aligns with that of a previous study, except for the difference observed in trochlear depth. 11 These differences can be attributed to individual morphometric traits, which are influenced not only by genetic, environmental, and cultural conditions but also by lifestyle, health, and functional status. These factors complicate the development of standardized measurements and the interpretation of common values.
The morphological features of the patellofemoral joint have been extensively studied to identify risk factors for various conditions, including patellofemoral joint cartilage damage,15,28 radiographic features of patellofemoral osteoarthritis, 16 pain and function, 17 and patellar dislocation.6,18 Our study found that trochlear depth and patellar tilt are the most important anatomic risk factors for patellar dislocation, which is consistent with recent research findings. 6 Chen et al 6 characterized patellofemoral morphology and identified major anatomic risk factors for recurrent patellar dislocation using computed tomography (CT) images. Their study showed that trochlear depth, the most frequently observed anatomic risk factor, exhibited the strongest relationship with recurrent patellar dislocation. Previous research also identified a significant relationship between patellar shape and patellar tilt under both static and dynamic conditions. 22 Recently, Yamada et al 32 found that patellar shift is moderately to strongly correlated with patellar tilt in 95% of patients with patellar dislocation. This finding aligns with our study, where patellar tilt exhibited the highest AUC of 80. When medial patellofemoral traction decreases or there is dysplasia of the vastus medialis obliquus, the patella tends to tilt and shift laterally. 22 This results in increased force on the lateral facet of the patella and reduced stress on the medial facet. Consequently, the medial facet may develop hypoplasia, leading to an elevated Wiberg index. 22 Our results support this mechanism, as the dislocation group exhibited a higher Wiberg index compared with the control group.
Recent research has increasingly used ML to analyze factors contributing to patellofemoral joint instability. 21 Nagawa et al 21 conducted a statistical shape analysis using 3-dimensional (3D) MRI to compare patellofemoral instability models with normal femur models and developed an ML-based prediction model. However, the clinical application of their approach is limited owing to the time-intensive process of creating 3D MRI models. Conversely, Kwak et al 18 optimized ML techniques to identify risk factors for patellar dislocation in pediatric patients using fewer variables from MRI analysis. However, their study reviewed only 1 ML method. Notably, Nagawa et al 21 and Kwak et al 18 used different ML models. In this study, we trained ML models using a total of 10 parameters, including LRA, SVM, and LGBM. Considering that using all parameters may not yield the best performance, we applied a genetic algorithm to identify the optimal parameter combination. The combination yielding the highest AUC varied across the different models. Specifically, LRA achieved the highest AUC with 7 parameters, SVM with 3 parameters, and LGBM with 8 parameters. The maximum AUC was achieved with LGBM at 0.873, followed by LRA and SVM at 0.858. Despite using only 3 parameters, SVM demonstrated notably high AUC performance, suggesting that fewer parameters could be advantageous in terms of time and cost.
To further investigate the benefits of using the SVM with 3 parameters, additional analysis was conducted. We calculated the highest AUC for all combinations using only 3 parameters for both the LRA and LGBM models, with each model yielding a maximum AUC of 0.848. Thus, the SVM consistently demonstrated the highest AUC when limited to 3 parameters.
To date, no studies have specifically compared various ML models for predicting dislocation based on patellofemoral parameters. The Askenberger study used the LRA model with 4 parameters but did not include model comparisons or validation, making it more of a statistical analysis rather than a comprehensive ML approach. 3 In contrast, our study demonstrates that employing a minimal, optimized SVM ML technique offers practical advantages and may facilitate easier application in clinical practice.
However, this study has some limitations. First, all measurements were taken with the knee in a static position, which may not fully capture dynamic patellofemoral mechanics. Second, limb alignment was not accounted for owing to the lack of CT and whole-lower-leg radiographs. The retrospective design limited data on lower limb alignment, femoral anteversion, and external tibial torsion, which are relevant to patellar instability. Furthermore, tibial rotation and knee flexion angle were not controlled during the MRI scans, although the study was conducted under consistent conditions. Third, the parameters, including the modified Insall-Salvati, Caton-Deschamps, Blackburne-Peel, and the tibial tuberosity to trochlear groove distance, could not be incorporated into the study. Although we attempted to measure these dimensions, it was not feasible to include them because of the retrospective nature of the study and the limitations of the imaging data. Nevertheless, our study holds significant clinical relevance, as it aims to investigate the applicability of ML in the diagnosis of patellar disorders. Fourth, we did not distinguish between first-time and recurrent patellar instability patients, and the retrospective nature of this study does not allow us to establish clear diagnostic criteria for patellar instability. Although we assumed that the recorded diagnoses were accurate, this assumption inherently carries a risk of bias. Fifth, the timing of imaging could have influenced the presence or absence of joint effusion, which in turn may have affected patellar tilt measurements. Because patellar tilt was a key distinguishing factor between the cohorts, variability in effusion status at the time of imaging could have introduced additional bias. Sixth, the threshold values for the parameters were determined from radiological measurements, and additional biomechanical and clinical evidence is needed to make these values clinically actionable. Finally, while 3 optimized ML methods were compared, future research should consider additional methods that might offer better performance and faster results.
Conclusion
A notable difference in morphological parameters was observed between the control and dislocation groups, with patellar tilt and trochlear depth exhibiting the strongest correlation with patellar dislocation. This study introduces a novel, clinically accessible approach to identifying risk factors for patellar dislocation using MRI measurements and optimized ML techniques. Compared with existing models, the optimized ML model demonstrated the ability to predict patellar dislocation with fewer variables. Although models with higher AUC values were achieved, using an excessive number of variables proved inefficient. For practical clinical applications, it is important to consider using a minimum number of variables in optimized ML models.
Footnotes
Final revision submitted March 6, 2025; accepted April 7, 2025.
The authors have declared that there are no conflicts of interest in the authorship and publication of this contribution. AOSSM checks author disclosures against the Open Payments Database (OPD). AOSSM has not conducted an independent investigation on the OPD and disclaims any liability or responsibility relating thereto.
Ethical approval for this study was obtained from Severance Hospital, Yonsei University, College of Medicine (4-2022-0366).
