Abstract
Introduction
Sarcopenia (low muscle mass and strength) causes dysmobility and loss of independence. Sarcopenia is often not directly coded or described in electronic health records (EHR). The objective was to improve sarcopenia detection using structured data from EHR.
Methods
Adults undergoing musculoskeletal testing (December 2017–March 2020) were classified as meeting sarcopenia thresholds for 0 (controls), ≥1 (Sarcopenia-1), or ≥2 (Sarcopenia-2) tests. Electronic health record diagnoses, medications, and laboratory testing were extracted from the Indiana Network for Patient Care. Five machine learning models were applied to EHR data for predicting sarcopenia.
Results
Of 1304 participants, 1055 were controls, 249 met Sarcopenia-1 and 76 met Sarcopenia-2. Sarcopenic participants were older, with higher fat mass, Charlson Comorbidity Index, and more chronic diseases. All models performed better for Sarcopenia-2 than Sarcopenia-1. The top performing models for Sarcopenia-1 were Logistic Regression [area under the curve (AUC) 71.59 (95% confidence interval [CI], 71.51–71.66)] and Multi-Layer Perceptron [AUC 71.48 (95%CI, 71.00–71.97)]. The top performing models for Sarcopenia-2 were Logistic Regression [AUC 91.44 (95%CI, 91.28–91.60)] and Support Vector Machine [AUC 90.81 (95%CI, 88.41–93.20)]. For the best Logistic Regression Model, important sarcopenia predictors included diabetes mellitus, digestive system complaints, signs and symptoms involving the nervous, musculoskeletal and respiratory systems, metabolic disorders, and kidney or urinary tract disorders. Opioids, corticosteroids, and antihyperlipidemic drugs were also more common among sarcopenic participants.
Conclusions
Applying machine learning models, sarcopenia can be predicted from structured data in EHR, which may be developed through future studies to facilitate large-scale early detection and intervention in clinical populations.
Introduction
Sarcopenia describes a state of low muscle mass and strength, which can occur both with aging,1,2 and in younger and middle-aged adults with chronic diseases.3,4 Sarcopenia is multifactorial in etiology and has features overlapping with cachexia and frailty.1,5,6 Sarcopenia is estimated to affect 10% of older adults across the world7,8 contributing to dysmobility, disability, loss of independence, and hospitalizations with attendant healthcare costs.9–12
There is an unmet clinical need to detect sarcopenia across the life span to implement interventions and improve health outcomes. Current detection methods for sarcopenia with objective measures of muscle mass, strength, and performance, use test thresholds primarily based on studies in older adults who typically exhibit a synchronous decrease in muscle mass and strength. 13 However, sarcopenia also occurs at younger ages and in disease states where loss of muscle mass and strength may not always occur in tandem.14,15 Large electronic health records (EHR) databases provide an opportunity to detect diseases among populations of patients and facilitate clinical care, but have thus far not been used to detect sarcopenia, due to lack of formal assessments in routine clinical reports. For example, lean body mass is rarely quantified or reported when dual-energy x-ray absorptiometry (DXA) is typically performed for assessing bone density. Even when lean body mass or assessments of strength and physical performance are conducted, they may not be easily extractable from the EHR as a specific field or coded variable.
We previously used a combination of ICD codes and text terms to detect 9594 patients with sarcopenia, cachexia and/or frailty from the EHR. 16 Notably the sarcopenia ICD-10 code and the text term sarcopenia were highly specific (>90%) but had poor sensitivity for detecting features suggesting sarcopenia. 16 Additionally, in the absence of an objective gold standard, EHR detection of sarcopenia may be limited by false positive and false negative occurrences.
The purpose of this study was to develop a model to detect sarcopenia by applying machine learning methodologies to structured EHR variables (ICD codes, medications, laboratory tests, etc.) of a large group of adult participants recruited to undergo standardized research measures of muscle mass, strength, and physical function (external to their EHR data) which were used as our standard for the presence of sarcopenia. Machine learning studies have not been applied to sarcopenia studies in EHR in the past. Our study provides a novel approach to identifying risk factors predicting sarcopenia from EHR data. However, EHR-based studies often do not have a true external gold standard, while our study is made possible by utilization of our standard measurements to detect sarcopenia from our research core, which are external to our EHR.
Methods
Study cohort
The study cohort consisted of adult participants (aged 18 years and older) recruited into the Indiana Center for Musculoskeletal Health's Musculoskeletal Function, Imaging and Tissue Resource Core (FIT Core) protocol from December 2017 to March 2020 and having EHR data accessible in the Indiana Network for Patient Care (INPC). The INPC contains EHR data from more than 120 Indiana hospitals and healthcare entities, including Indiana University Health and Eskenazi Health. The FIT Core has Institutional Review Board approval from Indiana University to test any participants who provide written informed consent, broadly including healthy controls and persons with any disease state. The FIT Core informed consent includes access to and analysis of participant EHR data. FIT Core testing includes body composition, bone densitometry, and physical function tests (detailed below). The use of FIT Core data and access to and analysis of EHR data for the current study was also separately approved by the Institutional Review Board of Indiana University.
Data from EHR and the FIT core
The EHR data of the FIT Core participants were extracted for available clinical encounters in INPC between January 2016 and August 2020, with the goal of providing 2 years of EHR data around each participant's FIT Core visit. The EHR data elements included all available demographic data, diagnosis codes, laboratory test results, medication records, and inpatient and outpatient procedures. To ensure adequate EHR data for analysis, the numbers of structured variable occurrences and clinical text notes per participant were determined, then analysis was restricted to those having above the group's 25th percentile for both the total numbers of structured variable occurrences (n ≥ 122) and the number of clinical text notes available per participant (n ≥ 8). Demographic variables were obtained from FIT Core records but were not used for prediction modeling. Body mass index (BMI) was also obtained from FIT Core records.
Participants were categorized as sarcopenic or nonsarcopenic (controls) according to their testing in the FIT Core, using grip strength,17,18 repeat chair stand test, 17 and the Short Physical Performance Battery,19,20 and appendicular skeletal muscle mass adjusted for height in meters2 (ASM/m2) as measured by DXA (see Supplemental Methods). Participants were classified using definitions and thresholds for sarcopenia from the EWGSOP2 guidelines (see Supplemental Methods). 13 For the primary analyses, we categorized persons as controls if meeting none of the criteria thresholds, and as sarcopenic if meeting ≥1 criteria thresholds for sarcopenia (Group Sarcopenia-1) or if meeting ≥2 criteria thresholds for sarcopenia (Group Sarcopenia-2).
Sample size and power
The study used a convenience sample of all adult participants recruited to date in the overall FIT core protocol that had sufficient EHR data as described above, and then the sarcopenia parameters were extracted. Thus, power was calculated based on this available sample size. Given that the resulting cohort was categorized as 1055 Non-sarcopenia, 249 Sarcopenia-1 and 76 Sarcopenia-2, the resulting positive and negative case sample sizes for machine learning analysis has 95% power to gain area under the curve (AUC) of 0.7 or more (with 0.5 as the null hypothesis) using alpha of 0.05.
Variable selection and processing
Supplemental Methods and Tables describe details on processing and groupings applied to the structured data from the EHR, such as diagnosis codes (ICD-10), laboratory test results, and medications, which were included as variables for prediction modeling. Diagnoses were grouped into categories by the third level to the leaf nodes of ICD-10 code hierarchy. 21 The Charlson Comorbidity Index was calculated from ICD-10 codes as a marker of disease burden.22,23 Missing laboratory tests or other data were not imputed.
Analysis
Data were summarized as frequency and proportion or as mean and standard deviation. The Wilcoxon sum-rank test was used to assess the differences for individual variables between the sarcopenic and control groups. P values <0.05 were considered statistically significant. For machine learning model comparisons, the performance of the different models can be compared using one-way ANOVA test. We ran 10-fold cross-validation on the training data to gain the AUC values of each model and applied the one-way ANOVA test to investigate whether there is any significant difference between the model performances to select which models to use on test sets. Since no comparisons had p < 0.05 between models, the numerically best performing models (highest AUC achieved) of each machine learning method were used.
In the study cohort, there were more controls than sarcopenic participants. We split the cohort proportionally into a training (70% of the data), validation (10% of the data), and a test set (20% of the data). The validation set was used to fine-tune the parameters of the models, and the test set was used to evaluate the performance of all models. We developed and compared six different machine models—Support Vector Machine (SVM), 24 Logistic Regression (LR), 25 Random Forest (RF), 26 Gradient Boosting, 27 extreme gradient descent boosting (XGBoost),28,29 and Multilayer Perceptron neural network. 30 Each model has been tuned through hyperparameter search to optimize the performance.
We considered all coded EHR variables for inclusion in machine learning models and then investigated the important variables driving the model decisions. We used SHAP (SHapley Additive exPlanations) 31 to assign an importance value to each variable of the best performing model and explain which variables have greater impact on the output. SHAP has been used to explain machine learning prediction models for many applications, such as prevention of hypoxemia during surgery. 32 We sorted the variables by the sum of SHAP value magnitudes over all samples, drew the variable importance curve and applied the elbow method to select the top variables to interpret the results. The elbow method 33 was originally developed to identify the optimal number of clusters for an analyzed data set. The objective of the elbow method is to find an explicit elbow point in a curve so that before the point, the value rapidly decreases, and after exceeding the point, the decrease plateaus.
Results
From 2209 participants enrolled in the FIT Core as of the cutoff date, 1988 had any structured data and 1981 had any clinical text note data within the EHR, with 1304 meeting the final criteria for amount of EHR data available for analysis. After employing the variable processing steps and groupings, a total of 345 structured EHR variables were available (206 different diagnosis grouping categories, 67 laboratory test variables, 71 medication variables, and the BMI). From the included participants, 1055 met no criteria for sarcopenia (controls) and 249 met ≥1 of the sarcopenia test thresholds (Sarcopenia-1); 76 of Sarcopenia-1 met ≥2 of the sarcopenia test thresholds (Sarcopenia-2). Table 1 shows the demographic and comorbidity characteristics of the study cohort.
Characteristics of the study cohort.
Continuous variables are shown as mean (SD), and frequencies as n (%). P value (1) indicates comparison with patients meeting ≥1 criteria for sarcopenia vs. controls. P value (2) indicates comparison with patients meeting ≥2 criteria for sarcopenia vs. controls.
The average age of the sarcopenic groups was higher than the control group. The sarcopenia groups had a higher percentage of African Americans, more males, higher BMI, and total body fat mass (p < 0.01). As expected by the sarcopenia threshold definitions, the mean sarcopenia test results differed between groups with sarcopenia and controls. Sarcopenia-1 had a lower mean BMD Z-score than controls at the Total Body, Total Hip, and Femur Neck, but not at the Spine. Sarcopenia-2 had lower mean BMD Z-score than controls at the Total Body and Femur Neck, but not at the Total Hip or Spine.
Multiple chronic diseases including those often associated with sarcopenia were present in higher proportions in the sarcopenia groups than in controls (Table 1). Additionally, the sarcopenia groups had higher Charlson Comorbidity Index and a higher proportion of patients with Charlson Comorbidity Index ≥2 (p < 0.01). Participants with sarcopenia also had more structured EHR variables.
Model performance
We built predictive models using machine learning separately for Sarcopenia-1 and Sarcopenia-2 (versus controls). Table 2 shows the AUC and Brier scores with confidence intervals (CIs) generated based on both the training and test datasets when different predictive machine learning models were applied to each cohort. Figure 1 shows the AUC-ROC curves and calibration slopes for the test dataset for Sarcopenia-1 and Sarcopenia 2. The mean AUC differences between best results on the different machine learning model methods were quite small in magnitude (6.3% or smaller), and not significantly different based on ANOVA (p > 0.05). None of the machine learning methods performed significantly better than the others on either Sarcopenia-1 or Sarcopenia-2. All models achieved better performance for predicting Sarcopenia-2 versus controls, than for predicting Sarcopenia-1 versus controls. The top performing models on the test data set for Sarcopenia-1 were LR and Multi-Layer Perceptron which gained AUC 71.59 (95% CI, 71.51–71.66) and AUC 71.48 (95%CI, 71.00–71.97), respectively. The top performing model on the test data set for Sarcopenia-2 was LR and SVM which gained AUC 91.44 (95%CI, 91.28–91.60) and AUC 90.81 (95%CI, 88.41–93.20), respectively. The best Brier score gained on test data of Sarcopenia-1 were 0.148 (95%CI, 0.146–0.150) and 0.154 (95%CI, 0.139–0.169) when XGBoost and LR were used, respectively. The best Brier score gained on test data of Sarcopenia-2 were 0.042 (95%CI, 0.037–0.047) and 0.043 (95%CI, 0.041–0.044) when SVM and RF were applied, respectively. With both methods of assessing the models, the machine learning model accuracy of classification was greater for Sarcopenia-2 than for Sarcopenia 1.

Performance of machine learning models in the test set for sarcopenia-1 (≥1 sarcopenia criteria) vs. Control (A) area under the curve (AUC) of ROC and (B) Brier Score Calibration Curves; and for Sarcopenia-2 (≥2 sarcopenia criteria) vs. Controls (C) AUC of ROC and (D) Brier Score Calibration Curves.
Comparison of ROC AUC and Brier scores.
For Brier score, lower is better and <0.11 would be excellent. (The columns marked Test area under the curve [AUC] and Test Brier correspond to the graphs in Figure 1.)
Prediction model interpretation
We applied SHAP to interpret the important variables driving the prediction of the best performing model (LR model with the highest AUC on the test set for Sarcopenia-2). Applying the elbow method (Supplemental Figure 1) to determine the most impactful variables, the top 31 variables with the highest SHAP values were selected (Table 3). The selected variables show statistically significant differences between the Sarcopenic-2 and control groups (with differences for all but one variable p < 0.01). Several chronic condition categories were more represented in the sarcopenia group including: diabetes mellitus, upper gastrointestinal disorders and other digestive system complaints, signs and symptoms involving general systems, the nervous and musculoskeletal systems, the respiratory system, metabolic disorders, and kidney and other urinary tract disorders.
Statistical summary of the selected 31 variables with the highest SHAP values from the logistic regression analysis of participants meeting ≥2 sarcopenia criteria.
Total for this table is only those in the Sarcopenia-2 and control groups.
Variables are listed in order of importance within each of the categories (diagnosis, medication groupings, and clinical measurements).
Regarding medication categories, participants with sarcopenia criteria were also more likely to be prescribed opioid analgesics, corticosteroids, antihyperlipidemic drugs, as well as anti-asthmatic/bronchodilators, antiemetics, and miscellaneous endocrine/metabolic agents (each p < 0.01). The drugs listed under the NDC classification “endocrine and metabolic agents (miscellaneous),” consisted mostly of hormone receptor modulators and bone density medications. The main hormone receptor modulators were selective estrogen receptor modulators which were present in none of the sarcopenic group and 17 (1.61%) of the control group (p = 0.1327). Bone density regulators were used by 16 (21.05%) in the sarcopenic group and 49 (4.64%) of the control group (p < 0.01), most of which were bisphosphonates [sarcopenia group 15 (19.74%) and control group 41 (3.89%), p < 0.01). Parathyroid hormone agonists were administered to none of the sarcopenia group during this period and only 6 (0.57%) of the control group.
In SHAP analysis, some laboratory measures contributed among the top variables in the model. Participants with sarcopenia had more hyperglycemia, but less documented hypoglycemia on laboratory testing. Differences were also seen in the proportion with low hemoglobin (more in sarcopenia), low HDL (more in sarcopenia) and high LDL (less in sarcopenia). Triglyceride measurements were not among the top 31 variables in SHAP analysis (it ranked number 45). There was no difference between groups in the proportions having high or low triglycerides. However, a higher proportion of the sarcopenia group had low HDL [19 (25%)] than in the control group [(77 (7.3%)], and patients with sarcopenia were more likely to have the combination of high triglycerides with low HDL [19 (25.00%)] than the control group [64 (6.06%)] (p < 0.01). Participants with sarcopenia were more likely than controls to have recorded either high or low QRS interval.
Discussion
In this study of 1304 individuals undergoing measurements of muscle function and lean mass, 19.1% met at least one criterion for sarcopenia and 5.8% met at least two criteria. Using only structured data from their EHR and machine learning approaches, we were able to identify patterns of structured variables that were highly predictive of sarcopenia. Models with mean AUCs over 90% were achieved across test data using multiple approaches, with similar overall AUC between models. Each of the machine learning methods performed better at predicting Sarcopenia-2 than Sarcopenia-1 vs. controls. Sarcopenia is often undercoded and undetected; indeed, only one participant had the sarcopenia ICD code, and none had the related codes cachexia or frailty.
The mean age of sarcopenic patients was older than that of controls but was still relatively young (58 years). This study highlights the importance of chronic diseases as risks for sarcopenia among the non-elderly population3,4 and the importance of recognizing early signs of sarcopenia at younger ages to support preventative interventions to preserve mobility.
As expected, the Charlson Comorbidity Index was higher and several chronic conditions were more common in the sarcopenia groups than in controls. These included diabetes, renal diseases, malignancy, osteoporosis, and pulmonary, cardiac, and liver diseases. While all diagnosis groups were considered for models, in the highest performing models most of these were not among the top importance variables by SHAP for predicting sarcopenia. The most important diagnosis group variables were diabetes, upper gastrointestinal diseases, signs and symptoms for the nervous and musculoskeletal systems, thyroid disorders, and kidney disease along with circulatory and respiratory symptoms and disorders. Malignancy was more common among our sarcopenia group than controls and also is commonly clinically associated with sarcopenia and cachexia, but was not one of the top model variables detected by SHAP. Malignancies, with SHAP of 0.7228 (ranked 83 among all variables) were far below the cutoff from the elbow method. One possible reason for this is the overall small proportion of participants in this study with any malignancy.
Sarcopenia can be present even among obese patients, 34 where it is often missed clinically. Sarcopenia, especially in combination with obesity, has also been associated with the metabolic syndrome. 35 In our population, those with sarcopenia had higher BMI, and higher total body fat mass, despite lower height-adjusted appendicular lean mass. More participants with sarcopenia also had general metabolic abnormalities as indicated by diabetes, kidney disease, and ICD codes for metabolic disorders. Indeed, glucose value and diabetes diagnosis had the highest overall SHAP values. Sarcopenic participants were more likely to have low HDL in combination with high triglycerides, features of the metabolic syndrome often present with diabetes. Similarly, a study of Korean adults also indicated those with sarcopenia had more metabolic syndrome, higher triglycerides, and lower HDL than nonsarcopenic controls. 35 In contrast, in the West China Health and Aging Trend study, sarcopenic patients had lower triglycerides but higher HDL compared to controls. 36 In our study, we also note that fewer participants with sarcopenia than controls had high LDL on laboratory measurements, which might have been influenced by the greater proportion of sarcopenic participants already receiving antihyperlipidemic drugs.
The presence of low hemoglobin level was also among the top variables in our model. More patients with sarcopenia had low hemoglobin levels, which can be a marker of nutritional status, similar to albumin. This is consistent with studies that have shown associations between anemia and sarcopenia, slow gait speed and low muscle strength in older adults,37,38 as well as post-stroke 39 or with malignancies. 40
Machine learning models are data-driven approaches and depend on the information available within the EHR. Additional variables from unstructured text notes describing patients’ mobility may be useful to further improve performance. We used EHR data available from a multi-institution health data exchange, but different variables may be more important for models built on different individual EHR systems.
Strengths of our study and approach include standardized measurements of muscle strength and function, inclusion of a broad age range and >1000 adults, and combining “gold-standard” sarcopenia assessments with robust multi-health system EHR data (rather than relying on a single health system EHR). Our approach to exclude subjects who did not have sufficient EHR data ensured that robust structured EHR data were available for both cases and controls. This was likely to exclude some healthy control participants, who may have less interaction with the healthcare system. An additional strength was the use of multiple robust and unbiased machine learning models, all of which demonstrated consistent AUC within a narrow range, in support of the method. This allowed the detection of the most important predictors from the large structured EHR data.
Some weaknesses of our study include that a limited total number of patients met criteria for sarcopenia. Because of this, we performed analysis based on participants having either 1 or more criteria for sarcopenia or 2 or more criteria for sarcopenia. Thus, some of our patients were perhaps more mildly affected, which might have decreased the differences between groups. Additionally, it should be noted that factors beyond muscle influence performance on standardized muscle tests, including pain, joint dysfunction, neurologic, cardiac, and pulmonary function. In this study, our sample size was not large enough to evaluate the performance in different demographic, racial/ethnic, or disease-based populations. Finally, while we have demonstrated internal validation by dividing the cohort into training and test sets, we did not have an external cohort with which to validate the findings. External validation will be required in future studies of external datasets and different EHR systems to confirm the ability to accurately predict sarcopenia and demonstrate utility for screening purposes in various populations.
Sarcopenia and frailty are likely to be related in an individual, as both increase with aging, and many of the tests used in sarcopenia represent components of frailty. We designed this analysis specifically around sarcopenia or a loss of skeletal muscle mass, strength, and function, whereas frailty may more broadly include states of declining energy, disability, symptoms, and comorbidities resulting in overall decompensation of the individual.41–43 However, there is overlap between these conditions, as also noted in our prior publication evaluating EHR-coded diagnoses and simple text terms for sarcopenia, frailty, and cachexia. 16 While to our knowledge, there have not been prior electronic sarcopenia instruments to identify sarcopenia from EHR, other authors have reported EHR-based electronic frailty indexes, focused on either collections of preconceived concepts for contributors to frailty or on the accumulations of deficits in relation to aging, defined by ICD codes, and laboratory tests or on focused interview and examination results.42–44 Using this accumulated deficit concept to address frailty, electronic frailty indices have been developed to predict various outcomes including fracture, hospitalization, mortality, and other healthcare utilization.42,44 Some frailty models required specific history to be obtained regarding activities of daily living and various symptoms, which also may not be consistently obtainable from the structured data within the EHR.41–43
However, these previous electronic frailty index tools were not devised based on actual gold standard outcome measures of body composition or muscle function tests as the outcome of interest. This is not surprising as the necessary measurements are often not conducted clinically, and even when performed, the results are recorded usually in a manner not readily extracted from most EHR systems. In contrast, our models were developed based on all participants having undergone standardized measures through a research protocol, allowing us to develop EHR-based models for these outcomes.
Long-term studies of the natural history of sarcopenia have been hampered by inconsistent definitions, which led to relatively recent alterations of consensus definitions. 45 However, given that the sarcopenia prevalence increases with advancing age, and that some sarcopenia feature measures predict the development of other sarcopenia features, it is likely that the natural history of sarcopenia is to worsen over time. Multiple studies have indicated that resistance training is likely beneficial in treatment of sarcopenia,46,47 though with heterogeneity in the duration and delivery of the intervention as well as the magnitude of results. Additionally, multiple studies of various nutrition supplementation (leucine, whey, casein, omega 3 fatty acids, collagen peptides, and others) also show benefit in small trials, especially when combined with resistance training.48–51 The development of EHR-based tools to identify and recruit sarcopenic patients early may be useful to support targeted large-scale interventions in the future.
Future directions include testing of our models in external EHR cohorts having similar standardized muscle function measures, incorporation of unstructured text notes to further improve detectability of sarcopenia in special patient populations and to use machine learning to generate algorithms that could then be applied to EHR data to prospectively identify sarcopenia patients for selection for targeted interventions.
Conclusion
Sarcopenia can occur in individuals throughout their life span. While standardized muscular testing is needed to define sarcopenia, machine learning can effectively detect combinations of variables from within the EHR that are predictive of sarcopenia. This method may provide opportunities in the future for targeted selection of individual patients for more detailed sarcopenia testing and clinical intervention.
Supplemental Material
sj-docx-1-dhj-10.1177_20552076231197098 - Supplemental material for Using machine learning to detect sarcopenia from electronic health records
Supplemental material, sj-docx-1-dhj-10.1177_20552076231197098 for Using machine learning to detect sarcopenia from electronic health records by Xiao Luo, Haoran Ding, Andrea Broyles, Stuart J Warden and Ranjani N Moorthi, Erik A Imel in DIGITAL HEALTH
Supplemental Material
sj-docx-2-dhj-10.1177_20552076231197098 - Supplemental material for Using machine learning to detect sarcopenia from electronic health records
Supplemental material, sj-docx-2-dhj-10.1177_20552076231197098 for Using machine learning to detect sarcopenia from electronic health records by Xiao Luo, Haoran Ding, Andrea Broyles, Stuart J Warden and Ranjani N Moorthi, Erik A Imel in DIGITAL HEALTH
Footnotes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
