Abstract
A stringent outcome assessment is a key aspect of establishing evidence-based clinical guidelines for anterior cruciate ligament (ACL) injury treatment. To establish a standardized assessment of clinical outcome after ACL treatment, a consensus meeting including a multidisciplinary group of ACL experts was held at the ACL Consensus Meeting Panther Symposium, Pittsburgh, Pennsylvania, USA, in June 2019. The aim was to establish a consensus on what data should be reported when conducting an ACL outcome study, what specific outcome measurements should be used, and at what follow-up time those outcomes should be assessed. The group reached consensus on 9 statements by using a modified Delphi method. In general, outcomes after ACL treatment can be divided into 4 robust categories: early adverse events, patient-reported outcomes (PROs), ACL graft failure/recurrent ligament disruption, and clinical measures of knee function and structure. A comprehensive assessment after ACL treatment should aim to provide a complete overview of the treatment result, optimally including the various aspects of outcome categories. For most research questions, a minimum follow-up of 2 years with an optimal follow-up rate of 80% is necessary to achieve a comprehensive assessment. This should include clinical examination, any sustained reinjuries, validated knee-specific PROs, and health-related quality of life questionnaires. In the midterm to long-term follow-up, the presence of osteoarthritis should be evaluated. This consensus paper provides practical guidelines for how the aforementioned entities of outcomes should be reported and suggests the preferred tools for a reliable and valid assessment of outcome after ACL treatment.
The evolution of evidence-based medicine is considered one of the most important paradigm shifts in modern medicine, 28,104 for which conduction of high-quality research is fundamental. Anterior cruciate ligament (ACL) injuries are among the most studied in the field of orthopaedics and sports medicine, with over 25,000 publications available in the PubMed database up to mid-2019. Despite ongoing research and advancements in treatment regimens for ACL injuries over the past decades, the goal of restored knee function and preserved long-term knee-related health remains a challenge. Reinjury rates are high, especially among the young and active, 117,120 and the high rate of subsequent development of posttraumatic osteoarthritis (OA) is worrying. 1,21,81,86 In the best interest of our patients, a deepened understanding of how to optimize an individualized approach to ACL injury treatment is needed. One important part of this process is to strive for a standardized and homogeneous research methodology of clinical outcome assessment after ACL treatment.
A rigorous outcome assessment after ACL injury is a key aspect for determining the clinical efficacy and effectiveness of treatment. It can also identify modifiable and nonmodifiable predictors of good and poor outcome, which provide valuable insights for the patient’s prognosis and should be discussed in the context of shared decision making for the treatment choice after ACL injury. Moreover, a standardized outcome assessment and reporting of data are required for comparisons between studies and for pooling of data in meta-analyses to provide the highest level of evidence-based medicine. Current literature related to ACL treatment is limited by the fact that no consensus exists on how to assess and report clinical outcome. There is a wide range of validated outcome assessment tools for ACL treatment. Although each of these outcome measures may offer certain advantages and the patient’s perspective of outcome should always be evaluated, caution must be taken to ensure that outcome measures accurately capture patient-centered and clinically relevant outcomes for an ACL-injured patient. Another debated area in ACL outcome assessment is the use of ACL graft failure as an endpoint for research. This is highly relevant to the patient; however, there is no universally accepted definition of graft failure utilized in the literature. Moreover, the lack of a consistent approach as to the timing of when outcomes should be measured after treatment and how such measures are reported makes appraisal of the current literature challenging, which limits the recommendations for the patient’s best possible care.
As the body of evidence on ACL treatment grows, there is an urgent need to reach consensus on how clinical outcome should be assessed and reported. Surgeons and researchers should strive to create optimal conditions for appraisal of the cumulative evidence regarding ACL treatment, thereby promoting an evidence-based approach by using outcome measures that are reliable, valid, responsive over time, and comparable. Therefore, a multidisciplinary group of experts was assembled for an international consensus meeting aiming to establish a standardized approach to clinical outcome assessment for patients receiving ACL treatment, that is, both operative and nonoperative treatment. 79 The purpose of this article is to provide the results from the consensus meeting in terms of what outcomes should be reported when conducting an ACL outcome study, the recommended outcome measurements, and at which follow-up time points those measurements should be used.
Methods
A multidisciplinary panel of national and international experts in ACL injury, including orthopaedic surgeons, physical medicine and rehabilitation physicians, physical therapists, and scientists, was convened in a 1-year consensus-building effort, which culminated in the ACL Consensus Meeting Panther Symposium held at the University of Pittsburgh and University of Pittsburgh Medical Center in Pittsburgh, Pennsylvania, USA, in June 2019. The symposium included delegates from 18 countries encompassing 6 continents. The working group of this topic consisted of 25 participants.
A list of 13 statements on clinical outcomes was drafted by the steering committee of the meeting based on current literature and controversies in clinical outcome assessment. The consensus group members completed an online survey addressing the 13 statements before the consensus meeting. The initial statements and corresponding responses are found in the supplementary material (Appendix).
A modified Delphi consensus discussion for each of the 13 statements was subsequently held at the in-person consensus meeting. The session was moderated by 2 senior researchers (K.E.W. and J.K.). Each statement was discussed and revised by the working group, after which a vote on agreement with the statement was performed. No count was held on the number of roundtables, but discussion was continued until consensus was met for each statement. A majority of 80% agreement was determined a priori as being a satisfactory level of consensus. Opposing views were documented, and it was determined that those statements for which 80% agreement was not achieved should be discussed in the paper, noting the percentage of agreement and accompanied with the discussion held during the meeting. Statements that the panel determined as irrelevant, redundant, or overlapping with another statement were either excluded or combined with the overlapping statement. Statement 2 in this consensus paper was combined from 2 original statements (originally statements 10 and 11 in the online survey [Appendix]) because these were considered as overlapping. There was 100% agreement for the original statement 10, and when proceeding to discussion and voting on the original statement 11, the panel instead agreed to combine statements 10 and 11 into one. However, no formal voting was undertaken for the finalized combination of the two. Thus, the percentage of agreement for statement 2 in this consensus paper could not be reported.
This working group was assigned 2 liaisons (E.S. and E.H.S.), who were responsible for amending each statement as requested over the course of the discussion. Liaisons transcribed the discussion, performed data analyses, and subsequently completed a MEDLINE literature review for each finalized statement. To reduce the potential for bias in the data analysis and/or literature review, liaisons did not submit answers to the online questionnaire, nor did they partake in the voting process. A description of the consensus process is presented in Figure 1, and a list of definitions used at the consensus meeting for the specific statements is provided in Table 1.

The process of the consensus project. ACL, anterior cruciate ligament.
Operational Definitions
Consensus Statements and Discussion
Of the 13 statements discussed by the working group, 9 achieved consensus, and 4 were excluded because these were considered to include information similar to ≥1 of the other statements. Thus, some of the 9 statements achieving consensus were slightly modified to include aspects from the 4 excluded statements. The 9 final statements, with supporting literature review, are presented below. These statements are presented in 3 main sections for readability purposes: (1) planning for outcome assessment, (2) clinical outcome assessment, and (3) patient-reported outcome (PRO). An overview of the consensus statements is presented in Table 2.
Summary of the Consensus Statements for Clinical Outcome Assessment After ACL Injury
Section 1: Planning for Outcome Assessment
1. A priori power calculation of sample size in relation to the primary endpoint must be performed and reported to avoid underpowered studies (25/25; 100% agreement)
A critical point when evaluating a study outcome is to ensure that the sample size is large enough to detect a difference when a true difference in fact exists. Otherwise, the study may be underpowered and subject to beta error (type II error). This can have serious consequences on clinical practice if no difference in outcome is concluded to exist between, for example, 2 interventions, even though one of the interventions is truly inferior, or superior, compared with the other. Ultimately, underpowered studies fail to identify the best possible care for our patients. Approximately two-thirds of randomized controlled trials related to ACL reconstruction failed to report an a priori sample size calculation. 4,94 Although a more recent assessment of the literature shows that these numbers have substantially improved since 2009, 54 improvements can be made. A study should have a power of at least 80% (1 – β), which means that the risk of a type II error, or false negative result, is 20%. A priori power calculation helps to ensure that the sample size will be large enough to minimize the risk of type II error. The power calculation should be determined for the primary patient-centered endpoint, meaning that if an endpoint is chosen that has a low event rate, the study sample size will need to be larger than if one expects that many patients will reach the endpoint. The sample size calculation therefore aids in the determination of feasibility and will help reduce the rate of incomplete studies and wasted resources. It is also an ethical responsibility to perform a sample size calculation because it is unethical to include substantially more patients than necessary. In relation to large registry studies, a power calculation may be redundant, but this can depend on the outcome. It is therefore recommended that a statement on power always should be included. A sample size calculation should be performed whenever possible before the start of the study. However, a post hoc power calculation to test the validity of the study results can be an acceptable method under certain conditions, for instance, in the case of a retrospective study, but caution must be given to the high risk of overestimating power. 38,116
Conclusion
Researchers must report the power of the study to ensure that the sample size is sufficient to detect a difference if one truly existed and to give readers of the paper an understanding of the strength and generalizability of the results.
2. Improvement from pretreatment status is the outcome of interest. Minimum description of pretreatment status should include demographic data, validated knee-specific PRO assessment, HRQoL, and measure of type and level of preinjury sport/activity
The goal of all available treatments for an ACL injury is to improve the outcome from the pretreatment status. Hence, without assessment of the pretreatment status, the relative improvement cannot be measured and reported. Assessment of the pretreatment status is also important to identify baseline variables that may confound or explain a given study result. When comparative trials are conducted, variables known to influence the outcome of interest should be equally distributed between the groups or otherwise adjusted for by using appropriate statistical methods. Adjustments can be planned a priori based on previous studies or assessed by adjusting for variables that correlate with both the predictor and the outcome. Researchers should thoroughly plan data collection before the study start while considering their study population and their research question.
The demographic data should give an overview of the characteristics of the investigated population, which aids to determine the generalizability of the study results. Demographic data should at a minimum include patient sex, age, anthropometric data, relevant medical history, and prior knee joint injuries. Family history of ACL injuries may also be relevant because a heritable component of ACL injuries appears to exist.
18,118
Moreover, the type and level of preinjury sport or activity should be reported to determine whether the treatment successfully returned the patients to their preinjury activity level. The recommended tool for sport and activity assessment is the Marx activity scale,
66
which has been validated and has high reliability. The Marx activity scale enables an evaluation of both the type of activity and the exposure time, which are both crucial aspects when reporting on activity. In this aspect, it differs from other measures of activity, for example, the Tegner activity scale,
105
which enables grading of activity level but does not account for activity exposure. Other validated tools for activity include, for example, the International Knee Documentation Committee Subjective Knee Form (IKDC-SKF),
50
which includes 1 item (item 8) related to the activity level that the patient performs on a regular basis. The item is answered by choosing 1 of 5 responses ranging from
Tools for Activity Assessment
Pretreatment assessment of PROs is particularly valuable for patients with chronic ACL injuries or as a presurgical treatment baseline for patients undergoing delayed ACL reconstruction. This is because patients with chronic ACL injury may have had the time to live with and try to cope with the potential limitations of their ACL-deficient status as opposed to the acutely injured patients who are impaired because of injury-related factors (eg, pain and hemarthrosis). There is, however, no strict definition for what should be regarded as early and delayed ACL reconstruction, and the timing of ACL reconstruction varies considerably between geographical regions. 87 Surgery within 3 weeks has been defined as an early ACL reconstruction, 33,101 although this definition is not consistent, and a recent literature review found that the definition of early ACL reconstruction ranged from 2 days to 7 months among the included trials. 2 For correct interpretation of the pretreatment assessment, it is important that the time from injury to pretreatment assessment is always reported, as outcomes may be very different for a patient who is completing such an assessment soon after injury compared with a patient who was injured many years previously.
The impact of the ACL injury on the patient’s overall well-being and quality of life before treatment should also be measured. 72,85 A health-related quality of life (HRQoL) measure covers a larger picture of how an ACL injury affects a patient in terms of physical, social, and emotional health, which must not be overlooked among patients sustaining an ACL injury. 34 Pretreatment assessment of HRQoL allows for evaluation of health status over time and whether the treatment restores the patient to better, similar, or worse health. Most measures of HRQoL also have the advantage of providing the possibility to determine utilities that are used in estimating the economic impact of the injury and allow for comparison between many other conditions and treatments. A list of HRQoL measures is provided in Table 4.
Health-Related Quality of Life Outcome Measures
Conclusion
Description of the sample in terms of demographic characteristics, preinjury activity level, and pretreatment PROs is necessary to interpret the results of treatment and generalizability of the study.
Section 2: Clinical Outcome Assessment
3. Minimal length of follow-up when reporting outcomes depends on the outcome being assessed and should optimally include 80% of the entire cohort (25/25; 100% agreement)
The follow-up time of a study should be defined depending on what is relevant in relation to the primary investigated outcome. In general, outcomes after ACL treatment can be divided into 4 categories: early adverse events, PROs, ACL failure/recurrent ligament disruption, and clinical measures of knee function and structure (Table 5), all of which could be further stratified in specific outcomes, necessitating different considerations for follow-up time as exemplified in Table 6.
4 Robust Outcome Categories After ACL Injury Treatment
Examples of Outcome Measurements and Considerations for Follow-up Time
Evidence provided by previous research as well as clinical experience is the foundation to determine what a relevant follow-up time is. For example, the rates of ACL rerupture and ACL revision peak at 1 to 2 years after an ACL reconstruction and with return to sport (RTS). 32,40,61,82,118 Therefore, a study with a shorter follow-up than this is not relevant if the primary outcome is rerupture or ACL revision, and a study aiming to make conclusions about ACL treatment failure should not have a follow-up time of less than 2 years and should report RTS as a proxy of risk exposure. In contrast, the outcome of septic arthritis or hardware failure can manifest soon after an ACL reconstruction, 99,114 and a follow-up time of ≤6 months is sufficient to collect data that will represent a true estimation of such outcomes. Thus, it is important that the follow-up time is defined and based upon the study aims and outcomes.
In most studies, especially with increasing length of follow-up time, a certain degree of patients lost to follow-up is inevitable. Even a small proportion of patients lost to follow-up can lead to considerable study bias, 16 although a common opinion is that a dropout rate of more than 20% is associated with a serious threat to the internal and external validity and power of the study. 93 A study is therefore recommended to optimally include at least an 80% follow-up rate. However, the possibility of dropout/retention bias should always be considered when patients are lost to follow-up. Data should be presented such that the dropout rate is accurately reported. A strict adherence to the use of checklists is encouraged to facilitate complete data reporting, such as the CONSORT (Consolidated Standards of Reporting Trials) statement 97 for randomized controlled trials and the STROBE (STrengthening the Reporting of OBservational studies in Epidemiology) statement 113 for cohort studies. Clear step-by-step flowcharts are encouraged. Whenever dropouts are present, the authors are recommended to perform a dropout sensitivity analysis to enable interpretation of the possible dropout effects. This should include a comparison of the baseline characteristics of those who completed versus those who did not complete the study.
It should be emphasized that there can be circumstances where an acceptable follow-up rate for a study is determined by weighing the disadvantages of loss to follow-up against certain advantages, for example, a long-term follow-up or a considerable amount of data in a study. In such cases, a lower threshold for follow-up rate is acceptable. Large registry studies can be used to exemplify this, where the patient response rates to PROs are a challenge. 43 Registries comprise data on large numbers of patients and include multiple follow-up occasions, sometimes over more than a decade. 43,103 Hence, they are important sources for determining the effectiveness of ACL treatment and for providing hypotheses-generating results. 104 Nonetheless, a large dropout rate increases the importance of a stringent data reporting, and a statistical analysis of patients lost to follow-up also needs to be considered.
Conclusion
Follow-up time should be determined by the purpose of the study and primary outcome and should be stated a priori. The follow-up rate should optimally exceed 80%, and data must be reported so that the possible effects of patients lost to follow-up can be considered.
4. Comprehensive assessment after ACL surgery (minimum 2 years) should include adverse events, clinical measures of knee function and structure, PRO, activity level, and recurrent ligament disruption (25/25; 100% agreement)
A comprehensive assessment after ACL reconstruction should aim to provide a complete picture of outcome related to different dimensions of limitations, which involves numerous aspects of knee-related health and function, objective assessment of hard endpoints (Table 6), and technical aspects of the surgery (graft choice, fixation, tunnel placement, meniscus/cartilage assessment, and treatment). A minimum follow-up of 2 years is likely necessary to enable a comprehensive assessment. Multiple follow-ups during the first 2 years could certainly fulfill the purpose of evaluating, for example, the progress such as in the early, middle, and end state of the rehabilitation. However, the final assessment should be withheld until 2 years postoperatively because a substantial number of outcomes require that this time has been given for the ACL reconstruction to completely heal 47,83,112,123 and for the patient to complete rehabilitation and progress to testing the knee in more demanding activities including full participation in sport or activity. A follow-up of 2 years should allow for determining the patient’s capability of a successful RTS, 6 and importantly, it will include a period when patients are participating at high-risk exposure for ACL failures and reinjuries. 32,40,61,82,118 An optimal 2-year outcome assessment should therefore include reporting of the rate and time of RTS. A consensus statement related to assessment and reporting of RTS was similarly reached at the ACL Consensus Meeting Panther Symposium 2019 and is provided in a separate publication. 67
A comprehensive assessment also implies that the contralateral knee should be examined and assessed for each outcome. Outcome tools such as the IKDC Knee Ligament Standard Evaluation Form 45 require a comparison with the contralateral knee for the standardized reporting. The uninjured contralateral knee serves as a reference for the ACL-injured knee in terms of range of motion, laxity, and functional performance, 119 which helps to account for differences between patients. It should also be noted that the contralateral limb/leg/knee might also be affected by an ACL injury such as altered kinematics 53,68 and a decrease in muscle strength, 119 which underscores the importance to ensure that the function of the contralateral limb is optimized before allowing the patient to return to knee-strenuous activities by assessing it likewise. It is therefore recommended that the standard practice is to assess the contralateral knee and report such data, which ultimately will contribute to increased knowledge of risk factors for a patient sustaining a subsequent contralateral ACL injury.
Failure of ACL reconstruction is a nonspecific term that is commonly used without a stringent definition in the literature. It is therefore recommended that well-defined outcome assessments are used and that the authors, if choosing to use the term
Conclusion
A minimum of 2-year follow-up is necessary for a comprehensive and reliable determination of outcome. The comprehensive assessment should include outcomes provided by clinical examination, PROs, activity level, and verified reinjuries.
5. Comprehensive assessment after ACL surgery in the medium to long term (≥5 years) should also include measures of posttraumatic OA (25/25; 100% agreement)
It is well known that sustaining an ACL injury entails a high risk of developing posttraumatic OA in the midterm to long term, especially if concomitant intra-articular injuries are present. 1,21,81,86 Reducing the risk of OA is a clinical priority, which means that the midterm to long-term follow-up assessment should include measures of OA to monitor and evaluate the degenerative changes in the knee joint. This is necessary for developing therapeutic interventions aiming to counter the high rate of OA after an ACL injury.
Measures of OA may include clinical examination, PROs, and imaging modalities. Clinical examination findings that may indicate OA are joint-line tenderness or crepitus, which previously have been found to be strong predictors for OA. 96 Good interobserver reliability for joint-line tenderness and crepitus has been reported when a standardized approach is used. 65 The IKDC Knee Ligament Standard Evaluation Form includes a grading system for such an examination and should be used for standardized reporting. 45
The use of PROs is valuable to capture the patients’ perception of impairments caused by OA. Questionnaires specifically developed and validated for assessment of OA are the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) 13 and the Knee injury and Osteoarthritis Outcome Score (KOOS). 91 However, the WOMAC was developed for evaluation of established OA; as such, the KOOS may be a more appropriate assessment for patients after ACL injury. This is because the KOOS is more likely to detect early development of OA compared with WOMAC, as the KOOS was developed to cover a broader spectrum from knee injury to manifest OA. 91,92
Imaging modalities still provide the most sensitive assessment of OA, although not without limitations. One should remember that radiographic findings of OA are not necessarily accompanied by symptomatic OA, 5,84 and other intra-articular abnormalities may give similar symptoms as OA. It is therefore recommended to combine radiographic imaging assessment with PROs for decision making when it comes to symptomatic OA. Radiographic findings should be described in a standardized manner using validated tools, where the Kellgren-Lawrence perhaps is the most commonly used tool, taking into account osteophyte formation, sclerosis, joint space narrowing, and bone deformity. 55 Although plain radiography has long been the established method for imaging of OA, it must be acknowledged that the modality has a limited capacity to visualize early stages of OA and to grade OA progression. 63
The rapid evolution of MRI techniques enables a much more comprehensive assessment of knee joint structure, such as early morphological and biochemical changes of articular and periarticular structures. Quantitative measurements of cartilage thickness on MRI have a higher sensitivity for knee OA compared with traditional radiological measures. 121 In addition, MRI detects characteristic OA signs earlier and with a greater sensitivity compared with radiography. 41 Structural intra-articular changes are indicative for OA and can be seen as early as 2 years after an ACL reconstruction with MRI, which is earlier than these changes can be seen on radiographs. 19,108 In addition, MRI can also rule out other intra-articular injuries that may explain symptoms perceived by patients. Thus, although plain radiography has an established role in assessment of OA and is favorable from an availability and cost perspective, its main role is to assess the development of OA in the long term and for already established OA. For early or midterm assessment of OA, attempts should be made to include MRI to detect early changes with greater validity and sensitivity. 41
It is not known when clinically relevant posttraumatic OA occurs or when in this process the structural changes of the knee joint start to appear. With advancement in imaging techniques, there is a risk of overdiagnosis of OA because structural changes without clinical significance might be detected. Future research will hopefully provide a clearer picture of this as well as methods to distinguish between what are pathological changes and what changes are related to normal aging. 64 Until then, an assessment of knee OA should always be made in relation to a “control knee” to provide a reference for such variables. A synthesis of current literature shows that the contralateral knee is most commonly used for this purpose, followed by using an age- and sex-matched control group. 86 The latter methodology, using a separate comparison group, is the preferred method because degeneration can occur in the contralateral knee, although it was not part of the original injury. Some studies have used baseline imaging of the acute ACL-injured knee as the control, 1 which cannot be recommended because this method does not take into account the impact of natural aging occurring between the injury and the long-term follow-up.
Conclusion
Outcome assessment of OA should include clinical examination, PROs, and imaging modalities, for which MRI is the preferred modality for increased accuracy. Imaging findings should always be set in context with the patient’s perception and the clinical examination for decision making. Hence, these outcome assessments are equally important for determining the outcome of OA.
6. Clinical assessment of ACL injury treatment should include measures of anteroposterior and rotatory knee laxity (25/25; 100% agreement)
The anatomic properties of the ACL make it a primary passive restraint to both anteroposterior (AP) and rotatory forces of the knee joint.
46
Valid assessment of knee joint laxity is therefore key in the evaluation of the outcome of surgical treatment after ACL injury, preferably at multiple follow-ups to detect any changes over time. Failure to eliminate knee joint laxity with ACL reconstruction could indicate treatment failure, while patients undergoing nonoperative treatment should be assessed for excessive laxity or propagation of knee joint laxity. The latter scenario might be an indication for subsequent operative treatment, although the term
Laxity assessment consists of static and dynamic examinations, and methods for both grading by the examiner and quantification of laxity have been developed. Laxity assessments should always include a side-to-side comparison with the contralateral knee. Static AP knee laxity tests consider a single degree of freedom of motion and include application of a unidirectional force in a single plane, such as the Lachman test and the anterior drawer test. The IKDC Knee Ligament Standard Evaluation Form provides a standardized classification of the degree of AP translation. 45 For instrumented quantitative assessment of AP laxity, the KT-1000/2000 arthrometer (MEDmetric Corp) 27 and the Rolimeter (Aircast) 8 provide among the most accurate measurements, although the intraclass correlation coefficient is variable according to the literature, and the results are examiner dependent. 88 Another instrument is the GNRB (Genourob), which is a robotic arthrometer developed to alleviate the difficulties with examiner-dependent measurements. The patient’s leg is placed in the robotic system, and a predefined force is applied to the proximal calf, while the relative displacement of the anterior tibial tubercle with respect to the patella is recorded by a displacement sensor. The GNRB also offers the advantage of using electromyography sensors to record hamstring activity to detect incomplete hamstring relaxation that affects the result. 89 Static AP measurements do not necessarily correlate with clinical outcome and function, 7,57,58 which indicates that laxity assessment should not solely rely on static AP translation because it fails to capture the more complex knee kinematics.
The pivot-shift (PS) test is considered to simulate a more physiological multiaxial loading of the knee joint because it is a dynamic test of laxity that evaluates both AP and rotatory laxity. 48 It has been reported as the most specific test for ACL deficiency. 14 On the other hand, the PS is characterized by a large variability in execution techniques, 60,77 which may lead to a variation in clinical grading between examiners. To overcome this, a standardized PS test has been described, which has led to an improved accuracy of the test. 77 Moreover, user-friendly devices for noninvasive quantitative PS have been developed and determined to be valid for objective assessment of the PS. 76 Such devices may include an inertial sensor system (KiRA; Orthokey) 124,125 to quantify the tibial acceleration during the PS and an image analysis system, 74 which enables a quantification of the lateral tibial translation during the PS. Both devices have been shown be able to validly detect differences between clinically high- and low-grade PS (Figures 2 and 3). 76 Examples of devices for quantitative AP and rotatory knee laxity that are easily applicable in the clinical setting are summarized in Table 7.

The KiRA inertial sensor system for quantifying lateral tibial acceleration during the pivot-shift test.

Image analysis system on iPad for quantifying lateral tibial translation during the pivot-shift test.
Devices for Quantitative Assessment of Knee Joint Laxity
Conclusion
Knee joint laxity should be assessed after ACL treatment and reported in a standardized manner using the IKDC Knee Ligament Standard Evaluation Form when clinical grading is used. The use of quantitative measures is encouraged to increase the reliability and validity of the assessment.
Section 3: Patient-Reported Outcome
7. Assessment of PRO should optimally include at least 1 knee-specific outcome tool, 1 activity rating scale, and 1 measure of HRQoL (25/25; 100% agreement)
The use of PROs has become a cornerstone for researchers to understand the patients’ perspective of the impact of ACL injury and treatment. During recent decades, technical development has facilitated the use of PROs, as patients can report and researchers can collect responses electronically. The time-efficient collection has tempted researchers to burden patients with more PROs in studies. Responder burden is an important term in research and is defined as the time to complete items as well as the physical energy and cognitive demands placed on those responding. In addition, all clinical testing of patients is part of the burden placed on our patients. Because of the risk of excessive responder burden, which threatens the validity of a patient’s responses and thus their score, researchers are advised to wisely choose PROs specific for the study purpose.
Similar to statement number 2 of this consensus paper on baseline information to collect, it is recommended to use at least 1 knee-specific tool, 1 HRQoL tool, and 1 activity rating scale. This provides the researcher with a comprehensive picture of the patients’ perception of outcome after treatment.
Conclusion
To give a comprehensive assessment of the patients’ perception of the impact of ACL injury and outcome of treatment, validated knee-specific PRO assessment, HRQoL, and measure of type and level of preinjury sport/activity should be collected before and after treatment.
8. The IKDC-SKF is the recommended knee-related outcome measure for ACL injury and treatment (24/25; 96% agreement)
The evaluation of treatment outcome started historically with use of objective measurements as proxies for what clinicians and patients really cared about. For instance, both rating scales and measures of range of motion, strength, and laxity were frequently used; however, these measures are limited by interrater and intrarater variability and alone failed to determine symptoms and limitations perceived important by the patient. Failure to report and quantify the patients’ perspective of treatment outcome after ACL injury led to the development of knee-related PROs during the late 1990s and early 2000s. The 2 most commonly used PROs after ACL injury are the KOOS and the IKDC-SKF, which were both developed during this time period. Measurement properties of the IKDC-SKF and KOOS are presented in Table 8.
Psychometric Properties of the IKDC-SKF and the KOOS
37
These PROs have advantages and disadvantages, and when choosing between them, one should evaluate what the population is and what it is that you want to capture. Most importantly, measurements should consist of those that are relevant to the patient and capture the full range of symptoms, activity limitations, and participation restrictions to increase the relevance and validity in results attained from PROs. 23 It is essential that the PROs have undergone rigorous validation to the target condition to be able to differentiate better from worse treatment outcome. The inappropriate use of a PRO can distort results from a study and cause difficulties to detect differences, as items may not be relevant for the given population. This can be the case when a questionnaire aimed to assess outcome in patients with OA is used to assess patients with an ACL injury.
The KOOS is an extension of the WOMAC 12 (covers the subscales of Pain, Symptoms, and limitations in Activities of Daily Living) and was validated for patients with OA of the knee. The initial idea of the KOOS was to develop a region-specific outcome to capture the progression of knee-related symptoms across the life span of a patient from knee injury to the development of OA. Despite the inclusion of the Sport and Recreation and Quality of Life subscales, the KOOS has limited measurement properties in the 3 original WOMAC subscales when used for patients after ACL reconstruction. 23,59 It is also worth mentioning that the hybrid version of the KOOS, the KOOS4 (a modified version in which the items related to activities of daily living have been excluded to avoid ceiling effects) 36 has not undergone a validation. 22,23 This is problematic, as the ability to detect differences between treatments will be limited with the KOOS used in patients with an ACL injury. 59 Using PRO measurements that include items that are not relevant or do not cover important limitations of the target condition is not optimal. Using such PROs entails a potential washout of treatment effects, inadequate measurement properties, and risk of false negative findings. 23,95,102 In terms of the KOOS, several questions are at risk for a ceiling effect when used in patients after ACL reconstruction; that is, the item is too “easy” for the patient. In addition, the KOOS does not include specific items relating to instability, which is one of the most common symptoms and one of the strongest indications for an ACL reconstruction. The KOOS consists of 42 items entailing higher responder burden compared with other outcomes such as the IKDC-SKF. Awareness of the limitations of the KOOS for the patients after an ACL injury or reconstruction is important to avoid missing the effects of treatment results.
The IKDC-SKF was developed as a region-specific outcome relevant for a variety of conditions including ligament and intra-articular abnormalities. 50 This PRO underwent rigorous testing during its development including a reduction from 42 to 18 items and an exploratory factor analysis, suggesting that it was reasonable to combine the items into a single overall score. To test the relevance of the IKDC-SKF for patients with an ACL injury, Rasch analysis was performed separately for patients with and without knee ligament injury. 50,109 The analysis supported the premise that the items of the IKDC performed similarly in terms of difficulty for individuals with or without a ligament injury. The results from the primary testing of the IKDC-SKF also indicated that the IKDC-SKF items performed the same, regardless of age, sex, and a variety of diagnoses including ligament, meniscal, and articular cartilage injury and patellofemoral pain. 29,50
The IKDC-SKF is recommended as the knee-related PRO to use for patients after ACL reconstruction because of its quick-to-use 18 items. 50 The IKDC-SKF shows adequate internal consistency and has no floor or ceiling effects across mixed groups of patients with knee conditions. 29 It also has high levels of test-retest reliability, construct validity, and responsiveness. Moreover, normative data have been determined, which is valuable for comparisons as well as cutoffs for what the patients consider an acceptable symptom state. 52
There are also other promising PROs used to cover different aspects of recovery after ACL reconstruction, including the Quality of Life Outcome Measure for Chronic Anterior Cruciate Ligament Deficiency (ACL-QoL) 69 and the Knee Numeric-Entity Evaluation Score (KNEES-ACL). 24 The ACL-QoL is used to determine the effectiveness of ACL reconstruction, or any other treatment, and is a 32-item condition-specific quality of life scale for patients with ACL deficiency. 69 The KNEES-ACL was developed in 2013, 24 and the thorough development process and dimensionality assessment resulted in 42 items across 7 latent constructs. There is strong positive evidence given to content validity. 24,25
The ACL-QoL and the KNEES-ACL are promising outcome measurements and likely will help us to better understand patients who have sustained an ACL injury. However, these PROs have mainly been used in comparative studies and are yet to be compared with the established IKDC-SKF and KOOS to prove their respective strengths of constructs.
Conclusion
The IKDC-SKF is the recommended knee-related outcome measure for ACL injury and treatment.
9. Measurement of the patient acceptable symptom state (PASS) is valuable in the assessment of outcome of ACL injury and treatment (25/25; 100% agreement)
As researchers and clinicians of today, we are equipped with a great variety of PROs. However, the development and use of these PROs means little if the results are not interpreted in a clinically meaningful manner. The use of numeric scores poses a risk that researchers focus myopically at numbers and statistically significant findings without reflecting over whether such findings really are impactful from the patient’s perspective. For many such PROs, the same score can be achieved, despite that patients respond differently to the items that comprise the PRO measure. The question of whether the patient perceives an acceptable symptom state is a priority for all clinicians, and the use of the PASS in PRO assessment is important. The PASS considers a single-item question and aims to determine a threshold beyond which the patients consider themselves “well.” 75 Thresholds for the PASS have been established for the KOOS and the IKDC-SKF by asking the question “Taking account of all the activity you have during your daily life, your level of pain, and also your activity limitations and participation restrictions, do you consider the current state of your knee satisfactory?” alongside the administered PRO. 75 Several studies have since then applied the PASS values for the KOOS and IKDC-SKF when reporting on outcome after ACL treatment. 26,42,44,111
A single-item outcome like the PASS summarizes the patient’s perception and allows the patient to make an overall statement through a binary answer: “yes” or “no.” A numeric scale might have its advantages; however, it is associated with difficulties of interpretation for both patients and researchers. That is, what is considered as a good and poor outcome, respectively? The PASS reference value at which a majority of the patients feel well is valuable for determining this important question, and its use is warranted to overcome limitations with numeric PROs such as ceiling effects and poor responsiveness. 49,72
In addition, the evidence to support the interpretation and use of a PRO should include the minimum detectable change (MDC) score and the minimal clinically important difference (MCID) score. These scores collectively describe the responsiveness of the PRO, which is the ability to detect a clinically important change in outcome for the metric. The MDC is the amount of change that is needed to confidently state that the change is beyond measurement error. 10 Thus, if a study finds a difference that is smaller than the MDC for the chosen PRO, one should be careful to draw any conclusions because the observed difference is within the range of measurement error for the PRO. On the other hand, if the change in outcome is larger than the MDC, it still remains unknown whether this change is clinically relevant. This is where the MCID becomes valuable. If a change in outcome exceeds the value of the MCID for the PRO, the difference is likely to be perceived as important by most patients. 51
Conclusion
The PASS is a valuable complement to numeric PROs and should be used to facilitate interpretation of PROs. Researchers should also consider the MDC and MCID for the PRO when reporting and discussing their study findings.
Future Directions
Reaching consensus for clinical outcome assessment after ACL treatment is an important step toward refining and improving the quality of ACL research. Further efforts should be made to develop methods for outcome assessment that provide the most relevant and valid data for patients receiving ACL treatment. A focus is to improve the PRO assessment. The collection of PROs has become increasingly important among health care professions. Not only is it a valuable asset for a clinician to understand a patient’s perception of health and results of treatment, it has also gained importance for policy makers in determining health care quality and developing a value-based health care. 72 Commonly used PROs in ACL research are limited by a format of fixed-length surveys that many times include items of questionable relevance for the young and active population sustaining ACL injuries, leading to ceiling effects and potentially survey fatigue. Therefore, a current priority is to decrease the responder burden for patients in PRO assessment.
Improved PRO data collection may be achieved through the use of the item response theory (IRT), 20,35 which has enabled the introduction of computer adaptive testing (CAT). The underlying premise of IRT is that the way a patient responds to an item (question) is based on the difficulty of the question and the ability of the patient. When administered as a CAT, a mathematical algorithm is utilized to select items that are matched to the ability of the patient. For example, if a patient responds to an item that he/she is unable to walk a mile, the computer algorithm will bypass “harder” items such as running a mile and select an easier item such as ability to walk a block. This means that only items that are relevant about the patient’s ability level are administered, which substantially reduces the time and burden associated with administration of PROs. Efforts are under way to convert the IKDC-SKF to a CAT format that is based on IRT.
Although computer-aided PRO assessment likely is the future, further research for optimization of currently used PROs is needed. Research should focus on determining the most responsive items of current PROs to condense the surveys to include only the most responsive questions. This is important when considering the already collected PRO data for tens of thousands of patients in large registries and national databases. Such data might need to be reanalyzed using the condensed PROs and thereby provide results with a greater precision on clinically relevant outcomes.
Other important aspects for further research are outcome measures on activity and RTS after ACL treatment. Optimally, a tool that is able to quantify sports participation in terms of level, volume, and intensity should be developed and implemented as a standardized tool used across studies. With the rapid evolution of technology, the future will likely also hold easily accessible use of quantitative instruments for quantitatively measuring patient activity, for example, the use of Global Positioning System and motion detectors during sports participation, measurements of joint function, and measurements of heart rate and speed to estimate intensity.
Conclusion
Clinical outcome assessment after ACL injury can be divided in 4 robust categories: early adverse events, PROs, ACL failure/recurrent ligament disruption, and clinical measures of knee function and structure. A minimum of 2-year follow-up is necessary for a comprehensive and reliable determination of outcome, which should include outcomes provided by clinical examination, PROs, and verified reinjuries. The PRO assessment is a cornerstone in evaluating outcome after ACL injury, where validated knee-specific PRO assessment, HRQoL, and measure of type and level of sport/activity should be collected. The IKDC-SKF is the recommended knee-related PRO measure for ACL treatment, and the use of PASS is encouraged to facilitate interpretation of PROs.
Footnotes
Final revision submitted April 28, 2020; accepted May 12, 2020.
One or more of the authors has declared the following potential conflict of interest or source of funding: The ACL Consensus Meeting Panther Symposium at the University of Pittsburgh Medical Center (UPMC) was sponsored by Smith & Nephew, the UPMC, Elizur, Arthrex, Conmed, DJO, Mid-Atlantic Surgical Systems, RTI Surgical, and BlackRhino Medical. K.S. has received consulting fees from the National Football League, Service Excellence, Mitek, Flexion Therapeutics, Samumed, and NovoPedics and royalties from nPhase. C.B.M. has received consulting fees from Linvatec, Medacta, Stryker, Wright Medical, and Zimmer Biomet. V.M. has received educational grants, consulting fees, and speaking fees from Smith & Nephew and educational grants from Arthrex. O.R.A. has received speaking fees from Conmed and honoraria from DJO. S.D. has received personal fees from Zimmer Biomet. A.G. has received grants and personal fees from Smith & Nephew and Ossur and personal fees from Graymont and Olympus. C.C.K. has received grant support from DJO, educational support from CDC Medical, consulting fees from Zimmer Biomet, and nonconsulting fees from Arthrex. R.K. has received grants from Smith & Nephew, Zimmer Biomet, Stryker, and Johnson & Johnson; consulting fees from Medacta, Arthrex, Japan Tissue Engineering, and Hirosaki Life Science Innovation; and speaking fees from Arthrex, Smith & Nephew, Zimmer Biomet, Johnson & Johnson, and Japan Tissue Engineering. B.L. has received royalties from Wolters Kluwer Health–Lippincott Williams & Wilkins. R.G.M. has received stock/stock options from MEND Nutrition and royalties from Springer and Demos Health. L.P. has received research support from the Australian Orthopaedic Association, Friends of the Mater Foundation, and Smith & Nephew; has received speaking fees from Smith & Nephew; has received royalties from Australian Biotechnologies and Signature Orthopaedics; has stock/stock options in Australian Biotechnologies; and has patents with Hip Developments, Smith & Nephew, and Surgical Apps. A.R. has received other support from Enhatch, Conformis, Stryker, Smith & Nephew, Arthrex, Anika Therapeutics, and Bodycad. B.R. has received fellowship support from Smith & Nephew, has received royalties from Elsevier, has stock in Johnson & Johnson and Merck, and is the Editor-in-Chief of
