Abstract
Introduction:
Dynamic changes in liver function tests have been proposed to correctly reclassify the risk of choledocholithiasis; however, information is scarce and insufficient to recommend its use.
Methods:
Retrospective cohort of patients undergoing endoscopic retrograde cholangiopancreatography (ERCP) due to moderate and high risk of choledocholithiasis according to the 2019 American Society of Gastrointestinal Endoscopy (ASGE) guidelines. We evaluated whether significant changes in liver function tests (bilirubin, transaminases, or alkaline phosphatase), defined as an increase or a reduction ⩾30 or ⩾50% between two measurements taken with a difference of 24–72 h can correctly reclassify the risk of choledocholithiasis beyond the ASGE guidelines. The net reclassification index (NRI) was calculated for patients with and without choledocholithiasis.
Results:
Among 1175 patients who underwent ERCP, 170 patients were included in the analysis (59.4% women, median 59.5 years). Among patients without a diagnosis of choledocholithiasis, the number of patients correctly reclassified by transaminases was slightly higher than those incorrectly reclassified (NRI = 0.24 for aspartate amino transaminase and 0.20 for alanine amino transaminase). However, among patients with a diagnosis of choledocholithiasis, it led to incorrect reclassification in a greater number of cases (NRI = −0.21 and −0.14, respectively). The benefits of reclassification were minimal for bilirubin and alkaline phosphatase, or for value changes >50%. A subgroup analysis showed similar findings in patients without a history of cholecystectomy and in those with normal bile duct.
Conclusion:
Dynamic changes in liver function tests do not improve choledocholithiasis risk classification beyond the 2019 ASGE criteria. New criteria should continue to be sought to optimize risk stratification.
Keywords
Introduction
Choledocholithiasis is responsible for more than 1.4 million annual emergency department visits,1,2 with indirect costs of approximately $6.5 billion annually in the United States.3,4 Stone removal by endoscopic retrograde cholangiopancreatography (ERCP) is the preferred treatment, but is a complex endoscopic procedure associated with life-threatening complications, such as post-ERCP pancreatitis, hemorrhage, perforation, and infection,5,6 so it is reserved for cases where clinical suspicion is very high. In patients with a low clinical suspicion, it is preferable to make the diagnosis with less invasive evaluations such as magnetic resonance cholangiopancreatography (MRCP) or endoscopic ultrasound (EUS). 7
Scientific societies recommend stratifying the probability of having choledocholithiasis through identifiable predictors in clinical presentation, biochemical tests, and abdominal ultrasound findings, 8 restricting ERCP only to patients with the highest risk. The most frequently used stratification algorithm was developed by the American Society of Gastrointestinal Endoscopy (ASGE). 9 In 2010, a first version was proposed that classified patients into low risk (<10% probability), intermediate risk (10−50%), and high risk (>50%). As a high-risk criterion, the presence of any of the following predictors was established: (1) ascending cholangitis, (2) choledocholithiasis visualized in imagen, (3) total bilirubin >4 mg/dL, or (4) the simultaneous appearance of a dilated common bile duct on ultrasound (>6 mm in a patient with gallbladder or >8 mm in cholecystectomy) and bilirubin level between 1.8 and 4 mg/dL. Intermediate risk was classified as those with the presence of one strong predictor (common bile duct dilatation or moderate elevation of bilirubin) or any moderate predictor (abnormal liver biochemical tests other than bilirubin, age over 55 years and clinical biliary pancreatitis), and as low risk those without any of these predictors present.
After the publication of the first ASGE guidelines, several evaluations concluded that these criteria had a suboptimal performance, and their application would lead to unnecessary ERCP in more than one-third of patients, resulting in an unacceptable risk of potential complications.10 –12 Therefore, ASGE modified its stratification tool in 2019 seeking to improve diagnostic accuracy in the high-risk group. The new high-risk criteria include: (1) ascending cholangitis, (2) choledocholithiasis in ultrasound or abdominal CT, or (3) total bilirubin >4 mg/dL plus dilated common bile duct. With intermediate risk: (1) abnormal liver biochemical other than bilirubin, (2) age over 55 years, and (3) common bile duct in the image are included. The recommendations derived from this stratification were not modified: ERCP to patients at high risk, and MRCP or EUS to patients at intermediate risk. Recent evaluations comparing the previous 2010 criteria with those of 2019 have consistently found an improvement in specificity without reaching the desired optimal level,13 –15 so new methods have been proposed to improve the identification of patients who can be taken directly to ERCP, trying to minimize the proportion of negative studies. 16
One of the complementary methods to assess the likelihood of choledocholithiasis is the evaluation of dynamic changes in liver function tests,17 –19 considering that stones can migrate in the biliary tree and modify these parameters. However, to date, the information is scarce and insufficient to recommend or limit its use, especially considering the methodological limitations of previous studies, such as the variable and often excessive time between paraclinical measurements and confirmation of choledocholithiasis, and the lack of a single gold standard for diagnosis, which has led to divergent results.
The aim of this study is to evaluate whether dynamic changes in liver function tests (defined as an increase or decrease greater than 30% or 50% in two measurements), with a minimum difference of 24 h and a maximum difference of 72 h, can correctly reclassify patients at intermediate or high risk of choledocholithiasis, using the ASGE 2019 criteria as a baseline and considering ERCP as the only gold standard. To this end, we evaluate the net reclassification index (NRI), 20 a measure increasingly used to evaluate improvements in risk prediction by estimating the incorporation of new parameters into validated statistical models.
Methods
A retrospective cohort study was designed which included all hospitalized patients who underwent ERCP for moderate and high risk choledocholithiasis according to ASGE 2019 guidelines at the Hospital Universitario San Ignacio (HUSI) in Bogotá, Colombia between 1 January 2015 and 31 December 2022. Patients with a history of choledocholithiasis, previous biliary intervention, known liver disease, and gastric or biliary surgery (Roux-en-Y gastric bypass, Billroth I or II, choledochojejunostomy, or hepaticojejunostomy) were excluded, as were patients with acute conditions such as sepsis and septic shock, and those who did not have two liver function test measurements with a minimum of 24 h and a maximum of 72 h between them before undergoing ERCP. In addition, patients with failed ERCP (where cannulation of the duodenal papilla was not achieved) and those who underwent ERCP six or more days after admission were excluded. The study was approved by the Ethics Committee of the Hospital Universitario San Ignacio and the Pontificia Universidad Javeriana.
Patients were identified from a database that systematically records all ERCPs performed by the HUSI Gastroenterology Service. After reviewing the inclusion and exclusion criteria, information was collected from the institutional electronic medical records, including demographic variables, paraclinical reports [bilirubin, transaminases, and alkaline phosphatase (ALP) levels], available imaging studies (abdominal ultrasound, scenography, nuclear magnetic resonance, MRCP, or EUS), and the written report of ERCP, using a standardized format.
Patients at high and intermediate risk for choledocholithiasis were defined according to previously established ASGE 2019 criteria. A diagnosis of choledocholithiasis on ERCP was considered if stones, stone fragments, or biliary sludge were seen in the duodenal lumen. Significant changes in liver function tests for the diagnosis of choledocholithiasis were considered as an increase ⩾30% or ⩾50% between the two measurements, with a minimum difference of 24 h and a maximum difference of 72 h. A decrease ⩾30% or ⩾50% of the same tests was considered a criterion for excluding the diagnosis. If there were more than two measurements in this period, the second most different value was chosen.
Continuous variables with a normal distribution are presented as mean and standard deviation and those without as median and interquartile range. The Shapiro–Wilk test was used to assess the assumption of normality. Categorical variables are presented as absolute frequencies and percentages.
To assess the extent to which the risk of choledocholithiasis is correctly or incorrectly reclassified, the category-based NRI 21 was used, evaluating the addition of the marker of significant change in liver function tests to the 2019 ASGE criteria. The entire sample was first stratified into intermediate- and high-risk categories, separating for analysis the groups with confirmed and excluded diagnosis of choledocholithiasis by ERCP. Within each group, the percentage of patients correctly or incorrectly reclassified with the addition of the new variable was evaluated. Changes in risk classification were quantified by the NRI, with each of its components reported independently: event NRI (confirmed diagnosis of choledocholithiasis) and non-event NRI (exclusion of choledocholithiasis). Statistical analysis was performed using the STATA 16® statistical package (StataCorp, College Station, TX, USA).
Results
We identified 1175 unique patients who underwent ERCP on their first admission. We excluded 233 patients whose primary indication was not suspected choledocholithiasis (201 for malignant stenosis, 32 for bile duct revision for suspected postoperative injury), 56 for history of choledocholithiasis with previous bile duct intervention, 10 for known liver disease, 17 for septic shock or sepsis, 3 for Roux-en-Y gastric bypass, 1 for Billroth II, and 63 for ERCP performed six or more days after admission. A total of 602 patients did not have two liver function tests available within the specified time.
Finally, 170 patients were included in the analysis. The general characteristics of the patients are shown in Table 1. Of the total number of patients, 59.4% were women (n = 101), the mean age was 59.5 years (IQR: 32–72), and 22.9% had a history of cholecystectomy. Patients without choledocholithiasis had higher ALP levels (median 378 versus 251 U/L, p = 0.023) and were more likely to have total bilirubin levels >4 mg/dL (40.4% versus 67.6%, p = 0.004). On the other hand, patients with choledocholithiasis had higher AST levels (246 versus 203 U/L, p = 0.019). Abdominal ultrasound documented common bile duct stones in 30 patients (17.6%), of whom 27 were diagnosed with choledocholithiasis. Common bile duct diameter was larger in patients without choledocholithiasis (9.75 versus 7.9 mm, p = 0.002). A total of 70.6% of patients were classified as high risk according to ASGE 2019 criteria. Six patients had complications related to ERCP, four cases of bleeding, and two cases of post-ERCP pancreatitis.
General characteristics of the population according to the diagnosis of choledocholithiasis.
ALP, alkaline phosphatase; ALT, alanine aminotransferase; AST, aspartate aminotransferase; BT, total bilirubin; CT, computed axial tomography; IQR, interquartile range.
Considered significant if p < 0.05.
Table 2 shows the NRI corresponding to a 30% change in liver function tests. Among patients without a diagnosis of choledocholithiasis, the number of correctly reclassified patients was slightly higher than the number of incorrectly reclassified patients (NRI = 0.24 for AST and 0.20 for ALT). However, among patients diagnosed with choledocholithiasis, incorrect reclassification occurred in a greater number of cases (NRI = −0.21 and −0.14, respectively). The benefit of reclassification was minimal for bilirubin and ALP.
Net reclassification rate – 30% changes.
Among patients without choledocholithiasis, patients correctly reclassified (moving from high-risk to intermediate-risk) are shown in light green and incorrectly reclassified patients (moving from intermediate-risk to high-risk) are shown in orange. Patients no reclassified are in gray. For the calculation of the NRI without event, the incorrectly classified patients are subtracted from the correct ones and divided by the total number (n = 34). Among patients with choledocholithiasis, patients correctly reclassified (moving from intermediate risk to high-risk) are shown in light green and incorrectly reclassified patients (moving from high-risk to intermediate) are shown in orange. For the calculation of the NRI with event, the incorrectly classified ones are subtracted from the correct ones and divided by the total number (n = 136).
It was calculated on 163 patients with second available measurement.
ALT, alanine aminotransferase; AST, aspartate aminotransferase; ASGE, American Society of Gastrointestinal Endoscopy; NRI, net reclassification index.
Table 3 shows the NRI for 50% changes in liver function tests. In patients without choledocholithiasis, AST correctly reclassified more patients than it incorrectly reclassified (NRI = 0.18), but in patients with choledocholithiasis, the situation was reversed (NRI = −0.16). The benefit of reclassification was minimal for ALT, bilirubin, and ALP.
Net reclassification rate – 50% changes.
Among patients without choledocholithiasis, patients correctly reclassified (moving from high-risk to intermediate-risk) are shown in light green and incorrectly reclassified patients (moving from intermediate-risk to high-risk) are shown in orange. Patients no reclassified are in gray. For the calculation of the NRI without event, the incorrectly classified patients are subtracted from the correct ones and divided by the total number (n = 34). Among patients with choledocholithiasis, patients correctly reclassified (moving from intermediate-risk to high-risk) are shown in light green and incorrectly reclassified patients (moving from high-risk to intermediate) are shown in orange. For the calculation of the NRI with event, the incorrectly classified ones are subtracted from the correct ones and divided by the total number (n = 136).
It was calculated on 163 patients with second available measurement.
ALT, alanine aminotransferase; AST, aspartate aminotransferase; ASGE, American Society of Gastrointestinal Endoscopy; NRI, net reclassification index.
Finally, a subgroup analysis was performed in the population without a history of cholecystectomy, and in those with a normal bile duct (common bile duct <8 mm), with values close to 0 for the NRI in both patients with and without choledocholithiasis, similar to what was found in the general population (Table 4).
Net rate of reclassification in patients with normal common bile duct, and no history of cholecystectomy.
ALT, alanine aminotransferase; AST, aspartate aminotransferase; NRI, net reclassification index.
Discussion
Our data suggest that the addition of dynamic changes in liver function tests (defined as 30% or 50% changes in measurements between 24 and 72 h before ERCP) does not improve the classification of choledocholithiasis risk over ASGE criteria 2019, regardless of cholecystectomy history or bile duct diameter.
It drew attention in our study that 70% of patients without choledocholithiasis were classified as high risk based on the presence of total bilirubin values >4 mg/dL and dilated bile duct. Similarly, we found higher values of bilirubin, ALP, and larger diameter of common bile duct. A possible explanation is that in this subgroup, 26% of patients had findings of malignancy, which was not known before performing the procedure, which may represent the reality of gastroenterology units in reference hospitals such as ours. The above result suggests that some patients in the high-risk category for choledocholithiasis might benefit from prior diagnostic imaging to rule out biliary obstructive malignant syndrome.
In our study, dynamic changes in liver function tests did not improve the risk classification of choledocholithiasis beyond the criteria proposed by ASGE 2019. Although there is a small benefit in patients without choledocholithiasis, where a >30% reduction in AST or ALT correctly reclassifies a greater number of patients from high to intermediate risk, this benefit is lost because a similar proportion of patients with choledocholithiasis are inappropriately reclassified when moving from high to intermediate risk. Thus, the net clinical benefit is limited and should not be incorporated into decision-making because, although it would reduce the number of unnecessary ERCPs, it would also increase the performance of supplemental imaging in high-risk patients who do not need it. When evaluating changes >30% or 50% in bilirubin and ALP, the NRI did not show a significant change in the categorization of risk, neither in the group of patients with choledocholithiasis, nor in the group of patients without it. In conclusion, the monitoring of dynamic changes of liver function tests to reclassify the risk of choledocholithiasis is not recommended, so new criteria for the optimization of risk stratification should continue to be used.
Our studio has multiple strengths. First, we assessed the evidence in a consecutive series of all ERCPs performed at the institution limiting the risk of disease spectrum bias. Second, we compared the tests with the same gold standard for the diagnosis of choledocholithiasis (ERCP) unlike what was done in previous studies. Third, we standardized the time between the two liver function test measurements, which allows the findings to be reproduced, and prevents the findings from being secondary to actual changes in the presence of the disease.
However, there are limitations that we must recognize: The number of patients was relatively small, especially in the subgroup of patients without choledocholithiasis, which limits the precision of the estimates. Further studies, probably multicenter, will be needed to confirm these results. In addition, some authors have suggested that the NRI is not a reliable measure because it ignores the relative severity of different types of errors, knowing that different types of model errors have different harms for patients. 21 However, we report both correct and incorrect reclassifications in both choledocholithiasis and non-choledocholithiasis patients, making it easier to interpret the results by recognizing the consequences of different types of errors in each group.
Conclusion
Dynamic changes in liver function tests do not improve the risk classification of choledocholithiasis beyond the 2019 ASGE criteria. On the other hand, the ASGE classification is suboptimal for determining the risk of choledocholithiasis in the high-risk category since it tends to incorrectly categorize other causes of obstruction such as malignant obstructive biliary syndrome.
