Abstract
Large administrative healthcare (including insurance claims) databases are used for various retrospective real-world evidence studies. However, in pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension, identifying patients retrospectively based on administrative codes remains challenging, as it relies on code combinations (algorithms) and the accuracy for patient identification of most of them is unknown. This study aimed to assess the performance of various algorithms in correctly identifying patients with pulmonary arterial hypertension or chronic thromboembolic pulmonary hypertension in administrative databases. A systematic literature review was performed to find publications detailing code-based algorithms used to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients. PheValuator, a diagnostic predictive modelling tool, was applied to three US claims databases, yielding models that estimated the probability of a patient having the disease. These models were used to evaluate the performance characteristics of selected pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension algorithms. With increasing algorithm complexity, average positive predictive value increased (pulmonary arterial hypertension: 13.4–66.0%; chronic thromboembolic pulmonary hypertension: 10.3–75.1%) and average sensitivity decreased (pulmonary arterial hypertension: 61.5–2.7%; chronic thromboembolic pulmonary hypertension: 20.7–0.2%). Specificities and negative predictive values were high (≥97.5%) for all algorithms. Several of the algorithms performed well overall when considering all of these four performance parameters, and all algorithms performed with similar accuracy across the three claims databases studied, even though most were designed for patient identification in a specific database. Therefore, it is the objective of a study that will determine which algorithm may be most suitable; one- or two-component algorithms are most inclusive and three- or four-component algorithms identify most precise pulmonary arterial hypertension or chronic thromboembolic pulmonary hypertension populations, respectively.
Keywords
Introduction
Pulmonary hypertension (PH) is a rare and progressive disease characterised by increased pulmonary vascular resistance that ultimately leads to right heart failure and death.1,2 Pulmonary arterial hypertension (PAH) and chronic thromboembolic pulmonary hypertension (CTEPH) are two groups of PH (groups 1 and 4, respectively), 2 and derived estimates from UK and Swedish centres show that PAH and CTEPH are estimated to affect 44.8–46.0 patients per million and 24.2–34.9 patients per million, respectively.3–5 Patients with PAH or CTEPH are diagnosed on average between the age of 50 and 65 years and may present with similar symptoms, but there are differences in risk factors, pathologic mechanisms, diagnostic strategies and treatment approaches for the two diseases. 2
Administrative databases include structured, coded information for health reimbursement (claims), hospital management and national vital statistics. These databases are widely used in epidemiology, health economics and public health surveillance to provide real-world data on disease burden, treatment patterns, healthcare resource utilisation and costs. 6 Claims databases cover a large and representative sample of the population, making them of particular interest in the rare disease context. These databases typically collect information on diagnoses (e.g. using International Classification of Diseases (ICD) codes), procedures and drug use. While research is not the primary purpose of such databases, their advantages include easy access at a relatively low cost, not being subject to recall bias and data availability in a well-structured format. Limitations include their lack of clinical data and a total reliance on coding accuracy. 7
In PH, dedicated diagnosis codes for each group, including PAH and CTEPH, became available in the October 2017 ICD-10 update. 8 However, given the retrospective design of most claims database studies and a potential under-utilisation post-implementation, these new PH group-specific ICD-10 codes are still of limited usefulness for patient identification. Currently, identifying patients retrospectively relies on ICD-9 and pre-October 2017 ICD-10 codes, which differentiate only between primary (idiopathic/heritable PAH) and secondary PH (associated forms of PAH and PH groups 2–5), and do not appropriately reflect the current PH classifications.
To identify PAH or CTEPH patients from claims databases, different combinations of diagnosis, procedure and drug codes are used, herein collectively referred to as code-based algorithms. A large variety of code-based algorithms for PH have been published but the accuracy of very few has been assessed using clinical data.6,9 Therefore, the reliability of these algorithms remains unclear, and there are calls for future studies to address this. 6 As such, the aim of this study was to assess which code-based algorithms most accurately identified patients with PAH or CTEPH in claims databases, using the diagnostic predictive modelling tool, PheValuator. 10
Methods
A systematic literature review was performed to identify published code-based algorithms developed for PAH and CTEPH. The performance characteristics (sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV)) of the algorithms in identifying patients with PAH or CTEPH were then assessed using PheValuator. 10
Identification of published code-based algorithms
A systematic literature search was completed on 1 October 2019 using PubMed and Embase databases, in accordance with Preferred Reporting Items for Systematic reviews and Meta-Analyses guidelines (Supplementary Figure 1), 11 by the lead author (V.P.S.). The search was limited to English, German, French or Spanish language manuscripts published between 1 January 2000 and 30 September 2019, inclusive, without restrictions on type and origin of database, or age of the population studied. The definition and management of PAH and CTEPH have changed substantially in the past two decades. Thus, this time frame was chosen to capture the diversity of administrative algorithms used for patient identification. The full search strategy, including search terms, is shown in Supplementary Table 1. Further details on the screening of search results and extraction of information are provided in the Supplementary methods.
For PAH algorithms, priority for further evaluation was given to (1) algorithms that identified patients with PAH or any PAH subgroup in any administrative database, published between 2000 and 2019, for which the accuracy of patient identification was evaluated in the original publication, and (2) algorithms identifying patients with PAH in administrative databases for health reimbursement (claims databases) published between 2015 and 2019. The time frame was reduced in the second group to include only the most recently published algorithms, which were assumed to be based on a combination of ICD-9 and ICD-10 codes as well as potentially on all currently available PAH-specific drugs, and therefore represent an improved version of past algorithm application strategies. When several algorithms were similar, only the most inclusive algorithm was selected for further evaluation. Due to the limited number of CTEPH algorithms identified in the literature, all were selected for further evaluation.
Evaluation of code-based algorithm performance using the diagnostic predictive modelling tool, PheValuator
In order to cover the age groups mainly affected by PAH and CTEPH, the following three US claims datasets were used in the evaluation of algorithm performance: Optum® De-Identified Clinformatics® Data Mart Database (OptumInsight, Eden Prairie, MN), ages ≥18 years (‘Optum’); IBM® MarketScan® Commercial Claims and Encounters Database, ages 18–62 years (‘CCAE’, limited to ≤62 years to observe a patient for at least three years post-inclusion); and IBM® MarketScan® Medicare Supplemental and Coordination of Benefits Database, ages ≥66 years (‘Medicare’, limited to ≥66 years to observe a patient for at least one year pre-inclusion). Data used included patient records from 1 January 2010 until 30 March 2019, inclusive. The extent of overlap between patients in Optum® and IBM® MarketScan® datasets is unknown and impossible to estimate due to data anonymisation. However, this potential duplication is likely to be sufficiently small to allow for the presentation of averages across all datasets. The Optum® and IBM® MarketScan® databases used in this study were reviewed by the New England Institutional Review Board. Studies conducted in these databases were determined to be exempt from ethics approval as they do not qualify as human subjects research.
Using the ATLAS tool within the Observational Health Data Sciences and Informatics (OHDSI) toolset, 12 PAH and CTEPH cohorts were created in the three selected datasets, based on the code-based algorithms identified in the literature. When available, the exact source codes were used for creating the cohorts, otherwise standardised coding concepts from the Common Data Model Vocabularies of the Observational Medical Outcomes Partnership 13 were applied. Diagnosis codes based on ICD-9 were translated into pre-October 2017 ICD-10 and vice versa per published conversion suggestions, i.e. 416.0 to/from I27.0, and 416.8 to/from I27.2. 8 This conversion was performed to allow PAH and CTEPH patient identification according to both of these past coding conventions.
The cohorts reflecting the code-based algorithms were then exported into R and evaluated using the OHDSI R package PheValuator. 10 Disease-specific diagnostic predictive models (i.e. for PAH and CTEPH separately) were developed for each dataset, using logistic regression with Least Absolute Shrinkage and Selection Operator L1-regularisation. 14 The models yielded sets of weighted predictors for the diseases of interest. Among the most important predictors from the PAH models were history of pulmonary heart disease or PH, echocardiography and phosphodiesterase type-5 inhibitors. For CTEPH, similar important predictors as for PAH were obtained, with the addition of embolism and pulmonary ventilation/perfusion imaging. These predictors were used to estimate the probability of a patient having PAH or CTEPH in a randomly selected cohort from each dataset. In the absence of a diagnostic standard defining patients’ true disease status (i.e. based on clinical assessment results and laboratory values), these predictions defined a probabilistic gold standard for evaluation, with the assumption that the probability of an individual having PAH or CTEPH based on the predictive model corresponded to the true disease status of that individual.
The PAH and CTEPH algorithms selected for evaluation were compared with the respective probabilistic gold standard. The resulting estimates for the number of true positives (TP), false positives (FP), true negatives (TN) and false negatives (FN) were used to calculate the following: (i) sensitivity (defined as TP/(TP + FN)), (ii) specificity (defined as TN/(TN + FP)), (iii) PPV (defined as TP/(TP + FP)) and (iv) NPV (defined as TN/(TN + FN)).
Results
The systematic literature review returned a total of 59 publications describing code-based algorithms developed to identify patients with PAH (n = 50), with CTEPH (n = 5) or with either PAH or CTEPH (n = 4). The majority of the publications (n = 43 (72.9%)) identified patients in US databases, and others were from the UK (n = 5), Canada (n = 4), France (n = 2), Taiwan (n = 2), Norway (n = 1), Spain (n = 1) and South Korea (n = 1).
PAH
Identification of published code-based algorithms
In total, 51 unique algorithms describing PAH in an administrative database were identified in the literature (Fig. 1; Supplementary Table 2). Of the 48 algorithms using diagnosis codes, 38 were based on ICD-9 (79.2%), five on ICD-10 (10.4%) and four on a combination of ICD-9 and ICD-10 (8.3%), as well as one algorithm on UK-specific Read codes (2.1%), which was subsequently excluded from this evaluation on the basis of being too database-specific and requiring substantial modification to convert into ICD-9/-10.
Summary of PAH algorithms identified and selected for evaluation using PheValuator.
Evaluation of code-based algorithm performance using the diagnostic predictive modelling tool, PheValuator
Performance characteristics of code-based PAH algorithms: averages and ranges of sensitivity and PPV across Optum, CCAE and Medicare databases.
CCB: calcium-channel blocker; Dx: diagnosis code; PAH: pulmonary arterial hypertension; PDE5i: phosphodiesterase type-5 inhibitor; PH: pulmonary hypertension; PPV: positive predictive value; RHC: right-heart catheterisation; Rx: drug code.
Note: PAH-specific Rx: sildenafil, tadalafil, bosentan, ambrisentan, macitentan, epoprostenol, iloprost, treprostinil, selexipag and riociguat.
The highest average sensitivities across the databases were 48.1% (range: 47.0–48.7%) using an algorithm published by Papani et al. (at least one primary/secondary PH diagnosis code (outpatient setting)) 15 and 61.5% (range: 60.1–64.2%) using the algorithm reported by Choi et al. (at least one primary/secondary PH diagnosis code (any setting); Table 1). 16 The highest average PPVs across the databases were 63.3% (range: 47.6–83.8%) in the algorithm by Burke et al. (at least one primary/secondary PH diagnosis code (outpatient) + at least one primary/secondary PH diagnosis code (inpatient) + at least one PAH drug code), 17 and 66.0% (range: 49.8–83.9%) for another algorithm reported by Papani et al. (at least one primary/secondary PH diagnosis code (outpatient) + at least two classes of PAH drug codes). 15
Fig. 2 presents the grouping of algorithms according to the number of algorithm components. Algorithms based on a single diagnosis code had comparatively moderate PPVs and differing sensitivities. The algorithms that were based on two components yielded sensitivities below 10%, and higher PPVs, with the exception of one algorithm that included a therapy not specific to PAH (calcium-channel blockers) that generated an outlier in terms of PPV. Algorithms based on three components also returned low sensitivities and comparatively high PPVs.
PAH algorithms: graphical presentation of average positive predictive value and average sensitivity across the three databases examined (Optum, CCAE and Medicare). Code-based algorithms including a single diagnosis code are shown as green dots, those based on two components as orange dots and those based on three components as red dots.
Comparison of published PAH algorithm performance results with presented PheValuator averages across Optum, CCAE and Medicare databases.
Performance evaluated twice in the same publication.
Dx: diagnosis code; NPV: negative predictive value; PAH: pulmonary arterial hypertension; PDE5i: phosphodiesterase type-5 inhibitor; PH: pulmonary hypertension; PPV: positive predictive value; Rx: drug code.
Note: PAH-specific Rx: sildenafil, tadalafil, bosentan, ambrisentan, macitentan, epoprostenol, iloprost, treprostinil, selexipag and riociguat.
CTEPH
Identification of published code-based algorithms
A total of 11 unique CTEPH algorithms were identified (Supplementary Table 4), of which four applied diagnosis codes only, six applied a combination of diagnosis and procedure codes, and one applied a diagnosis code plus either a procedure or a drug code. Diagnosis codes were based on ICD-9 in seven algorithms (63.6%), on ICD-10 in two (18.2%) and a combination of both in two (18.2%).
Evaluation of code-based algorithm performance using the diagnostic predictive modelling tool, PheValuator
Performance characteristics of code-based CTEPH algorithms: averages and ranges of sensitivity and PPV across Optum, CCAE and Medicare databases.
CTA: computed tomography angiography; CTEPH: chronic thromboembolic pulmonary hypertension; Dx: diagnosis code; ECG: electrocardiography; Echo: echocardiography; HRCT: high-resolution computed tomography; LT: lung transplant; MRI: magnetic resonance imaging; PA: pulmonary angiography; PAH: pulmonary arterial hypertension; PE: pulmonary embolism; PEA: pulmonary endarterectomy; PH: pulmonary hypertension; PPV: positive predictive value; RHC: right-heart catheterisation; Rx: drug code; V/Q scan: ventilation/perfusion scan; VTE: venous thromboembolism.
Note: PAH-specific Rx: sildenafil, tadalafil, bosentan, ambrisentan, macitentan, epoprostenol, iloprost, treprostinil, selexipag and riociguat.
The highest average sensitivities across databases were 20.4% (range: 18.9–22.1%) and 20.7% (range: 17.5–22.6%) observed for two algorithms reported by Tapson et al. (at least one pulmonary embolism (PE) diagnosis code + at least one PH-related symptom + at least one code for PH-specific procedure; and at least one PE diagnosis code + at least one PH-related symptom, respectively). 18 The highest PPV averages across databases were 55.1% (range: 33.0–68.1%) for the algorithm described by Martinez et al. (at least one venous thromboembolism diagnosis code + at least one primary PH diagnosis code + at least one therapy (lung transplant / pulmonary endarterectomy / PAH drug code)) 19 and 75.1% (range: 67.4–87.9%) for the algorithm by Teal et al. (at least one PE diagnosis code + PH-related symptoms + at least one diagnosis code for chronic PE or PH without primary PH + code for PH-unspecific procedure + code for PH-specific procedure). 20
The algorithm providing the highest accuracy when considering all four performance characteristics was reported by Said et al. (at least one PE diagnosis code + first diagnosis code for PH + procedure code for echocardiography or right-heart catheterisation (RHC) + second diagnosis code for PH), which had an average sensitivity of 6.0% (range: 4.4–6.9%), PPV of 48.7% (range: 28.4–66.7%), with specificity and NPV of ≥99.8%. 21
Fig. 3 presents the grouping of algorithms according to their complexity. Algorithms based on two components had differing PPVs and low to moderate sensitivities. Algorithms based on three components yielded differing sensitivities together with higher PPVs. The most complex algorithms (≥4 components) had very low sensitivities and comparatively high PPVs.
CTEPH algorithms: graphical presentation of average positive predictive value and average sensitivity across the three databases examined (Optum, CCAE and Medicare). Code-based algorithms including two components are shown as green dots, those based on three components as orange dots and those based on at least four components as red dots.
Comparison of published CTEPH algorithm performance results with presented PheValuator averages across Optum, CCAE and Medicare databases.
Performance evaluated twice in the same publication. In the original publication, CTEPH patients were identified applying a code-based algorithm in addition to screening complementary general practitioner information and clinical notes, which may increase the probability of identifying CTEPH patients. This additional information was not available for evaluation in the present study.
CTA: computed tomography angiography; CTEPH: chronic thromboembolic pulmonary hypertension; Dx: diagnosis code; ECG: electrocardiography; Echo: echocardiography; HRCT: high-resolution computed tomography; LT: lung transplant; MRI: magnetic resonance imaging; NPV: negative predictive value; PA: pulmonary angiography; PAH: pulmonary arterial hypertension; PE: pulmonary embolism; PEA: pulmonary endarterectomy; PH: pulmonary hypertension; PPV: positive predictive value; RHC: right-heart catheterisation; Rx: drug code; V/Q scan: ventilation/perfusion scan; VTE: venous thromboembolism.
Note: PAH-specific Rx: sildenafil, tadalafil, bosentan, ambrisentan, macitentan, epoprostenol, iloprost, treprostinil, selexipag and riociguat.
Discussion
The aim of this study was to evaluate the performance characteristics of published code-based algorithms identifying patients with PAH or CTEPH in administrative databases. In total, 15 PAH and 11 CTEPH algorithms were included in the evaluation, and a complete set of performance characteristics for each were provided, making this the first study of its kind. PPV increased with increasing algorithm complexity and was inversely related to sensitivity. Since specificity and NPV were similarly high (≥97.5%) for all algorithms, these accuracy measures were not the main focus of the study.
In the current study, a PPV of ≥ 60% was reported for three of the PAH algorithms evaluated but these also reported very low sensitivity (2.7–3.7%).15,17,22 Conversely, the highest sensitivity obtained using PheValuator was 61.5% in an algorithm reporting a low PPV (13.4%). 16 For the six PAH algorithms with previously published performance results,15,23–26 increasing algorithm complexity and recentness of published evaluation were factors that contributed to achieving similar PPV results between the PheValuator predictive modelling and the published performance results. Overall, algorithm performance depended on the number of components used. For PAH, none of the evaluated algorithms achieved an overall best performance in accurately identifying patients. Thus, despite the large number of published algorithms, there may be an unmet need for an algorithm that accurately identifies PAH patients with a good balance of all parameters.
Two of the CTEPH algorithms evaluated reported a PPV of ≥ 50%, the highest sensitivity according to PheValuator was 21%.18–20 The algorithm with the best performance results across all parameters was ‘at least one PE diagnosis code + first diagnosis code for PH + procedure code for echocardiography or RHC + second diagnosis code for PH’, 21 with a PPV of 48.7% and a sensitivity of 6.0%. Comparison of the three CTEPH algorithms with previously published performance results19,20,27 to PheValuator predictive modelling was difficult due to the incomplete set of performance characteristics published. Where results were available, specificity, PPV and NPV were reasonably similar but sensitivity estimates obtained using PheValuator were much lower than the values reported in the original publication.
When comparing PheValuator results with published algorithm evaluations, differences in coding and reporting practices as well as in the amount of information available – between countries, healthcare centres and systems, database types and over time – may play a role in algorithm performance variability between different databases. This has been demonstrated in studies where identical algorithms were evaluated in two different settings and returned different results.15,23,24 In addition, the lack of exact codes in the original publications limited the recreation of identical cohorts.
Disease prevalence in the study population influences the PPV, such that the lower the prevalence in the study population, the lower the PPV. Future work should therefore consider evaluating PAH and CTEPH algorithms in a PH population rather than the general population captured in US claims databases. Furthermore, algorithms may identify only patients with severe disease, who have all the codes in the algorithm present in their claims records. Whether this is the case for more complex algorithms with low sensitivity needs to be assessed with patient-level clinical data, such as comorbidities, symptoms and PAH-/CTEPH-related procedure results.
Strengths and limitations
An important strength of the methodology applied is its extendibility and adaptability to various types of administrative databases and databases from different countries, provided that these databases are available in coded structured format. Despite not being validated yet, PheValuator facilitated the evaluation of several code-based algorithms across different databases, which, given the large number of algorithms evaluated, would not have been as efficient via a manual medical chart review. Thus, it provides a useful alternative for algorithm evaluation. PheValuator will likely be updated over time, allowing for more flexibility in the underlying factors and assumptions that drive the predictive modelling. 10 Should this evaluation be repeated in the future, the results may vary. In addition, the accuracy of the algorithms depends not only on correct coding by healthcare professionals, but also on the accuracy of the codes within the established coding systems. Analysing algorithms that did not include the new disease-specific ICD-10 codes (October 2017 update) in more recent datasets may have limited the overall accuracy of the algorithms evaluated.
Recommendations based on published algorithms
Summary of recommended published algorithms according to research objective.
CTA: computed tomography angiography; CTEPH: chronic thromboembolic pulmonary hypertension; Dx: diagnosis code; Echo: echocardiography; PA: pulmonary angiography; PAH: pulmonary arterial hypertension; PE: pulmonary embolism; PH: pulmonary hypertension; RHC: right-heart catheterisation; Rx: drug code; V/Q scan: ventilation/perfusion scan.
Conclusion
This study evaluated code-based algorithms used to identify patients with PAH or CTEPH against administrative databases and provided an approximate measure of their accuracy. Published algorithms were able to identify PAH and CTEPH patients with high specificity and NPV, but with widely variable low-to-moderate sensitivity and PPV. The objective of a study will determine the best-suited algorithm for patient identification in large administrative databases, from one- or two-component algorithms that produce the most inclusive study population to three- or four-component algorithms for identification of the most precise study population, for PAH and CTEPH, respectively.
Supplemental Material
sj-pdf-1-pul-10.1177_2045894020961713 - Supplemental material for Evaluation of code-based algorithms to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients in large administrative databases
Supplemental material, sj-pdf-1-pul-10.1177_2045894020961713 for Evaluation of code-based algorithms to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients in large administrative databases by Viviane P. Sprecher, Eva-Maria Didden, Joel N. Swerdel and Audrey Muller in Pulmonary Circulation
Supplemental Material
sj-pdf-2-pul-10.1177_2045894020961713 - Supplemental material for Evaluation of code-based algorithms to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients in large administrative databases
Supplemental material, sj-pdf-2-pul-10.1177_2045894020961713 for Evaluation of code-based algorithms to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients in large administrative databases by Viviane P. Sprecher, Eva-Maria Didden, Joel N. Swerdel and Audrey Muller in Pulmonary Circulation
Footnotes
Guarantor
A.M. will act as guarantor for integrity of data and its reporting in this manuscript.
Author contributions
All authors contributed to the conception and design of the study, analysis and interpretation of the data, and critical revision of the manuscript.
Conflict of interest
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: V.P.S., E.-M.D. and A.M. are employees of Actelion Pharmaceuticals Ltd. J.N.S. is an employee of Janssen Pharmaceuticals.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by Actelion Pharmaceuticals Ltd, Allschwil, Switzerland, a Janssen Pharmaceutical Company of Johnson & Johnson.
Acknowledgements
Medical writing and editorial support were provided by Zoe Schafer, PhD, and Richard McDonald (Watermeadow Medical, an Ashfield Company, part of UDG Healthcare plc), funded by Actelion Pharmaceuticals Ltd.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
