Beyond convenience: practical considerations with using routine health data for evaluations

Abstract

Routine health information systems (RHIS) comprise data collected at regular intervals at public, private, and community-level health facilities and institutions and health programs. This article looks at how this data may be used for evaluations, and the reasons behind why some are optimistic about this and some have concerns.

As digital routine health information systems (RHIS) become more prevalent, there is a growing interest in using them for evaluations. Rakha et al.¹ used data from an integrated management of childhood illness (IMCI) database to evaluate under-5 mortality and the timing of IMCI implementation in 213 districts in Egypt. Bennett et al.² used data from Zambia’s RHIS to evaluate an insecticide-treated net program and confirmed outpatient malaria incidence. Lalla-Edward et al.³ used data collected from nine roadside wellness centers in South Africa to evaluate the services accessed by long-distance truck drivers and the drivers’ sociodemographic characteristics. Brennan et al.⁴ used data from the National Health Service (NHS) in Leicester to assess hospital cost efficiency.

Some researchers are optimistic about this underutilized data source. Wagenaar et al.⁵ discussed how certain evaluation designs, such as interrupted time series analysis, are better suited to routine data than to population-based surveys. Others are cautious about routine data’s limitations; Ashton et al.⁶ expressed ‘concerns over internal validity, completeness and potential bias in estimates of program and intervention impact’, and Bennett et al.² argued that RHIS data cannot be used for rigorous program evaluation without complex modeling strategies.

MEASURE Evaluation used RHIS data to conduct several process and outcome evaluations in low- and middle-income countries (LMIC). From these, we have insights not yet mentioned in the literature on several topics: RHIS data access, quality, and relevance.

National RHIS belong to national governments, to monitor and evaluate health service delivery, health system outcomes, and health outcomes and to improve lives. Ownership and privacy rights are assumed. Often, though, nongovernmental actors are credentialed to build the systems and capture and access data. Furthermore, systems themselves may be housed on servers paid for by nongovernmental partners. Governments have the right to control what is collected, used, and shared⁷ and to whom they give access. Patients in some countries, like the United Kingdom, can choose whether their health records are used for evaluation.⁸

Depending on the type of data (sensitive, such as HIV status or patient-level records, or important for national security, such as birth and death registrations) and the research purpose, the levels of approval can be numerous, complex, and time-consuming. Yet, these elements of a data system and data sharing should not be perceived as mere procedural hurdles. They reflect the ethical integrity of the system and processes.

A central concern of using routine data for research and evaluations is whether the data can be trusted and whether the data systems and the data stewards are trustworthy. Data quality⁹ comprises accuracy, reliability, precision, completeness, timeliness, integrity, and confidentiality. The most common metrics – accuracy, completeness, and timeliness – contribute significantly to whether reliable conclusions can be drawn from the data. For mature and well-established routine data systems such as the NHS, patient privacy, data security, and public trust are a preoccupation.⁸ The NHS with its large, consistent, and clean data has been called a ‘goldmine’ for entities interested in voluminous data for pharmaceutical research and machine learning.¹⁰ In addition, concerns have been raised about the use of public assets for commercial and financial gain.¹¹

Disease-response definitions, such as confirmed malaria versus suspected malaria, are often conflated in reporting processes. Furthermore, updating and distributing data collection tools such as facility registers can be years behind changes in national policies and indicator definitions. Local contextual changes (e.g. in administrative units) can also affect the data’s scope. Finally, updating the data in electronic systems can introduce errors; changes in program outcome estimates affect the ability to determine trends over time.

An evaluation using RHIS data alone should ensure that those data accurately represent indicators of interest. This requires understanding each indicator’s definition, how it is disaggregated (e.g. by age and sex), its original data source(s) (e.g. registers), the levels where it is aggregated (e.g. facility and district levels), how it is reported and on which forms (e.g. monthly reporting forms), its frequency of collection and reporting, and how it is represented in the RHIS. Countries with robust RHIS have indicator reference documents, but details are usually incomplete and additional steps are required to map indicator numerators and denominators from their original sources of collection to the RHIS. RHIS data elements are often abbreviated and do not intuitively reflect the names on indicator reference sheets and/or source documents. These discrepancies are rarely documented. Furthermore, many indicators in the RHIS require denominators derived from population-level data sets. Subnational unit or subpopulation estimates are often mathematically modeled from national-level data and come with their own data quality issues. Researchers may not have access to the models used to determine denominators and cannot easily assess how appropriate the models are for the variables used.

Wagenaar, Ashton, and Bennett discuss the methods possible with data generated from RHIS and inherent limitations. However, not until the data are accessed and extracted can the final specification of models take place. Although RHIS data provide repeated measures over time, several factors affect their utility for this purpose: changes in indicator definition, availability of population-based denominators, splitting of administrative units, and levels of disaggregation. Multi-country analyses have similar limitations. For example, indicator elements may be defined or calculated dissimilarly. Finally, when new data sets are created, it introduces provenance management or data origination issues which could become problematic if the software used to manage them are not ‘provenance-aware’.¹²

Using data in hand is inevitably cheaper than collecting new data, but cost is not the only consideration. We have described issues of data access, quality, and relevance for methods and questions. The use of routine data to inform population health decisions should continue to be explored and refined. But the current limitations of routine data inherently restrict the questions these data can answer.

Footnotes

ORCID iD

Stephanie Watson-Grant

References

Rakha

Abdelmoneim

Farhoud

et al. Does implementation of the IMCI strategy have an impact on child mortality? A retrospective analysis of routine data from Egypt. BMJ Open 2013;3(1):e001852.

Bennett

Yukich

Miller

et al. A methodological framework for the improved use of routine health system data to evaluate national malaria control programs: evidence from Zambia. Popul Health Metr 2014;12(1):30.

Lalla-Edward

Ncube

Matthew

et al. Uptake of health services among truck drivers in South Africa: analysis of routine data from nine roadside wellness centres. BMC Health Serv Res 2017;17(1):649.

Brennan

Sampson

Deverill

Can we use routine data to evaluate organizational change? Lessons from the evaluation of business process re-engineering in a UK teaching hospital. Health Serv Manage Res 2005;18(4):265–76.

Wagenaar

Sherr

Fernandes

et al. Using routine health information systems for well-designed health evaluations in low- and middle-income countries. Health Policy Plan 2015;31(1):129–35.

Ashton

Bennett

Yukich

et al. Methodological considerations for use of routine health information system data to evaluate malaria program impact in an era of declining malaria transmission. Am J Trop Med Hyg 2017;97(Suppl. 3):46–57.

Spigel

Wambugu

Villella

mHealth data security, privacy, and confidentiality: guidelines for program implementers and policy makers. Chapel Hill, NC: MEASURE Evaluation; 2018. Available online at: https://www.measureevaluation.org/resources/publications/ms-17-125a

Mathers

Sullivan

Dhillon

et al. The use of NHS patient data: report by the National Data Guardian for Health and Care. Br J Gen Pract 2017;67(655):56–57.

Global Fund to Fight AIDS, Tuberculosis and Malaria & MEASURE Evaluation. Data quality audit tool: guidelines for implementation. Chapel Hill, NC: MEASURE Evaluation, University of North Carolina; 2008. Available online at: https://www.measureevaluation.org/resources/publications/ms-08-29?searchterm=data+quality+audit+

10.

Meadway

NHS Data Is a Goldmine. It Must be Saved From Big Tech. The Guardian, 9 December 2019. Available online at: https://www.theguardian.com/commentisfree/2019/dec/09/nhs-data-goldmine-value-private-companies (last accessed 17 February 2020).

11.

Kollewe

NHS Data Is Worth Billions – But Who Should Have Access to it? The Guardian, 10 June 2019. Available online at: https://www.theguardian.com/society/2019/jun/10/nhs-data-google-alphabet-tech-drug-firms (last accessed 17 February 2020).

12.

Curcin

Soljak

Majeed

Managing and exploiting routinely collected NHS data for research. Inform Prim Care 2012;20(4):225–31.