Abstract
Objectives:
Having accurate influenza vaccination coverage estimates can guide public health activities. The objectives of this study were to (1) validate the accuracy of electronic health record (EHR)–based influenza vaccination data among pregnant women compared with survey self-report and (2) assess whether survey respondents differed from survey nonrespondents by demographic characteristics and EHR-based vaccination status.
Methods:
This study was conducted in the Vaccine Safety Datalink, a network of 8 large medical care organizations in the United States. Using EHR data, we identified all women pregnant during the 2018-2019 or 2019-2020 influenza seasons. Surveys were conducted among samples of women who did and did not appear vaccinated for influenza according to EHR data. Separate surveys were conducted after each influenza season, and respondents reported their influenza vaccination status. Analyses accounted for the stratified design, sampling probability, and response probability.
Results:
The survey response rate was 50.5% (630 of 1247) for 2018-2019 and 41.2% (721 of 1748) for 2019-2020. In multivariable analyses combining both survey years, non-Hispanic Black pregnant women had 3.80 (95% CI, 2.13-6.74) times the adjusted odds of survey nonresponse; odds of nonresponse were also higher for Hispanic pregnant women and women who had not received (per EHR data) influenza vaccine during current or prior influenza seasons. The sensitivity, specificity, and positive predictive value of EHR documentation of influenza vaccination compared with self-report were ≥92% for both survey years combined. The negative predictive value of EHR-based influenza vaccine status was 80.5% (95% CI, 76.7%-84.0%).
Conclusions:
EHR-based influenza vaccination data among pregnant women were generally concordant with self-report. New data sources and novel approaches to mitigating nonresponse bias may be needed to enhance influenza vaccination surveillance efforts.
Influenza viruses cause substantial morbidity and mortality every year in the United States and globally,1-4 including among pregnant women who encounter higher risk of severe disease compared with nonpregnant women of similar age.5-7 Vaccination is the primary means of influenza prevention,8,9 and improving influenza vaccination coverage is a national public health priority. 10 Influenza vaccination coverage among pregnant women in the United States is primarily monitored through in-person,11,12 telephone, 13 and internet-based surveys,5,14,15 with respondents asked to report their vaccination status. While survey-based methods for assessing influenza vaccination coverage have strengths, important limitations are operant.13,16 Self-report of vaccination may be inaccurate by mistake or because of social desirability bias, survey response rates are generally falling, and survey-based methods are subject to nonresponse bias. 17 In addition, if survey response is more likely from vaccinated people than from unvaccinated people, 18 influenza vaccination coverage based on national surveys could overestimate true coverage.
Electronic health record (EHR) 19 and claims data 20 have also been used to estimate influenza vaccination coverage during pregnancy, an approach that may mitigate some limitations of survey-based methods. EHR data can be used to reliably identify pregnant women, 21 assess other risk factors for influenza disease,8,9 and obtain demographic information such as race and ethnicity. In addition, if EHR data indicate that a pregnant woman was vaccinated during the current influenza season, the positive predictive value (PPV) is high compared with self-report. 18 However, if EHR data do not indicate influenza vaccination, it is possible that a pregnant woman was vaccinated outside the medical home, such as at a workplace or pharmacy.18,22 In this circumstance, EHR data would underestimate true coverage.
Having accurate and timely influenza vaccination coverage estimates, particularly in groups at increased risk of serious influenza disease,5-7 is necessary to guide public health activities8,9 toward achieving national public health goals. 10 The objectives of the current study were to (1) assess by survey the validity of EHR-based influenza vaccination data among pregnant women and (2) assess whether survey respondents differed from survey nonrespondents by demographic characteristics and EHR-based vaccination status.
Methods
Study Setting
This study was conducted in the Vaccine Safety Datalink (VSD), a collaboration between the Centers for Disease Control and Prevention and 8 large medical care organizations (referred to as “sites”).23-25 Participating VSD sites included Marshfield Clinic, HealthPartners, Denver Health, Kaiser Permanente (KP) Washington, KP Northwest, KP Northern California, KP Southern California, and KP Colorado. Denver Health cares for publicly insured, privately insured, and uninsured patients26,27; the remaining VSD sites primarily serve insured patients.23-25 The demographic and socioeconomic characteristics of the VSD population are similar to those of the US population, 25 while influenza vaccination coverage among pregnant women in the VSD is generally higher than coverage indicated by national estimates. 19
Vaccination data at VSD sites, which are used for clinical and research purposes, are derived from several data sources. Vaccines ordered and administered within VSD sites represent a high proportion of available vaccine records. In addition, vaccine data from claims and state immunization information systems (at 6 VSD sites) are added to a patient’s EHR.19,28,29 Influenza vaccines administered in pharmacies would be integrated into EHR-based vaccination data if a claim were submitted to insurance or if the pharmacy submitted vaccine data to a state immunization information system. In the current study, the term “EHR-based vaccination data” refers to all available electronic vaccination data integrated into an individual’s record.
Study Overview
We identified all women at VSD sites who were pregnant during the 2018-2019 or 2019-2020 influenza season. We then conducted surveys among a sample of pregnant women who did and did not appear vaccinated during the same season according to EHR data. Pregnant women were surveyed during only 1 influenza season. Respondents were asked to report their influenza vaccination status for the current season, which was compared with EHR-based vaccination data. We considered vaccination self-report as the criterion standard.
Study Population: Identification of Pregnant Women
The VSD has developed and validated an algorithm to identify pregnancies with a high degree of accuracy. 21 The algorithm can identify current pregnancies in near-real time, using weekly updates of VSD EHR-based data. 30 The following data sources are used: pregnancy-related International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) codes, 31 prenatal procedure codes, and estimated delivery and last menstrual period dates from obstetric flowsheets.21,30
Using this algorithm, we identified all women aged 18-49 years who were pregnant at any time from August 1 through January 31 of the respective influenza season (eg, for the 2018-2019 survey, pregnant from August 1, 2018, through January 31, 2019). We required women to have continuous health insurance enrollment through the influenza season (for Denver Health, 1 outpatient visit was used as a proxy for health insurance enrollment). We excluded women with an ICD-10-CM code indicating an adverse pregnancy outcome such as spontaneous abortion, stillbirth, or anencephaly. We excluded from analysis women with multiple pregnancy episodes during a single influenza season, women with a diagnosis code for a vaccine allergy, and women with presumed influenza vaccine data errors (eg, receipt of a nonindicated vaccine, such as high-dose influenza). For the 2018-2019 survey, 439 of 107 126 pregnant women were excluded for vaccine allergy or invalid vaccine data; for the 2019-2020 survey, 409 of 97 661 pregnant women were excluded.
Survey Design and Administration
Sampling procedures
After identifying all eligible pregnant women in the VSD as our sampling frame, we randomly sampled women for survey administration. We stratified sampling for each survey year by VSD site and EHR-based influenza vaccination status, with women who appeared unvaccinated in EHR data oversampled. The total sample of pregnant women was 1247 for the 2018-2019 survey and 1748 for the 2019-2020 survey. For the 2018-2019 survey, we sampled 999 unvaccinated (per EHR data) and 248 vaccinated (per EHR data) women; for the 2019-2020 survey, the numbers sampled were 1460 unvaccinated and 288 vaccinated. In the 2018-2019 survey, non-Hispanic Black women were less likely to respond than women from other racial and ethnic groups; consequently, non-Hispanic Black women were oversampled for the 2019-2020 survey. With a sample of 999 EHR-unvaccinated women (ie, the 2018-2019 survey), a 50% response rate, and 60% of EHR-unvaccinated women reporting they were unvaccinated, the study was powered to achieve a CI width of 8.8% around the estimated proportion confirming their vaccination status.
Survey instrument
We developed and pilot-tested a survey instrument with 8 individuals, and then revised it accordingly. We based survey questions on published survey instruments; whenever possible, we used exact wording from prior instruments.32-38 Survey content included whether the respondent received an influenza vaccine and, if so, at what location; whether influenza vaccine had been recommended and offered by a health professional; if the respondent was not vaccinated, the primary reason why not; and attitudes about influenza vaccination. We also asked women to confirm that they were currently or had recently been pregnant; we excluded from analyses women who did not confirm pregnancy on the survey.
Survey administration
For the 2018-2019 influenza season, survey administration began March 15, 2019; for the 2019-2020 influenza season, survey administration began February 18, 2020. Surveys were in the field for 15 weeks. Women received up to 3 mailed surveys, up to 5 emails with a unique hyperlink to an internet-based survey, and up to 2 automated telephone reminders. Outreach stopped once the survey was completed or a person requested no further contact. In nearly all aspects, survey administration was consistent across survey years and VSD sites. One VSD site required that participants receive a pre-survey letter with an opportunity to opt out from the survey and did not permit email contact; consequently, at this site, participants received an additional mailed survey. For the 2018-2019 influenza season, a Spanish-language version of the survey was sent to women with an EHR designation of preferred language Spanish. Because of resource constraints, the 2019-2020 survey was available in English only. The internet form of the survey used Research Electronic Data Capture. 39 Respondents received a $20 gift card for completing the survey.
Analytic Methods
Pregnant women were considered survey respondents if they answered the primary survey question of “Since July 1 of [the current influenza season] have you had a flu vaccination?” We used the Pearson χ2 test to compare respondents with nonrespondents, with P < .05 considered significant. Self-reported vaccination status was treated as the criterion standard for all analyses. We accounted for the stratified sampling design, included a finite population correction, and incorporated inverse probability weighting for sampling and survey response probabilities. For the 2018-2019 season analyses, sampling weights accounted for VSD site and EHR vaccination status; for the 2019-2020 season analyses, weighting also accounted for oversampling non-Hispanic Black women. Weighted percentages for survey responses and EHR vaccination validity measures (sensitivity, specificity, PPV, and negative predictive value [NPV]) were reported with Clopper–Pearson 95% CIs. 40 In the context of this study, sensitivity is interpreted as follows: among all individuals who self-report influenza vaccination, what percentage have EHR documentation of influenza vaccination? When presenting descriptive frequencies, we combined survey results across influenza seasons. We estimated κ coefficients and associated CIs by using bootstrap replication variance estimation.
Two multivariable logistic regression models were developed: a model assessing factors associated with survey nonresponse (among the entire sample) and a model assessing factors associated with self-reporting influenza vaccination (among women unvaccinated according to EHR data). We explored correlation within VSD site and tested for the interaction between race and ethnicity and vaccination status. Covariates, identified a priori based on prior publications,11,14,15,19 included the following: age group (18-24, 25-34, 35-49 y), race and ethnicity (non-Hispanic White, non-Hispanic Black, Hispanic, non-Hispanic Asian, non-Hispanic Other [multiracial, Hawaiian, American Indian/Alaska Native, and unspecified other race and ethnicity]), chronic health condition (yes/no), gravidity (first pregnancy, pregnant before), vaccinated for influenza in prior season (yes/no), health care provider vaccination recommendation (yes/no), health insurance type (private, Medicaid, none), education (≤high school, some college/college graduate, advanced degree), annual household income (<$75 000, ≥$75 000), and VSD site. Covariates were retained in the final models regardless of significance. Multivariate analyses accounted for the study design and sampling strategy, 41 study year, VSD site, EHR vaccination status, and race and ethnicity. VSD site was treated as a fixed effect. In addition, we conducted a subanalysis among pregnant women unvaccinated according to EHR data, examining factors associated with self-reporting influenza vaccination (eg, EHR false negatives). We conducted all analyses using SAS version 9.4 (SAS Institute, Inc).
The KP Colorado Institutional Review Board approved this study, and other study sites ceded oversight to KP Colorado. Written consent was not required for survey administration, and survey participants could opt out in writing or by telephone.
Results
Survey Sample
For the 2018-2019 influenza season, a total of 106 687 survey-eligible pregnant women were identified, of whom 1247 (1.2%) were sampled for survey administration (Table 1). For the 2019-2020 influenza season, a total of 97 244 survey-eligible pregnant women were identified, of whom 1748 (1.8%) were sampled.
Characteristics of respondents and nonrespondents to surveys administered to women who were pregnant during the 2018-2019 and 2019-2020 influenza seasons, Vaccine Safety Datalink study sites a
Abbreviation: EHR, electronic health record.
Vaccine Safety Datalink23-25 sites are located in California, Colorado, Minnesota, Oregon, Washington, and Wisconsin.
Individuals unvaccinated for influenza in the current season (per EHR) were oversampled in both survey years; non-Hispanic Black women were oversampled in the 2019-2020 survey.
Using the Pearson χ2 test, with P < .05 considered significant.
For respondents and nonrespondents, data on race and ethnicity were obtained from the EHR. Other race and ethnicity included 37 multiracial, 15 Hawaiian, and 9 American Indian/Alaska Native women, as well as 19 women with an unspecified other race and ethnicity.
Survey Response
For the 2018-2019 influenza season, 630 of 1247 (50.5%) women responded; for the 2019-2020 influenza season, 721 of 1748 (41.2%) women responded (Table 1). Nearly all respondents (94.9% in 2018-2019, 97.1% in 2019-2020) confirmed they were or had been pregnant. Response rates were highest among non-Hispanic White pregnant women (62.0% in 2018-2019, 58.6% in 2019-2020) and lowest among non-Hispanic Black pregnant women (26.7% in 2018-2019, 29.9% in 2019-2020). Respondents differed significantly from nonrespondents by age group, race and ethnicity, and EHR-documented influenza vaccination in current and prior influenza seasons (all P < .001).
In multivariable analyses, non-Hispanic Black pregnant women had 3.80 (95% CI, 2.14-6.74) times the adjusted odds of not responding to the survey compared with non-Hispanic White pregnant women (both survey years combined) (Table 2). The adjusted odds of nonresponse were higher for Hispanic pregnant women than for non-Hispanic White pregnant women and for those who had not received (per EHR data) influenza vaccine during current or prior seasons compared with those who had received the influenza vaccine.
Multivariable analyses of characteristics associated with nonresponse to influenza vaccination surveys administered to women who were pregnant during the 2018-2019 and 2019-2020 influenza seasons, Vaccine Safety Datalink study sites a
Abbreviations: aOR, adjusted odds ratio; EHR, electronic health record.
Vaccine Safety Datalink23-25 sites are located in California, Colorado, Minnesota, Oregon, Washington, and Wisconsin.
Each column represents a separate multivariable regression model; models were adjusted for all variables listed and for Vaccine Safety Datalink site.
For respondents and nonrespondents, data on race and ethnicity were obtained from the EHR.
Other race and ethnicity included multiracial, Hawaiian, and American Indian/Alaska Native women and women with an unspecified other race and ethnicity.
Net Implied Bias in Coverage Estimates
In the survey sample for both years combined, the sampling probability weighted prevalence of having received influenza vaccine (per EHR data) was 70.6% (95% CI, 66.2%-74.6%). The sampling probability weighted prevalence of vaccination (per EHR data) among survey respondents was 75.9% (95% CI, 71.0%-80.3%). The net implied bias in coverage would therefore be 5.3% overall. Stratified by race and ethnicity, the net implied bias in coverage would be 6.5% among non-Hispanic Black, 5.2% among non-Hispanic White, and 1.1% among Hispanic pregnant women.
EHR Vaccination Data Compared With Self-Report
Combining results across both survey years, among pregnant women who appeared vaccinated according to EHR data, 8 of 310 self-reported they were not vaccinated (weighted percent false positives: 0.6%; 95% CI, 0.1%-2.3%). Combining both survey years, among pregnant women who appeared unvaccinated according to EHR data, 304 of 1041 self-reported they were vaccinated (weighted percent false negatives: 19.5%; 95% CI, 16.0%-23.3%).
The sensitivity, specificity, and PPV of EHR documentation of influenza vaccination were high for 2018-2019, 2019-2020, and both years combined (Table 3). For example, the sensitivity for both years combined was 92.4% (95% CI, 89.9%-94.5%). The NPV was the lowest of the validity measures in 2018-2019, 2019-2020, and both years combined. The κ coefficient showed strong agreement (κ, both years combined, 0.84; 95% CI, 0.81-0.87) between EHR documentation of influenza vaccination and self-report.
Abbreviations: EHR, electronic health record; NA, not applicable.
Vaccine Safety Datalink23-25 sites are located in California, Colorado, Minnesota, Oregon, Washington, and Wisconsin.
All values are percentage (95% CI) unless otherwise indicated. Point estimates and 95% CIs accounted for sample design strata, sample and response weights, and finite population correction; κ coefficient and associated 95% CIs were estimated with bootstrap replication variance estimation; 95% CIs for all other measures were estimated with Clopper–Pearson.
Data on race and ethnicity were obtained from the EHR.
Other race and ethnicity included multiracial, Hawaiian, and American Indian/Alaska Native women and women with an unspecified other race and ethnicity.
The validity of EHR documentation of influenza vaccination was similar across age groups and chronic health condition status (Table 3). The NPV was lower among women who (per EHR data) had received influenza vaccine the prior season (53.9%; 95% CI, 41.7%-65.7%) than among women who were unvaccinated (per EHR data) in the prior season (85.7%; 95% CI, 81.8%-89.0%).
We conducted a subanalysis among pregnant women unvaccinated according to EHR data to examine factors associated with self-reporting influenza vaccination (eg, EHR false negatives). In multivariate analyses, for both years combined, the pregnancy being the first pregnancy, having influenza vaccination recommended by a health care provider, and having received (per EHR data) influenza vaccine in the prior season were positively associated with self-reporting influenza vaccination (Table 4).
Multivariable analyses of characteristics associated with failure to capture data on influenza vaccination receipt within EHR data (EHR false negatives) among women who were pregnant during the 2018-2019 and 2019-2020 influenza seasons, Vaccine Safety Datalink a
Abbreviations: aOR, adjusted odds ratio; EHR, electronic health record.
Vaccine Safety Datalink23-25 sites are located in California, Colorado, Minnesota, Oregon, Washington, and Wisconsin.
Each column represents a separate multivariate regression model; models were adjusted for all variables listed and Vaccine Safety Datalink site.
Data on race and ethnicity were obtained from the EHR.
Other race and ethnicity included multiracial, Hawaiian, and American Indian/Alaska Native women and women with an unspecified other race and ethnicity.
Additional Vaccine-Related Information
Among pregnant women who reported they were vaccinated, 92.6% (95% CI, 86.5%-96.5%) reported receiving a recommendation from a health professional for influenza vaccination; among unvaccinated pregnant women, 83.9% (95% CI, 79.4%-87.8%) reported receiving a recommendation. Among pregnant women who reported they were vaccinated, 85.2% (95% CI, 75.7%-92.0%) reported being offered an influenza vaccine at a health care visit; among unvaccinated pregnant women, 84.4% (95% CI, 80.0%-88.2%) reported being offered a vaccine.
All respondents who self-reported receiving influenza vaccine were asked the location of their vaccination. For women unvaccinated per EHR data who reported vaccination (EHR false negatives), 35.6% (95% CI, 25.9%-46.4%) were vaccinated in a workplace, 24.2% (95% CI, 15.3%-34.9%) in a hospital, and 36.1% (95% CI, 27.4%-45.6%) in a physician’s office, clinic, or health center. For women vaccinated per EHR data who reported vaccination (EHR true positives), 1.1% (95% CI, 0.3%-3.1%) were vaccinated in a workplace, 18.7% (95% CI, 9.5%-31.2%) in a hospital, and 79.4% (95% CI, 67.1%-88.7%) in a physician’s office, clinic, or health center.
Discussion
To achieve high influenza vaccination coverage overall 10 and among groups at high risk from influenza,5-7 accurate and timely data on vaccination coverage are needed to guide public health activities.8,9 However, no single data source can provide a complete picture of coverage. 16 In this study, using self-report as the criterion standard, we found relatively close agreement between influenza vaccine receipt in EHR-based data and self-report, with high sensitivity, specificity, PPV, and κ. The NPV was lower at 80.5%, indicating that 19.5% of women who appeared unvaccinated in EHR data self-reported receiving influenza vaccine during the current season. In addition, using EHR data to characterize survey nonrespondents, we detected substantial risk of nonresponse bias across race and ethnicity and vaccination status, a finding with important implications for survey-based assessments of influenza vaccination coverage.
Aside from an investigation after the 2009 H1N1 influenza outbreak in the United States 43 and a report from Australia, 44 we are unaware of other studies assessing the accuracy of EHR-based influenza vaccination data among pregnant women. Accuracy has been assessed in other US populations, with some studies using self-report as the criterion standard18,22 and others treating EHR 45 or immunization information system46,47 data as the standard. Sy et al 18 compared EHR-based influenza vaccination data with self-report among 50- to 79-year-old members of a large health care organization during the 2007-2008 influenza season: the NPV of EHR data was 79.5%. This value is surprisingly close to the estimate of 80.5% from our study. We anticipated finding a higher NPV among pregnant women in the current study, given the expansion of immunization information systems and the improved capacity of these systems to send immunization data to EHR systems. 48 Workplace vaccination programs may contribute to the lower-than-anticipated NPV; integrating workplace-based influenza vaccination into state immunization information systems may be an important strategy to improve accuracy.
Our findings also highlight the risk of nonresponse bias in surveys of influenza vaccination. Non-Hispanic Black and Hispanic pregnant women and the unvaccinated (per EHR data) were significantly less likely to respond to the survey than non-Hispanic White pregnant women. Non-Hispanic Black people, including those who are pregnant, have consistently lower influenza vaccination rates11,12,15 and express less confidence in influenza vaccine safety and efficacy 49 than non-Hispanic White people; similar findings have also been observed among Hispanic people.12,49 Because our survey was introduced as concerning “the flu (which is also called influenza) and flu vaccination,” unvaccinated individuals and those with low vaccine confidence might have been less willing than vaccinated individuals and those with high vaccine confidence to complete the survey. National surveys, particularly if immunization-focused, may encounter similar behaviors. This type of nonresponse bias could lead to an overestimation of influenza vaccination coverage, and weighting to a referent population may not necessarily correct for this bias.
Limitations
Our study findings had several limitations. First, survey nonresponse may have influenced our results; differential nonresponse to the survey could have negatively or positively affected the validity measures we assessed. Second, while we treated self-report as the criterion standard, self-reported vaccination can be inaccurate, and we could not review outside records (such as from a workplace) to verify respondents’ self-report. Third, misclassification of pregnancy status could have occurred, although nearly all survey respondents confirmed their pregnancy status. Fourth, patients at VSD sites are largely insured (Denver Health also cares for uninsured individuals), and the VSD population may not be generalizable to the overall population of pregnant women in the United States.
Conclusions
The current study can be viewed as part of a longer-term goal of enhancing the nation’s ability to monitor influenza vaccination coverage. The study highlights the strengths of using EHR-derived data for this purpose, including the ability to identify large populations of pregnant women, as well as the limitations, including missing some influenza vaccination occurring outside the medical home. Current EHR-based systems are also not representative of all states and may underrepresent important sociodemographic groups such as the uninsured. Integrating data from multiple sources, including from surveys,5,11-15 EHR-based networks, 19 and immunization information systems, 48 continues to be needed to monitor influenza vaccination coverage.
Footnotes
Acknowledgements
The authors acknowledge Bradley Crane, MS, and Stephanie Irving, MHS, at the Center for Health Research, Kaiser Permanente Northwest, Portland, Oregon, for their contributions to this work.
Authors’ Note
Matthew F. Daley, Liza M. Reifler, and Jo Ann Shoup had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Disclaimer
The findings and conclusions in this article are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.
Declaration of Conflicting Interests
The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Liza M. Reifler has an immediate family member who owns stock in Merck, Abbott Laboratories, and AstraZeneca. Allison L. Naleway has received research support from Pfizer unrelated to the current study. Michael L. Jackson has received research support from Sanofi Pasteur unrelated to the current study. Huong McLean has received research support from Seqirus unrelated to the current study. Nicola Klein reports research support from GlaxoSmithKline, Sanofi Pasteur, Protein Science (now Sanofi Pasteur), Merck, and Pfizer unrelated to the current study.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: this research was funded by the Centers for Disease Control and Prevention, through a Task Order (contract no. 200-2012-53582-21454), issued as part of the Vaccine Safety Datalink project (contract no. 200-2012-53582). Coauthors from the Centers for Disease Control and Prevention were involved in design and conduct of the study, interpretation of data, and review and approval of the article.
