Abstract
Importance
Quality indicators are used to evaluate the quality of healthcare delivery and as a speciality, otolaryngology–head and neck surgery (OHNS) is beginning to transition toward this empirical understanding of healthcare quality and delivery.
Objective
To describe the number and quality of studies that have developed novel quality indicators for any subdiscipline in OHNS.
Design
We performed a database search of MEDLINE (Ovid), EMBASE (Ovid), Web of Science, and Cochrane Database of Systematic Reviews. We did not employ language or study-type restrictions and included studies published from database inception to October 2024.
Study Selection
Following abstract screening, 184 articles underwent full-text screen. Eligible studies developed quality indicators in any subdiscipline within OHNS. Article screening and full-text review was performed in duplicate.
Data Extraction and Synthesis
We extracted study-specific and methodological data in duplicate. Quality appraisal was assessed using the Appraisal of Indicators through Research and Evaluation instrument.
Results
We identified 10,592 studies, of which 25 studies developed new quality indicators. Quality indicator development studies primarily focused on otology/neurotology, pediatric OHNS, and head and neck surgery. Few studies investigated facial plastics, rhinology and skull base surgery, and laryngology. Most studies employed Delphi consensus methods and patient engagement was rare. Consensus methodology reporting was poor and indicators were often not validated. Outcome indicators were often measured with fewer studies investigation structure or process indicators.
Conclusions
Quality indicators may help standardize and improve patient care in OHNS. Future research should focus on structure and process indicators, while improving reporting, optimizing panel composition, and validating quality indicators.
Key Messages
• Quality indicator development research is ongoing in otolaryngology–head and neck surgery (OHNS)
• Most studies developed quality indicators in otology/neurotology, pediatric OHNS, and head and neck surgery with few studies investigating facial plastics, rhinology, and laryngology
• Optimizing consensus panel composition, improving reporting, and developing structure and process indicators will improve the impact of quality indicator research in OHNS
Introduction
Healthcare systems are shifting toward an empirical understanding of patient care.1,2 Quality indicators are performance metrics that help standardize the evidence-based evaluation of healthcare quality and delivery, in addition to clinical outcomes. 3 A common framework is the Donabedian model, which categorizes quality indicators into structure, process, and outcome measures (Table 1). 1 An important and renowned application of surgical quality indicators is the American College of Surgeons National Surgical Quality Improvement Program (NSQIP), which is a quality indicator database connecting healthcare institutions and surgical disciplines across the United States. 4 Since the beginning of this quality initiative, NSQIP-affiliated institutions have helped to improve clinical outcomes and healthcare delivery, for patients and healthcare practitioners.4 -6
Quality Indicators as Defined by the Donabedian Framework.
Quality indicators are developed via a multistage process that requires analyzing peer-reviewed literature and clinical practice guidelines, while incorporating input from allied healthcare professionals, patients, and their families to generate target performance metrics for further scientific validation.7,8 An often-used development method for quality indicators involves a consensus approach, or Delphi method, whereby an expert panel of medical professionals establish candidate quality indicators that are then subject to successive rounds of discussion and selection until final quality indicators are identified for further validation. 9 Over the past decade, researchers in otolaryngology–head and neck surgery (OHNS) have begun generating quality indicators to standardize and improve healthcare quality and delivery. So far, however, there has not yet been a fulsome review of the breadth of quality indicators in OHNS. Hence, the objective of this scoping review was to characterize and evaluate the quality of studies that have developed quality indicators in OHNS.
Materials and Methods
This study was reported in accordance with the Preferred Reporting Items for a Systematic Review and Meta-Analysis (PRISMA) Scoping Review Extension Guidelines. 10 As this is a scoping review, it did not require PROSPERO database registration. 11
Search Strategy
We reviewed all studies that reported quality indicators within OHNS in Medline, Embase, Web of Science, and Cochrane Database of Systematic Reviews from database inception to October 24, 2024, with the help of a librarian specialist (C.W.) (Appendix A). We performed a qualitative search of the reference lists of included articles and review articles. We did not include a language restriction. Further, a gray literature search of global professional organizations in OHNS and healthcare quality as well as gray literature databases was also performed (i.e., Google, Google Scholar, OAlster, OpenGrey). We searched the first 10 pages of all web page results.
Study Eligibility
We included studies that addressed quality indicators in any subdiscipline in OHNS, including facial plastics and reconstructive surgery, general OHNS, head and neck surgery, laryngology, otology and neurotology, pediatric OHNS, and rhinology and skull base surgery. We defined development studies as those that utilized any consensus methods or other methodologies to identify quality indicators and/or outcome sets. We included adult and pediatric patients in any clinical setting. We did not include pathologies or clinical interventions involving the esophagus or thoracic cavity. We also did not include quality indicators in radiation or medical oncology for head and neck oncology.
We included peer-reviewed original research, systematic reviews, consensus and clinical guidelines, and guideline extraction articles that developed novel quality indicators in the context of clinical care. There was no language restriction. We also excluded any narrative reviews and editorials. We excluded any studies that proposed or evaluated existing quality indicators from the final descriptive analysis.
Study Selection and Data Abstraction
Articles were reviewed following PRISMA guidelines (Figure 1). All review stages were completed by 2 reviewers (P.S., L.W.) and conflicts were resolved via discussion including 2 senior authors (S.S., J.C.). We utilized Covidence (Melbourne, Australia) for article screening. Data abstraction was performed by 3 authors (P.S., S.S., and L.W.), and all discrepancies were addressed by a senior reviewer (E.M.). We collected data pertaining to authorship, year of publication, study location, conflicts of interest, study design, methodology, as well as individual quality indicators and their associated details, including descriptor numerator and denominator.

PRISMA flowchart for included studies. PRISMA, Preferred Reporting Items for a Systematic Review and Meta-Analysis.
Quality Appraisal and Statistical Analysis
All included studies that developed novel quality indicators were evaluated using the validated Appraisal of Indicators through Research and Evaluation (AIRE) tool. 12 The AIRE is a validated instrument that was designed to evaluate studies of quality indicators in healthcare. 13 It comprised 20 items that address 4 quality domains: (1) purpose and organizational context, (2) stakeholder involvement, (3) scientific evidence, and (4) additional evidence. Each item is scored on a 4-point scale (i.e., 1: strongly disagree and 4: strongly agree). A standardized domain score was calculated: [(total score − minimum possible score)/(maximum score − minimum possible score) × 100%]. A score closer to 100% indicated higher methodological quality. Each domain was independently evaluated by 2 reviewers (P.S., L.W.) with discrepancies resolved by a third reviewer (S.S.). We analyzed all included studies via descriptive analyses, including measures of central tendency and standard deviation. We calculated frequencies and proportions for all categorical variables. All statistical analyses and tables were generated using Microsoft Excel (Ridgefield, WA, USA).
Results
We identified 10,592 studies from the database search wherein 25 studies met eligibility criteria (Figure 1).14-38 All included studies developed new quality indicators in OHNS (Table 2). We found that most studies (n = 18, 72%) utilized Delphi, modified Delphi, or RAND-UCLA consensus methods. Candidate quality indicators were primarily selected from clinical practice guidelines and/or scientific literature (n = 21, 84%). These consensus studies primarily used an expert panel composed of medical specialists (n = 19, 76%) with few studies including patient or family engagement (n = 7, 28%). The most studied OHNS subdiscipline was pediatric OHNS (n = 7, 28%) with most studies evaluating otitis media or tonsillitis (Table 3). In otology and neurotology (n = 6, 24%), most studies evaluated tinnitus or vestibular disease. In head and neck surgery (n = 6, 24%), most studies evaluated head and neck cancer. Supplemental Table S1 describes the quality indicators selected in each included study.
Summary of Quality Indicator Development Studies (N = 25).
Abbreviation: CPG, clinical practice guideline.
Includes audiologists, nurses, speech-language pathologists, methodologists, psychologists, and administrators.
Summary by Otolaryngology–Head and Neck Surgery Subdiscipline (N = 25).
Abbreviation: OHNS, otolaryngology–head and neck surgery.
Includes acute otitis media, chronic suppurative otitis media, and otitis media with effusion.
Includes sudden sensorineural hearing loss.
Michel et al 30 also investigated allergic rhinitis.
Includes oral cavity, oropharyngeal, laryngeal, hypopharyngeal cancer, dysphagia in the context of head and neck cancer, and mandibular osteoradionecrosis.
Study Quality Appraisal
All studies were evaluated using the AIRE instrument (n = 25; Supplemental Table S2). We found that the mean value for the purpose domain was 63.8% (range: 27%-87%). The primary deficiency in this domain was poor reporting of the criteria for selecting quality indicators. The mean value for the stakeholder domain was 54.5% (range: 5.5%-78%) and its primary deficiency was a lack of patient/family and allied health professional engagement. The mean value for the scientific evidence domain was 55.8% (range: 0%-89%) with reporting deficiency in the search methods and types of resources identified for quality indicator development. The mean value for the additional evidence domain was 41.8% (range: 15%-74%). These studies reported construct validity with adequate detail; however, we found limited reporting with regards to quality indicator description and measurement, test reliability, risk adjustment, and quality indicator piloting.
Pediatric Otolaryngology–Head and Neck Surgery
There were 7 studies (28%) of pediatric OHNS that developed new quality indicators.15,16,18,21,23,25,34 Most studies investigated pediatric otitis media (Table 3).21,23,34 Four studies used a Delphi/modified Delphi consensus method, 2 used the RAND-UCLA method, and 1 study did not report the consensus method. Expert consensus panels included surgeons and physicians in all studies, while 2 studies also included allied health professionals (e.g., audiologists, nurses, speech language pathologies).23,34 One study incorporated patient and family surveys in developing outcome indicators for otitis media with effusion. 23 Three studies extracted candidate indicators from peer-reviewed clinical practice guidelines, while all remaining studies drew candidate indicators from the peer-reviewed literature and expert opinion.18,21,25 Across all eligible studies, 113 novel quality indicators were developed using consensus methods: 72 (63.7%) were outcome indicators, 41 (36.3%) were process indicators, and none were structure indicators (Supplemental Table S1). Empirical validation was performed in one study. 25
Otology and Neurotology
There were 6 studies (24%) of otology and neurotology that developed new quality indicators (Table 3).20,22,30,35,37,38 Most studies evaluated tinnitus and Meniere’s disease. Two studies used the Delphi method while 2 studies used the RAND-UCLA method. Three studies (50%) extracted candidate measures from clinical practice guidelines. Two studies included patient input or health user stakeholders in their consensus panels.22,35 Three studies extracted candidate quality indicators from clinical practice guidelines.20,30,38 None of the studies performed validation of proposed indicators. Across all included studies, 41 novel quality indicators: 34 (82.9%) were outcome indicators, 7 (17.1%) were process indicators, and none were structure indicators (Supplemental Table S1).
Head and Neck Surgery
There were 6 studies (24%) of head and neck surgery that developed novel quality indicators (Table 3).24,27,29,32,33,36 Five studies investigated head and neck cancer and one investigated primary hyperparathyroidism. Three studies (50%) used the modified RAND-UCLA appropriateness method, and 3 studies (50%) used Delphi or modified Delphi methods. Two studies employed patient-based organizations while remaining studies used expert panels that included both physicians and allied health professionals. All studies derived candidate indicators from published clinical guidelines, peer-reviewed studies, and expert opinion. No studies performed any empirical testing of selected quality indicators. Across all studies, 166 novel quality indicators were developed: 83 (50%) were outcome indicators, 77 (46.4%) were process indicators, and 6 (3.6%) were structure indicators.
Rhinology and Skull Base Surgery
Three studies (12%) developed novel quality indicators in rhinology.17,19,26 These studies investigated acute bacterial and chronic rhinosinusitis. One study developed quality indicators for both cerumen impaction and allergic rhinitis. 30 Two studies employed a RAND-UCLA method while the other study used a Delphi method. Three studies identified candidate quality indicators via clinical practice guidelines, while the other study identified candidate indicators via the peer-reviewed literature. Two studies described the expert panel to consist of physicians, while one study included patient engagement. 26 A total of 38 quality indicators were developed with 20 (52.6%) being process indicators and 18 (47.4%) being outcome indicators. None of these studies performed quality indicator validation.
Other Otolaryngology–Head and Neck Surgery Subdisciplines
Two studies investigated new quality indicators in general practice OHNS.14,31 These studies evaluated inpatient OHNS and dysphagia. One study used the Delphi consensus method. 14 These authors also included allied health specialists and medical specialists within their expert consensus panel. Moraes et al describe 12 process outcomes related to dysphagia and swallowing assessment. 31 Manahan et al identify candidate quality indicators from clinical practice guidelines, but do not describe a selection method. 28 These authors identify 1 outcome and 3 process indicators. None of these studies performed quality indicator validation.
Gray Literature
We identified 2 citations: (1) entnet.org, which is hosted by the American Academy of OHNS and (2) healthmonix.org, which is hosted by a private company involved with healthcare quality improvement. There were 49 quality indicators involving OHNS and general medicine, of which 47 were process indicators and 2 were outcome indicators. We did not identify any studies appraising these quality indicators.
Discussion
With the success of the NSQIP, surgical disciplines continue to develop and validate quality indicators that connect healthcare networks and improve healthcare delivery.8,39 Herein, we present a scoping review aimed at describing all published OHNS quality indicators so that investigators can identify gaps in their knowledge and application. Schneider and Lavin identified several quality improvement databases and registries that are being used in OHNS to improve patient care and healthcare efficiencies. 40 In our review, quality indicator studies primarily investigated patient management in otology, neurotology, pediatric OHNS, and head and neck surgery. These studies focused on otitis media, tinnitus and vestibular disorders, sudden and chronic hearing loss, tonsillitis, and head and neck cancer. Otology and neurotology are often limited by low-quality research and research volumes that make up less than half of all OHNS publications.41,42 There has also been a paucity of high-quality pediatric OHNS research published over the past 3 decades. 43 Head and neck surgery is often studied in OHNS, but randomized trials in head and neck surgical oncology suffer from poor methodological quality. 44 Core outcome sets are being developed to help standardize outcome assessment and reporting in head and neck surgery. 45 Minimal quality indicator development research has been performed in facial plastics, rhinology and skull base surgery, and laryngology. A drive toward an evidence-based understanding and application of quality indicators will contribute to clinical research and standardization of healthcare quality in all subdisciplines of OHNS.
Most studies employed the Donabedian framework, wherein quality indicators were categorized as structure, process, and outcome measures. The Donabedian framework has been validated in other surgical specialities and may help facilitate the implementation of eHealth into healthcare systems.46 -48 The Institute of Medicine described 6 domains of healthcare quality: patient safety, effectiveness, patient-centeredness, timeliness, efficiency, and equity that have been applied to medical populations, but we did not find any consistent use of this framework and it has been less applied to surgical patients. 49 When applying the Donabedian framework to OHNS quality indicators, we found that structure, process, and outcome indicators were all included, but most studies developed only process and outcome indicators. Similarly in vascular surgery, structure and process indicators were rarely studied when compared to outcome indicators. 50 In a review of surgical oncology studies, however, process indicators were studied more often than both structure and outcome indicators. 51 In other non-OHNS surgical specialties, process indicators have shown to be better drivers of hospital quality improvement when compared to outcome indicators.50,52 Further, Rademakers et al suggested that structure and process indicators were more likely to improve patient ratings of quality of care. 53 Hence, structure and process indicators are important for internal quality improvement, while outcome indicators help inform the public about institutional differences in healthcare. 54 As such, we suggest that in addition to outcome indicators, future studies focus on structure and process indicators as these are less often studied in OHNS but contribute to evidence-based healthcare quality improvement initiatives.
At present, there are no standardized methodologies or guidelines for conducting consensus research for developing quality indicators.55 -57 In our review, we found that a Delphi or modified Delphi method was most often employed by authors, but reporting detail was often lacking, which is consistent with the literature. 8 Further, consensus panels primarily included physicians but often did not gather the opinions of other stakeholder, such as administrators, allied health professionals, patients, and families, which is consistent with other studies. 58 In addition, quality indicators were primarily refined by local experts from regional hospital networks, but often did not incorporate opinions from other countries. We also found that quality indicators were not piloted or validated in these studies, nor were they provided to quality improvement entities (i.e., Agency for Healthcare Research and Quality) for feedback. Validating healthcare quality indicators is important as it prevents improper data entry and bias by providers while ensuring the chosen indicators are reflective of healthcare quality.59,60 In evaluating these studies using the AIRE instrument, deficiencies existed in the lack of patient engagement and allied health professionals within the expert panel, poor reporting of consensus methodologies and methods used to identify candidate quality indicators, and a lack of empirical piloting or risk adjustment for chosen quality indicators. We also recommend that investigators consider developing reporting guidelines for consensus methods used to develop quality indicators. Advances in the standardization of electronic medical record management across North America may help to drive empirical piloting and validation of quality indicators. Moreover, future studies may benefit from evaluating and developing quality indicators for marginalized communities and patients in OHNS, such as head and neck cancer patients, who require multidisciplinary care.61,62
There are several limitations of this study. First, we did not empirically evaluate or validate these quality indicators. Second, we employed broad library search parameters using terms relevant to OHNS, but these terms may not have captured quality indicators related to allied health or patient engagement in OHNS.
Quality indicators are being developed and tested to improve healthcare quality and delivery in OHNS. Consensus methods for identifying quality indicators are gaining popularity, but investigators must improve study reporting, patient and relevant stakeholder engagement, and perform empirical validation of novel quality indicators. We should also consider focusing on structure and process indicators since they are less studied but may better inform quality improvement initiatives when compared to outcome indicators. As we proceed toward an era of eHealth and personalized medicine, a better understanding of quality indicators in OHNS may help to improve healthcare standardization, institutional accountability, and professional collaboration around the world.
Supplemental Material
sj-docx-1-ohn-10.1177_19160216251330627 – Supplemental material for Quality Indicators in Otolaryngology–Head and Neck Surgery: A Scoping Review
Supplemental material, sj-docx-1-ohn-10.1177_19160216251330627 for Quality Indicators in Otolaryngology–Head and Neck Surgery: A Scoping Review by Phillip Staibano, Shireen Samargandy, Justin Cottrell, Lily Wang, Michael Au, Michael K. Gupta, Han Zhang, Doron D. Sommer, Christopher Walsh and Eric Monteiro in Journal of Otolaryngology - Head & Neck Surgery
Supplemental Material
sj-docx-2-ohn-10.1177_19160216251330627 – Supplemental material for Quality Indicators in Otolaryngology–Head and Neck Surgery: A Scoping Review
Supplemental material, sj-docx-2-ohn-10.1177_19160216251330627 for Quality Indicators in Otolaryngology–Head and Neck Surgery: A Scoping Review by Phillip Staibano, Shireen Samargandy, Justin Cottrell, Lily Wang, Michael Au, Michael K. Gupta, Han Zhang, Doron D. Sommer, Christopher Walsh and Eric Monteiro in Journal of Otolaryngology - Head & Neck Surgery
Footnotes
Appendix A: Search Strategy
Database: Ovid MEDLINE(R) and Epub Ahead of Print, In-Process & Other Non-Indexed Citations, Daily and Versions(R) <1946 to January 20, 2020> Search Strategy:
Data Availability
Available from authors upon request.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Ethical Disclosures
Not applicable.
Supplemental Material
Additional supporting information is available in the online version of the article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
