Abstract
A central aspect of any theory-driven realist investigation (synthesis or evaluation) is to develop an initial program theory (IPT). An IPT can be used to frame and understand how, for whom, why, and under what contexts complex interventions work or not. Despite well-established evidence that IPTs are a central aspect to any realist investigation, there is wide variation and a lack of methodological discussion on how to develop an IPT. In this article, we present the approach that we used to develop an IPT of how patient-reported outcomes (PROs) are used in health care settings. Specifically, we completed a systematic review to extract tacit theories reported in the literature. The benefit of this approach was that it provided a rigorous review of the literature in the development of IPTs. The challenges included (1) rediscovering what is already well established in the theoretical literature, (2) generating an overabundance of partial candidate theories, and (3) extensive use of time and resources for what was the first stage to our larger funded research study. Our recommendations to other scholars considering this approach are to ensure that they (1) live within their means and (2) narrow the scope of the research question and/or develop a conceptual framework using middle-range theories. These methodological insights are highly relevant to researchers embarking on a realist investigation, tasked with developing an IPT.
Keywords
Background
Realist syntheses are driven by the question: “What works, how, for whom, in what circumstances and to what extent?” (Pawson, 2002; Pawson et al., 2005). It is a theory-driven approach to evidence synthesis philosophically rooted in scientific realism (Pawson, 2006). A central aspect of any realist synthesis is to develop, test, and refine a program theory. A “program theory” articulates the ideas and assumptions underlying how and why an intervention or program is expected to work (Pawson, 2006). A program theory can identify and map out the key components of the program, the outcomes the program is intended to generate, and the contexts that might shape the mechanisms through which the program works that contribute to particular outcomes (Muckumbang et al., 2017; Pawson, 2002; Pawson et al., 2005).
Developing an initial program theory (IPT) is the first step in a theory-driven realist research cycle (Muckumbang et al., 2017). An IPT “sketches the terrain” that will be investigated and assists in refining the scope for realist inquiry (synthesis or evaluation; Wong et al., 2013). Through realist inquiry, an IPT is tested and refined to more accurately explain how and why interventions work on the ground. While adoption of realist synthesis appears to be an increasingly popular approach for evidence syntheses of complex interventions, there is a paucity of literature that discusses or provides clear examples of how to undertake this necessary first step in a theory-driven realist research cycle. We attend to this gap in the literature and propose that more detailed methodological guidance will support rigorous development of IPTs with clear application of realist principles. In this article, we first introduce existing methodological guidance and approaches for IPT development. Next, we discuss the approach that we employed to develop an IPT for our realist synthesis on how patient-reported outcomes and experience measures (jointly referred to here as PROs) can be used to enhance person-centered nephrology care (Schick-Makaroff et al., 2017, 2019). Finally, we conclude with a discussion of the benefits, challenges, and lessons learned from the approach we employed, offering recommendations to realist researchers planning and undertaking IPT development.
Aim
The purpose of this article is to describe the benefits and challenges of the approach we used to develop an IPT for a realist synthesis to explain how patient-reported outcomes (PROs) are used in health care settings. We offer transferable lessons to advance existing knowledge and provide a methodological example of IPT development for researchers undertaking a realist synthesis.
Realist Syntheses
Realist syntheses aim to identify the underlying causal generative mechanisms that explain “how” outcomes are caused under certain contexts that produce outcomes of a program. Realist syntheses are guided by the context (C) + mechanism (M) = outcome (O) configuration (CMOc; Pawson & Tilley, 1997). A CMOc is a hypothesis that the program works (O) because of the action of some underlying mechanism (M), which only comes into operation in certain contexts (C; Pawson & Manzano-Santaella, 2012, p. 184). At an individual level, programs or interventions offer resources to participants and how these resources change the reasoning or responses of participants constitute the mechanisms that explain how things work beneath what the eye can see (Dalkin et al., 2015; Pawson, 2006). Westhorp (2018) notes that mechanisms can operate at different levels of a system (not just the individual) but fundamentally are causal forces or processes that explain how an intervention does or does not work.
Existing Methodological Guidance for IPT Development
Realist synthesis is still a relatively new approach that is a part of growing methodological advancement in evidence reviews. In recent years, the RAMESES project provided training materials and guidance on realist synthesis and developing IPTs (Wong et al., 2013). Section 4 of the RAMESES training materials focuses on (1) developing and refining realist program theory, (2) quality standards for program theories, (3) examples from the literature, and (4) a learning activity (Wong et al., 2013). The development of IPTs for realist investigation occurs through a variety of ways such as conducting workshops with stakeholders, reviewing program documentation (policy documents, funding applications, program descriptions, and so on) or by reviewing a small selection of literature about the program type (Wong et al., 2013). The variation as to how an IPT can be developed presents challenges for students and researchers alike embarking on a realist synthesis or evaluation. The RAMESES training materials for realist synthesis outline a list of six questions to assist with selecting an appropriate approach for developing an IPT and constructing and refining it (Table 1).
Questions to Assist Constructing and Refining a Realist Program Theory.
The RAMESES (2014) training materials also provide quality standards for realist synthesis that range from inadequate to excellent for constructing and refining a realist program theory. An excellent standard is defined to show that: The relationship between the program theory and relevant substantive theory is identified. Implications of the final theory for practice, and for refinements to substantive theory where appropriate, are described. The final realist program theory comprises multiple context-mechanism-outcome configurations (describing the ways different mechanisms fire in different contexts to generate different outcomes) and an explanation of the pattern of CMOs. (RAMESES, 2014, p. 4)
The approach we used aligns with Shearn et al.’s (2017) third suggestion, where we conducted a systematic literature search and extracted tacit theories about “what is working and why” from similar interventions. Using the realist CMOc heuristic, we aimed to identify how contexts (i.e., health care settings) shape mechanisms (the processes, reasoning, or behaviors triggered by the use of PROs) through which the intervention (PRO feedback) brings about an outcome (at individual and aggregate levels). Shearn et al. (2017) allude to some potential challenges to this chosen approach of using data-driven approaches only, such as (a) identifying what is already well established in the existing literature, (b) generating an overabundance of candidate theories, and (c) developing theory that may be unstructured. In this article, we add further specificity and discuss some of these challenges that we experienced. To reflect on and provide insight into the process of our IPT development, we draw upon the methods and approach that we employed, meeting notes from our weekly core research team meetings during IPT development, our team report, and our own sense-making processes.
Research Context
Developing IPT for PROs in Kidney Care
The objectives of our study were to (1) understand theories that explain how PROs are used and (2) develop a kidney-specific program theory about use of PROs in nephrology that may enhance person-centered care by testing and refining the theory through a realist synthesis of the empirical literature (Schick-Makaroff et al., 2019). Our intention was to first informally search the literature to identify existing theories on how PROs are used in healthcare settings to develop our IPT. To locate these theories, we (a) undertook a systematic review of the peer-reviewed literature to identify existing theories and (b) consulted our methodological expert (J.G.), research team practitioners (specifically nephrologists), and our patient advisory committee (PAC) to solicit their feedback for IPT refinement. Research team members identified three exemplar papers a priori including Greenhalgh (2009), Santana and Feeny’s framework (2014), and Valderas and Alonso (2008).
Method
IPT Development Through Systematic Review
Following the seven stages for realist synthesis (Wong et al., 2013), we set out to create IPTs to be tested and refined through evidence in order to meet objective one of our study: To formulate IPTs that explain how patient-reported outcomes (PROs) are used in health care settings (Schick-Makaroff et al., 2017, 2019). Patient-reported outcome and experience measures (PROMs and PREMS) are regarded globally as a means for people to report on the impact of illness on their health and quality of life as well as their experience with care (Greenhalgh, 2009). More specifically, PROMs are self-report instruments used to obtain self-appraisals about outcomes relevant to a person’s quality of life (e.g., well-being; overall health; symptoms; functional status; and other aspects of psychological, social, and spiritual quality of life; Fayers & Machin, 2016). PREMs refer to “questionnaires measuring the patients’ perceptions of their experience whilst receiving care” (Kingsley & Patel, 2017, p. 137). Although PREMs and PROMs are different types of instruments, both comprise important outcomes of health care, we refer to them jointly here as patient-reported outcomes (PROs; Schick-Makaroff et al., 2019). Development of our IPTs closely aligns with modern perspectives of PRO measurement validity which increasingly emphasizes the importance of focusing on “use” in contributing toward measurement validity evidence (Hawkins et al., 2018). Thus, IPT development on use of PROs contributes to the field of measurement validation of PROs.
The IPT development process took 15 months (December 2016–March 2018) with three paid part-time research assistants. To begin, and with the aid of a library scientist, a systematic strategy (Additional File 1) was created to search the peer-reviewed literature for two concepts: PRO measures and theory. Including primary research, theoretical or review literature, we retrieved 13,412 articles. After removing duplicated, 6,295 records were title screened, and 1,210 abstracts were screened by the core research team using NVivo Version 11 and EndNote X7 (with 10% double check). Inclusion/exclusion criteria were developed iteratively, trialed and discussed in weekly core research team meetings, and they are reported elsewhere (Schick-Makaroff et al., 2019). Full text of 42 articles were screened by three reviewers. Likewise, 10% of the included full texts were double-checked. Thirty-four full-text articles were included for extraction for our IPTs (see Figure 1).

Preferred Reporting Items for Systematic Reviews and Meta-Analyses. Screening process for the initial program theory
Prior to our extraction process, we drew from six exemplary articles on PRO use in health care settings (Greenhalgh, 2009; Greenhalgh et al., 2014; Porter et al., 2016; Santana & Feeny, 2014; Valderas & Alonso, 2008; Velikova et al., 2004) and worked through inductively extracting CMOc. This created an opportunity for us as a team to understand and define what we meant by context, mechanism, and outcome in relation to how PRO feedback works. For the IPT data extraction process, we created a table in Microsoft word and began abductive processes by creating “if-then” statements from which we developed complete CMOc. We trialed this process with the core research team, discussed and revised on a weekly basis. This process occurred over 3 months.
The core research team then divided into two groups: individual and aggregate levels of decision making. We defined individual levels of decision making as PRO use at point of care by patients, caregivers, and health care providers. We defined aggregate levels of decision making, as PRO use at government, policy, and system-wide levels. In each group, CMOc were inductively extracted from the 34 articles at individual and aggregate levels by a primary reviewer and then the secondary reviewers, and input into tabular format using Microsoft word. A tertiary reviewer reviewed one third of the articles. (The pattern for the selection of the articles, which were numbered 1–34, was the sequence of multiples of 3 [e.g., 14, 17, 20,….]). All extractions were reviewed either by the individual or aggregate team or at least one member of the other team. The core question that guided this process was: “Does this provide theoretical explanation about how PROs are used at an individual or aggregate level in health settings?”
Simultaneously, as we gathered partial CMOc inductively from our included studies, we also abducted from theory and ideas to form more complete CMOc which we later compared, tested, and refined through another realist synthesis in the nephrology literature.
The extraction process was extensive. We met weekly to update on progress and discuss questions. Combining tables from the teams was also a large undertaking that resulted in a 170-page Microsoft word document of individual and aggregate CMOc or partial CMOc. To ensure consistency between and among teams for data extraction, we developed a data extraction codebook/dictionary and we created “nickname themes” for our CMOc while under development so that team members could quickly reference them in discussion. We also created a process for extracting quotes: if direct quote, use quotation marks, page number, and reference. If not direct quote, page number and reference. Our final IPT CMOc are provided in Tables 2 and 3.
Initial Program Theory: Individual Level Use of PROs in Health Care.
a Clinicians’ perspectives of the credibility of PROs may influence their use.
b Providers’ beliefs to be further developed in program theory.
Initial Program Theory: Aggregate Level Use of PROs in Health Care.
Team Meetings and Other Work
In addition to article screening and extraction of CMOc or partial CMOc of the 34 included articles, our core research team met weekly. Our weekly meetings involved development of our protocol and PROSPERO registration, our first database searches, realist training workshops with our methodological expert (J.G.), IPT work, and refined program theory work conducted through a review of existing literature. Contextually, at the time of our IPT development, one of the research team members (J.G.) was also undertaking a realist synthesis on PRO use in health care. A protocol was published at the beginning of our systematic search (Greenhalgh et al. 2014), and the full report was published at the end or our extraction (Greenhalgh, Dalkin, Gibbons, et al., 2017). While our work was at different stages, there was productive overlap between our program theories.
Patient Advisory Committee
We engaged with a PAC during our IPT development process. We consulted PAC members who included people living with kidney disease (on home dialysis, in-hospital dialysis, kidney transplant recipients, and predialysis) and their spouses. One patient was a part of the full research team from the beginning of the research proposal development. In May 2016, the PAC members and research team participated in a 1-day long Canadian Institute for Health Research (CIHR) Strategy for Patient-Orientated Research (SPOR & CIHR, n.d.) Training that was tailored to our team’s work together on the project. We met again in August 2018 to provide an update and to solicit their input on our very early IPT development. We also reported our IPTs to the PAC in March 2018 and discussed program theories in October 2018.
Findings
The search and screening results of our systematic review for IPT development are presented through an adapted Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram (Figure 1). Our IPT approach resulted in six CMOc at an individual level (Table 2) and eight CMOc at an aggregate level (Table 3). Our IPTs map out the main components of using PROs in health care settings, anticipated outcomes, contexts that might shape mechanisms that produce these outcomes, and sequencing of these elements at individual and aggregate levels.
To further test and refine our IPTs, we subsequently undertook a realist synthesis of the kidney-specific peer-reviewed and gray literature to address the second objective of our study. Our kidney-specific program theory was later developed and tested as an evidence-based theory about use of PROs in nephrology for enhancing person-centered care at individual and aggregate levels of decision making. The final program theory was dramatically different from the IPTs presented in this article but could not have been developed without this foundational work.
Challenges to IPT Development Approach
Reflecting on the approach we undertook to develop our IPT, below we have summarized three key challenges that we encountered: (1) rediscovering what is already well-established in the theoretical literature, (2) generating an overabundance of partial candidate theories, and (3) extensive use of time and resources for what was the first stage to our larger funded research study. Our IPT development work took 15 months and required the resources of a research librarian, three paid research staff, realist training, and weekly core research team meetings.
First, from focusing on peer-reviewed literature alone, we were faced with the tension of rediscovering what was potentially already known in the field or duplicating other work underway. Contextually, we were apparent of this tension because of other realist syntheses projects on similar topics which were being undertaken separately and in different stages (Greenhalgh, Dalkin, Gibbons, et al., 2017; Greenhalgh, Dalkin, Gooding, et al., 2017). Nevertheless, our larger research team decided that to ensure our IPTs were rigorous and fully informed future program theory development using kidney resources, we needed to undertake a systematic investigation of the literature. While the process offered reassurance and alignment with the field, it was not without significant use of resources.
Second, due to the magnitude of our search records (
Third, as mentioned, we had a large database of records to screen, and our screening process contained several steps. The title and abstract screening involved six stages, including practice screening, title screening, abstract screening using NVivo for a computer-assisted approach, and abstract screening via EndNote X7, 10% double-checking, and rescreening the included abstracts. We also completed “practice” screening and met as a core research team to discuss our questions to aid team members new to realist synthesis. While our approach enabled a highly rigorous systematic process, it is important to note the challenges. Our systematic approach to IPT development took longer than we anticipated, resulting in the use of more resources, including time and costs, than expected. Thus, our full project resulted in two systematic reviews—one to develop our IPTs and another to test and further refine our IPTs.
Discussion
Recommendations for IPT Development
Reflecting on the processes we undertook to develop our IPT, we have gained lessons and insights which we offer in the following two recommendations.
Anecdotally and from our team’s firsthand experience, it must be acknowledged that embarking on a full systematic search to develop realist IPT can be resource-intensive. To counter this, Saul et al. (2013) proposed a framework for conducting Rapid Realist Reviews. Even for an undertaking like this that nominally promises to mitigate the issue of resource constraints, they still require significant resources. For example, they suggest an ideal rapid realist review team would include a project manager, a local reference group, a librarian, a review team consisting of two to four individuals, a synthesis lead, and an academic or research lead. Not only is payment to all these individuals something to consider, whether through salaries, in-kind, or service agreements, but so too is the amount of time that each of these types of team members may be able to reasonably provide. The key is to balance available labor (i.e., graduate students who can only work part-time vs. full-time nonstudent staff), available funds, expertise, and time as well as to discuss expectations of the comprehensiveness of the IPT. It is also possible to develop strategies to help optimize the resources available for a realist synthesis. For example, we found that having a well-defined intervention and inclusion/exclusion criteria (Schick-Makaroff et al., 2017, 2019) enabled us to have clear guidance for data screening which increased the efficiency at which we were able to screen articles as well as help standardize the selection process across the different reviewers on the team. Full-team discussions about how to “live within your means” may help mitigate both anticipated an unanticipated challenge in the development of IPTs.
Having an overabundance of partial IPTs challenged our team to explain patterns or fully unpack mechanisms. This overabundance may have been due to (a) not having created sufficiently narrow boundaries around our research question or (b) not integrating MRTs until refinement of our IPTs.
First, focusing reviews is given an entire section in the realist training materials created by Wong et al. (2013). Even for one specific intervention (in our case, use of PROs in health care), the questions that could be asked about it are infinite. From a realist perspective, this is a characteristic of complexity. Wong et al. (2013) suggest that containing a review is therefore important and can be done in a number of ways including narrowing the question, narrowing the aspect of the intervention to be investigated, narrowing the scale (e.g., by countries or cultures or timeframes), and mitigating expectations about the comprehensiveness of the review.
Second, Shearn et al. (2017) incorporate the “use of concepts from abstract theories” into two of their suggestions for how to approach developing IPTs. For realist researchers, these concepts from abstract theories can be MRTs. MRTs can be used to develop an a priori set of contexts, mechanisms, and outcomes with which to reduce the variety of CMOs identified from the data in the literature, from interview transcripts, and/or document analysis. For our team, the integration of MRTs earlier on could have provided a framework to understand how, why, and in what circumstances the PROs are used in health care settings, thus creating parameters around what “nuggets” of data from the literature should be extracted and included in IPT development. One mitigating strategy we used to manage our overabundance of partial CMOc and to help with sense-making was to create two succinct categories, individual- and aggregate-level PRO use. Research team discussions to critically appraise the scope of the research question, as well as integration of MRTs may facilitate IPT development in realist investigation.
Limitations
A limitation that we encountered was that only two of the six members had engaged in this type of realist CMO extraction before, and only three of the six had ever participated in realist work—so developing our IPT was also our training to realist synthesis and extraction. Using the RAMSES training materials for realist synthesis, we only drew upon Questions 1–3 and 6 in-depth when first developing our IPT. We did not consider Question 5: “What assumptions are built into the program theory? What assumptions are we (the reviewers) making? What assumptions are there in the data? Which ones do we need to challenge and why?” nor Question 6: “What data, from where, might help to test and refine the theory?” until we were refining our IPTs. This may be considered a limitation to our IPT development.
Conclusion
In this article, we present the approach that we used to develop an IPT of how PROs are used in health care settings. We believe that reporting on the process we took to develop our IPT can improve transparency and future guidance on IPT development. Developing IPTs through systematic processes are both rigorous and detailed; however, this approach should be embarked on with caution, resources, and the needed skill set. We recommend the use of existing program theories (where available) to improve coherence and data abstraction. Further, earlier integration of MRTs can mitigate the challenges faced when developing IPT from solely data-driven approaches. These methodological insights are highly relevant to graduate students and researchers embarking on a realist investigation, tasked with developing IPTs in future realist work.
Supplemental Material
Supplemental Material, IJQM_additonalFile_30July19 - Developing an Initial Program Theory to Explain How Patient-Reported Outcomes Are Used in Health Care Settings: Methodological Process and Lessons Learned
Supplemental Material, IJQM_additonalFile_30July19 for Developing an Initial Program Theory to Explain How Patient-Reported Outcomes Are Used in Health Care Settings: Methodological Process and Lessons Learned by Rachel Flynn, Kara Schick-Makaroff, Adrienne Levay and Joanne Greenhalgh in International Journal of Qualitative Methods
Footnotes
Authors’ Note
Research Team members involved in this research include Stephanie Thompson, Richard Sawatzky, Onouma Thummapol, Scott Klarenbach, and Mehri Karimi-Dehkord. Our patient advisory committee members involved in this research include Loretta Lee (cochair), Tammy Fifield, Justin LeRoux, Jon Gilchrist, Sandra Johnstone, Marcus Alphonsus Walsh, Julia Walsh, Jeff Costley, Suzanna Pomrenke, and Beulah Fitzpatrick.
Acknowledgments
Rachel Flynn is supported by a Women and Children’s Health Research Institute postdoctoral fellowship. Kara Schick-Makaroff is a new investigator funded by Can-SOLVE CKD—KRESCENT (Canadians Seeking Solutions and Innovations to Overcome Chronic Kidney Disease; Kidney Research Scientist Core Education and National Training Program).
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Study funded by Kidney Foundation of Canada, Allied Health Research Grant, KFOC 160024.
Supplemental Material
The supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
