Abstract
Ethnographic-epidemiological (“ethno-epi”) research methodologies are increasingly being used to examine health-related issues, including the experiences of people who use drugs. However, the complementary application of random sampling from a well characterised cohort and qualitative data collection methods in a single study has not been described. We address this gap by sharing insights from the implementation of a novel random stratified sampling technique to recruit participants from two large prospective observational studies of people who use drugs into a qualitative study about impacts of the COVID-19 pandemic on their lived experience. We aim to describe how an ethno-epi approach we used can enhance the validity, reliability and generalizability of research findings in mixed methods investigations. We do so by providing a step-by-step description of the process we used to determine participant eligibility and recruitment into the qualitative study. Although the approach is not without limitations, findings underscore how ethno-epi random sampling approaches can increase the credibility and trustworthiness of qualitative findings without compromising data depth and integrity. Our study makes an important contribution to the growing number of new creative approaches being developed in the mixed methods research field and we hope that by sharing our account it will encourage and support others to consider the use of ethno-epi approaches in health-related research.
Background
Ethnographic-epidemiological (“ethno-epi”) research approaches capitalize on the strengths of ethnographic/qualitative research methods and research strategies used in epidemiology. Almeida Filho (2001, 2020), one of the first scholars to describe and promote the integration of ethnography and epidemiology in health research, describes ethno-epi research as a “transdisciplinary approach for health research objects and methods” and an “alternative for research on social processes and practices related to health, able to competently combine qualitative and quantitative approaches” (Almeida Filho, 2020). According to Almeida Filho (2020), the term ethno-epi first appeared in the title of a 1992 research report on infant mortality among Hmong ethnic minority children in Thailand (Kunstadter et al., 1992).
In the 1990s, when mixed methods research approaches were gaining popularity (Johnson et al., 2007), scholars began using the term ethno-epi to advocate for the interdisciplinary integration and application of anthropological/ethnographic and epidemiological research (Agar, 1994; Almei da Filho, 2001, 2020; Inhorn, 1995). For example, Inhorn (1995), an early advocate of ethno-epi approaches argued that: … epidemiology, like medical anthropology, operates largely within its own sphere—one that is somewhat peripheral to the domain of biomedicine. In this respect, epidemiology and medical anthropology share markedly similar structural positions vis a vis biomedicine—a commonality that should serve to unite rather than divide them. (p. 286)
In the last two decades, ethno-epi approaches have been used to understand a broad spectrum of health-related issues. Most scholars have described doing so in the context of blending quantitative and ethnographic/qualitative data for analysis; using an epidemiological lens to inform ethnographic research; or using ethnography to guide epidemiological research or make sense of or “unpack” findings derived from epidemiological research. Littleton et al. (2008) for example, refer to their ethno-epi approach as one that used qualitative interview data to contextualise epidemiological data of the everyday life experiences of tuberculosis among migrants to New Zealand. Lee et al. (2004) on the other hand described their study as ethno-epi, by using insights from ethnography to inform epidemiological data collection tools to investigate the sociocultural risk factors of postnatal depression. Furthermore, Thompson and Gifford’s study (2000) about the meanings of health and diabetes in an Australian Aboriginal community, was defined as ethno-epi because their ethnographic fieldwork was conducted through the lens of epidemiological understandings.
Interest in the use of ethno-epi approaches in drug-related research has been present for at least two decades. Early US researchers (Clatts, 2001; Clatts et al., 2002; Pach Iii & Gorman, 2002) described using broadly defined ethno-epi methods that involved blending data as a tool for integrating ethnographic/qualitative and secondary sources of quantitative data (e.g. police and media reports, drug treatment data, drug monitoring systems) to identify and track emergent drug use trends over time, including for people who use methamphetamine (Clatts, 2001; Pach Iii & Gorman, 2002) and crack cocaine (Clatts et al., 2002). Furthermore, Moore et al. (2009) positioned their ethno-epi technique as one that merged agent-based modelling simulations with ethnographic fieldwork to increase understandings of psychostimulant use and related harms among young Australians. In relation to their study, they noted: … some of the limitations of the two forms of data collection are minimized (e.g., ethnography’s specificity and the limited depth of quantitative instruments), [while] some of their advantages are reinforced (e.g., ethnography’s richness and the use of large samples in epidemiology) and their interactive potential is maximized (p. 1).
More recently, Canadian scholars have described using ethno-epi methods to purposively recruit people who inject drugs from prospective observational studies to qualitatively understand experiences of HIV treatment (Fleming et al., 2021; McNeil et al., 2017; Small et al., 2016), injecting cessation among marginalised youth (Boyd et al., 2017) and hospital discharges (McNeil et al., 2014). Ethno-epi techniques used in these studies provided avenues for exploring and making sense of the associations identified in epidemiological studies (Small et al., 2016) and, at a practical level, allowing for efficient recruitment of participants with select characteristics of interest. Mayock et al. (2015), whose Irish study examined the risk environment of heroin use, described the approach as one that incorporated qualitative data collection techniques with the development of epidemiological descriptions of the types of environments in which young people first used heroin. As presented in the glossary of their manuscript, they describe an ethno-epi approach as: … an emergent cross-disciplinary research methodology that combines the strengths of ethnographic observation and other qualitative methods for understanding social meanings and contexts as practised in anthropology with the design, sampling, data collection, and
As the above examples illustrate, a growing number of scholars are using ethno-epi methods to examine health-related issues. Despite significant variation in how ethno-epi methods have been defined and used, missing from the literature are mixed methods studies that use quantitative random or probability sampling techniques to recruit participants into qualitative research. While there are many examples of qualitative research being used as a complement to randomized trials (e.g., Dixon et al., 2016; Maher et al., 2019; Page et al., 2019; Scantlebury et al., 2021), to our knowledge the complementary application of quantitative random sampling and qualitative data collection methods in a single study has not been described. We address this gap in the literature by sharing insights from the implementation of a stratified random sampling technique to recruit participants from two large prospective observational studies of people who use drugs into a qualitative study aiming to understand impacts of the pandemic on their lived experience (see Walker et al., 2023). By combining the use of qualitative research methods with observational cohort studies our work builds on the studies considered above, allowing for the efficient recruitment of participants with characteristics of interest, while minimizing recruitment biases in selecting these participants. We aim to describe how our novel ethno-epi approach can enhance the internal generalizability, rigour and trustworthiness of qualitative findings in mixed methods studies. In the following, we provide a broad overview of the study—COvid-19 Drug use Experiences (CODE)—to provide context for the mixed methods approach used. Next, we present a step-by-step description of our ethno-epi sampling process, followed by a discussion of the implications of its use, including benefits and limitations.
CoDE Study Overview
A growing number of studies have examined the unintended consequences of the COVID-19 pandemic (herein described as “the pandemic”) on the lives of people who use drugs. Although qualitative studies examining the impacts of the pandemic on people who use drugs have emerged globally (Ali et al., 2021; May et al., 2022; Otiashvili et al., 2022; Roe et al., 2021; Russell et al., 2021), fewer have focused on Australia (Brener et al., 2022; Coleman et al., 2022; Conway et al., 2023; Efunnuga et al., 2022). Furthermore, these studies are based on non-representative purposive or snowball sampling techniques. In contrast, our qualitative study randomly sampled participants with characteristics of interest determined by interrogation of the quantitative databases of two well characterized cohorts of people use drugs; the Melbourne Injecting Drug User Cohort Study (SuperMIX) (Van Den Boom et al., 2022) and the Understanding Methamphetamine Use in Victoria Study (VMAX) (Quinn et al., 2021), with the aim of recruiting a more representative sample to increase the rigour and trustworthiness of our findings.
SuperMIX (n = 1303 at the time of data collection) was established in 2008 and is focused on understanding how the health and socio-economic outcomes and drug use patterns of people who inject drugs evolve over time (Van Den Boom et al., 2022). Eligibility includes being at least 18 years of age, injecting at least monthly for six months prior to baseline interview, and residing in urban Melbourne, Victoria. VMAX (n = 853 at the time of data collection) commenced in 2016 and examines long-term patterns of methamphetamine use, and how these impact on service use and health and wellbeing outcomes over time (Quinn et al., 2021). Eligibility includes being over 18 years of age; residing in metropolitan Melbourne or one of three recruitment locations in rural Victoria; and at least monthly use of methamphetamine primarily via non-injecting routes of administration (e.g., smoking, snorting) in the last six months. The non-injecting criterion was relaxed to reach the target sample size as recruitment slowed. Both studies involve annual participation in follow-up surveys via phone call or in-person.
We added pandemic-related questions to the SuperMIX and VMAX cohort surveys at the beginning of the pandemic (March 2020); this epidemiological self-report survey data were used to select and recruit participants into the qualitative study. In-depth semi-structured interviews were conducted with 76 participants of the SuperMIX and VMAX studies. The aim of the study was to understand impacts of the pandemic on drug use; housing, employment, and income status; social relationships and supports; access to and use of health, drug treatment and harm reduction services; interactions with law enforcement; and acceptability of government public health restrictions and vaccines. Ethics study approval was received from the Alfred Health Ethics Committee in 2020 (#258/21).
Ethno-epi Sampling Approach
In the following we describe the ten steps used to determine inclusion eligibility for participation in the qualitative study, followed by the process of randomization, selection, and recruitment.
Determining eligibility for participation in our random sampling process began by identifying SuperMIX and VMAX study participants who had completed at least one survey questionnaire since the pandemic-related questions were added in March 2020. Those who reported no drug use since their last follow-up interview prior to March 2020 were excluded. A total of 766 potentially eligible participants were identified (SuperMIX, n = 350; VMAX, n = 416).
We created twelve ‘flags of interest’ representing epidemiological variables that corresponded to pandemic-related survey questions in the SuperMIX and VMAX studies (Table 1) that matched areas of inquiry in the qualitative study. Nine flags of interest were common across SuperMIX and VMAX studies: (1) employment and housing status; (2) drug use behaviour; (3) increased drug use; (4) purchasing drugs; (5) alcohol use and smoking; (6) overdose; (7) health; (8) COVID testing and isolation; and (9) police interactions. Three additional flags of interest were created for SuperMIX participants, to capture impacts specific to injecting drug use: (10) injecting behaviour; (11) opiate agonist maintenance treatment (OAMT); and (12) injecting health. (Table 1)
Examples of Pandemic-Related Questions used to Generate “Flags of Interest”.
With the aim of maximising the potential for information-rich cases, eligibility criteria for participation in the qualitative study were further refined to include only those who indicated significant impacts (i.e., reported the most impacts across the twelve flags of interest), as indicated by their responses to pandemic-related survey questions within each flag. For SuperMIX participants (n = 350), eligibility included those who reported impacts across at least four of the 12 flags of interest (n = 122). For VMAX participants (n = 416), those who indicated impacts across at least two of the nine flags of interest were eligible (n = 169); a decision that was driven by the larger number of flags of interest for SuperMIX (n = 12) compared to VMAX (n = 9). (Figure 1)

Flow chart of participants in randomized sampling process (n = 291).
A total of 291 participants were eligible to participate in the qualitative study via the process described above; this became our sample frame for randomization. Six stratified lists of participants were generated to ensure participant age, sex and geographical location characteristics were homogenous in each stratum. For SuperMIX we created two lists of participants based on sex: females (n = 46), and males (n = 76). For VMAX, four participant lists were generated based on sex and recruitment location: female rural (n = 33), female metro (n = 34), male rural (n = 51), and male metro (n = 51).
With the aim of minimising selection bias, participant identifiers in each of the six strata were assigned a random order using a random number generator in Excel. This procedure resulted in the creation of six randomized lists, which were used for contacting individuals in the sequential order in which they appeared in each list.
Each of the six randomized lists was imported into a password protected Excel spreadsheet. Participant contact details including mobile telephone numbers, email addresses and Facebook handles were obtained from SuperMIX and VMAX contact databases, which are updated by researchers after survey interviews are completed or when researchers opportunistically meet participants in the field. Contact details were merged into the six randomized spreadsheets for the first ten participants in each of the randomized lists (n = 60), as a starting point for the recruitment process.
Up to three attempts were made to contact the first ten participants in each of the six randomized lists. Once contact was made with a participant, brief information about the study was provided, followed by an invitation to participate in an interview via telephone call, virtual video call (facetime, zoom, WhatsApp, Facebook messenger), or in person (only if pandemic restrictions were not in place). If a participant indicated interest, a day and time was set up for the interview – up to two additional scheduled appointments were allowed if the participant did not attend the scheduled interview.
Excel spreadsheets were used to provide an audit trail of the recruitment process, which included keeping detailed notes of all attempts to contact eligible participants. Information gathered included: the number of times individuals were contacted and when; if their phones were disconnected or no other up-to-date contact details were available; if they declined to participate; or if they did not attend their scheduled interview. Most individuals were contacted via telephone if a number was available (four out of five randomized participants with whom attempts to contact were made, had mobile telephone numbers) followed by two text messages if the number was disconnected. Seven were contacted via email or Facebook.
Once attempts to contact the first ten participants in each of the six randomized lists were exhausted the next ten participants from the lists were contacted. The process was repeated until the target number of participants for theoretical saturation was reached (n = 76).
After the first twenty interviews were complete, the recruitment process was reviewed by the research team, which involved researchers with extensive experience in both quantitative and qualitative research methods and research with people who use drugs. No changes were made to the process, and the remaining participants were contacted using the same sequential approach described above.
Recruitment and Participation Summary
Reasons for Non-participation for Eligible Participants.
Seventy-six individuals participated in in-depth semi-structured interviews between August-October 2021 (during the final lockdown period in Victoria), thirty were conducted between December 2021 and January 2022, and 26 interviews were completed between March to April 2022. Most interviews were conducted via telephone (n = 56) or video call (n = 15). Five interviews were conducted in-person at the beginning and end of the recruitment period when no pandemic restrictions were in place. Interviews varied in duration from 15 to 80 minutes, with an average of 40 minutes.
SuperMIX/VMAX Statistical Comparison of Interviewed and Non-Interviewed.
Discussion
While the use of ethno-epi approaches in drug-related research has been growing, most published studies simply involve the integration of quantitative epidemiological data with qualitative data collected via in-depth interviews. Missing in the literature is a description of the use of probability sampling techniques to aid in the selection of participants for a qualitative study. We have addressed this gap by describing the application of a novel ethno-epi random sampling approach to recruit participants from two large prospective studies to a qualitative study to explore the impact of the pandemic on people who use drugs. In doing so, we have highlighted how this mixed method approach can improve the internal generalizability, credibility and rigour of qualitative research nested in prospective observational studies.
One of the important features that distinguishes qualitative from quantitative inquiry is the sampling strategy, with qualitative research typically involving theoretical or purposive sampling, which rests on the proposition that the selection of information-rich samples will provide/yield an in-depth view of the phenomena (Coyne, 1997; Suri, 2011). By contrast, many observational epidemiological studies or randomized trials tend to rely on random (or probability) sampling techniques with the aim of obtaining a representative sample for the purposes of statistical generalization across population groups (Sandelowski, 1997). The approach we used allowed us to capitalize on the strengths and contributions of both quantitative and qualitative research techniques, which helped increase internal generalizability across the two eligible cohorts we recruited from without compromising the depth or richness of the data collected (see Walker et al., 2023). We believe this approach helped improve the rigor and trustworthiness of our study. Furthermore, our process of identifying ‘flags of interest’ within the SuperMIX and VMAX datasets before randomization (to maximize the potential for information-rich samples) also provided a more efficient recruitment method than would have been achieved by screening each individual participant for eligibility.
Generalizability has been an ongoing contentious issue among social scientists within the non-positivist paradigm (Maxwell & Chmiel, 2014). Indeed, for some, generalizability has been considered a topic that should be ignored, or at best avoided, with some rejecting generalization across samples as a legitimate goal for qualitative research (Morse et al., 2002; Sandelowski, 1993). Nevertheless, qualitative scholars have taken different positions on questions concerning generalizability and how it should be defined (Maxwell, 2021; Osbeck & Antczak, 2021; Roald et al., 2021). For example, Roald et al. (2021) make the claim that not only is generalizability a relevant and important topic for qualitative research but that it is inescapable and core to the practice of human science. Furthermore, Osbeck and Antczak (2021) argue “qualitative methods cannot avoid having implications that transcend the original context in which the inquiry was conducted” (p. 64).
It is not our contention that the ethno-epi approach we used means that our qualitative understandings extend to the entire population of people who use drugs in Victoria, Australia during the pandemic. Rather, our approach provided an opportunity to efficiently obtain a more representative sample from within the two cohorts than could have been achieved via the sampling methods typically used in qualitative research (e.g., purposive or theoretical sampling). Our approach extends the work of Maxwell (2021), who has posed a distinction between two different types of generalization: internal and external generalization. Maxwell defines external generalization as “generalization beyond the person(s), setting, case, or time specifically studied to other persons, settings, cases, or times” (p. 113). In contrast, he refers to internal generalization as “generalizing within the setting, group, or population studied to persons, events, and activities that are not directly represented in the data collected (p. 113). The ethno-epi approach described here, we argue, helped minimise selection bias, and thus enhanced the potential for internal generalizability to the two large underlying cohorts of people we recruited into our study, so that qualitative findings were more reflective of the larger cohort. This claim is further supported by our observation that, except for Indigenous status, there were no statistically significant differences between the randomly selected participants who were interviewed and those who were not.
Internal generalization, not surprisingly, has been posed as more challenging for qualitative researchers because sample sizes are often too small to allow meaningful statistical inferences (Maxwell, 2021)—traditionally recruiting only to the point of data saturation (Morse, 2002) and often involving less than 30 participants (Marshall et al., 2013). Nonetheless, growing numbers of qualitative studies are utilising large data sets because it can extend opportunities for depth and breadth in the analysis of findings (Blodgett et al., 2005; Brower et al., 2019; White et al., 2012). As Brower et al. (2019) argue: … like binoculars that can focus near or far, large qualitative data sets enhance this function by allowing us to dial down to closely examine phenomena at the micro or individual level and then dial out to view phenomena at the macro or societal level (p. 5).
We believe our relatively large qualitative dataset (n = 76)—which was achieved by having access to two large prospective observational studies—increased the likelihood of achieving data saturation across the two stratified dimensions of sex (female vs. male) and geographical location (rural vs. metro). Moreover, as proposed by Maxwell (2021), an essential component for claiming internal generalization is “not just whether the results capture the typical views, characteristics, or actions of the persons involved but also the diversity of these within the case or population studied” (p. 114). Our simple process for selecting participants from each of the six stratified randomized lists, we argue, helped achieve this. At the same time, this technique allowed us to understand more fully the diversity of shared patterns of experiences, insights and views that differentiated participants from one another.
Transparency in sampling techniques used and decisions made along the way can increase the trustworthiness, validity and rigour of qualitative research results (Higginbottom, 2004; Tuval-Mashiach, 2017). For example, as Tuval-Mashiach (2017) has posed, “it could even be suggested that readers derive a greater benefit from seeing what has gone on ‘behind closed doors’ than from learning the study’s results” (p. 126), and that describing what was done, how and why, is critical for increasing the credibility and reliability of findings. By describing in detail, the process used to undertake our ethno-epi study, we have highlighted how quantitative sampling methods can be used to increase the dependability and robustness of qualitative data, without compromising its depth and nuance. Furthermore, we believe in doing so, our methodological descriptions provide important contributions to the literature on mixed methods research.
We acknowledge our approach is not without limitations. We understand the ethno-epi technique described here may have little relevance to the aims of some qualitative studies, particularly those where a large sample is deemed unnecessary and in-depth descriptions from a smaller number of participants may be all that is required.
In addition, we are cognisant that the technique described requires access to ongoing prospective cohort studies—a design which many regard as the gold standard for observational research (Black, 1996)—to ensure a large enough sample for randomization across strata. Although we had the benefit of being able to leverage our prospective observational cohorts with a total of more than 2000 participants, which provided sufficient numbers for random selection of characterised participants, we understand these studies are expensive to fund, and most qualitative research teams will not have access to such cohorts. We, therefore, encourage epidemiologists and others who conduct prospective observational studies keen to use this approach to explore opportunities for collaboration with qualitative researchers. We acknowledge, however, as Brower (2019) has highlighted, that requiring researchers from diverse methodological perspectives and, sometimes, different philosophical stances to work together can produce challenges if shared values and understandings of these diverse methodological paradigms are not communicated. Nevertheless, within our core research team—which comprised individuals with diverse methodological backgrounds (i.e., epidemiology, social science, biostatistics)—we found being able to draw on each other’s expertise also had the unintended consequence of enhancing the collective knowledge and skills of the team.
Although we make claims that our approach increased internal generalizability across the two large cohort studies we recruited from, we are cognizant that our sample for randomization were drawn from two non-randomized samples. Thus, despite our best efforts, it is possible that selection biases were already present in the cohort samples, therefore reducing the potential benefits of our randomized process for participant selection.
Finally, our inability to recruit all participants into the study in the order they appeared in the randomized lists reduced our capacity to achieve a pure probability sample - an issue that was precipitated by the ‘hard to reach’ nature of our population group (Bonevski et al., 2014). Furthermore, because the sampling approach involved contacting all participants in the randomized lists at least three times, much time was spent attempting to contact individuals who did not participate in the study. The social and economic circumstances of people who use drugs often present barriers to their participation in research (e.g., homelessness, mental health issues, fear of confrontation with legal authorities due the criminalised nature of drug use) (Marpsat & Razafindratsima, 2010; Shaghaghi et al., 2011). Ensuring these ‘hard to reach’ individuals have every possible opportunity to participate in research is necessary, however, for understanding their issues and needs. We argue, therefore, that although increased resources were required to accommodate the time taken to locate as many participants from the randomized lists as possible, it was a necessary component for enhancing the likelihood of achieving a more heterogonous and non-biased sample. Moreover, our sampling approach allowed for efficient recruitment of participants with the characteristics of interest to the study.
Conclusions
To our knowledge, this is the first published description of the application of an ethno-epi stratified random sampling approach to recruit participants into qualitative health research. We have highlighted how we were able to efficiently recruit a more internally representative sample into our study than would have been possible using purposive or snowball sampling approaches traditionally used in qualitative research. Through exploring and finding opportunities for embracing common ground, our collaborative cross-disciplinary approach, we believe, makes an important contribution to the growing number of new creative approaches being developed in the mixed methods research field. By sharing our account, we hope to encourage others conducting health-related studies to consider incorporating this approach as a method for gaining information-rich nuanced data, while also improving the internal generalizability, rigor and trustworthiness of the evidence gathered.
Footnotes
Acknowledgements
We would like to thank the SuperMIX and VMAX participants for the time and knowledge they contributed to the study. Thanks also to Burnet Institute and Monash University fieldworkers responsible for the recruitment and follow up of these cohorts.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the SuperMIX study is funded by the Colonial Foundation Trust and the National Health and Medical Research Council (#545891, #1126090). The VMAX study was established with a grant from the Colonial Foundation and is now funded by the National Health and Medical Research Council (#1148170). The Burnet Institute receives funding from the Victorian Government Operational Infrastructure Support Program.
