Abstract
Background:
In low-income settings, clinical trial participation may influence participant behavior, including among control groups. Increased access to care and heightened health awareness during trial enrollment could lead to altered behaviors, a phenomenon known as the Hawthorne effect, which may obscure true intervention impacts; however, this effect remains poorly studied in low-income environments.
Aim:
To conduct a secondary exploratory analysis of healthcare utilization among control participants of a randomized clinical trial (RCT).
Methods:
We retrospectively analyzed electronic medical records from the control arm (n = 26) of an RCT involving low-income Hispanic adults with type 2 diabetes receiving care at a community clinic. Before randomization to a 12-month diabetes education intervention or usual care (control), participants underwent on-site measurements of HbA1c, blood pressure, and weight. Healthcare utilization among control participants was compared during the year before and throughout the study, including all types of exposures: provider visits and other services (eg, orders).
Results:
Total healthcare utilization was similar between the pre-period (11.9 exposures/year) and the study-period (11.4 exposures/year; P = .93), with no significant changes across visit types. There were no significant differences in fitted mean monthly visits between the pre- and study-periods (P = .93), nor over time (P = .89).
Conclusions:
This exploratory study found no evidence of a Hawthorne effect on healthcare utilization among control participants. While this may suggest consistent healthcare behaviors, it may also highlight an important public health concern: individuals in low-income settings may lack the resources to translate increased awareness into health-related action. Larger studies are needed to further elucidate behavioral patterns in low-income populations.
Introduction
In low-income healthcare settings, participation in clinical trials can influence patient behavior, including among control group members. The Hawthorne effect refers to behavioral changes that occur when individuals are aware they are being observed or studied. 1 First identified in the 1920s, this phenomenon has been widely examined across disciplines such as psychology, sociology, and healthcare. 1 Its impact on healthcare research is particularly important, as awareness of observation can influence study outcomes. 2 Participants may improve protocol adherence, make healthier choices, or alter their behaviors and self-reports simply because they know they are being studied.3 -6 This effect can lead to either underestimation or overestimation of an intervention’s true impact, potentially compromising the validity of research findings.6 -9
Despite widespread recognition of the Hawthorne effect, significant gaps remain in understanding this phenomenon. The original Hawthorne studies were limited by methodological weaknesses, including the absence of a control group and failure to account for confounding variables. 10 Recent research has struggled to consistently quantify the Hawthorne effect, hindered by heterogeneity in study populations and designs, as well as differences in the nature and frequency of observations intended to elicit the effect.2,11,12 These challenges have led some investigators to suggest that observed behavioral changes may stem from external factors such as experimental bias, context-dependent influences, or methodological flaws, rather than a genuine Hawthorne effect.2,3,12 A systematic review of 14 studies reported a minimal Hawthorne effect (odds ratio [OR] = 1.17; 95% confidence interval [CI] = 1.06–1.30), but heterogeneity limited conclusions about its mechanisms and conditions. 2 Another review reported a stronger effect (OR = 1.41; 95% CI = 1.13–1.75); however, the authors noted that the effect was absent in well-designed studies and cautioned that it may overlap with placebo effects and regression to the mean. 12
The Hawthorne effect remains underexplored within low-income populations, where chronic disease prevalence is highest and the influence of socioeconomic factors is not well understood. It is possible that vulnerable groups may exhibit either increased or diminished susceptibility to this effect. Systemic barriers—including limited access to healthcare, language challenges, and economic stressors—profoundly shape health behaviors in these populations, potentially exerting a stronger influence than study participation itself.13 -15 These barriers may drive increased clinical encounters among control participants, thereby obscuring the true impact of interventions.
To better understand potential behavioral changes within the control arm of clinical trials, this study explored healthcare utilization among control participants in a randomized clinical trial (RCT) conducted at a nonprofit community clinic. Participants were randomized to receive either a 12-month, multidimensional diabetes education intervention or usual care (control). 16 This analysis focused on evaluating changes in healthcare utilization among control participants following receipt of baseline clinical measurements, including HbA1c. Specifically, we compared clinical encounters during the year before the study (pre-period, September 2021 to August 2022) to those during the study period (study-period, September 2022 to August 2023). We hypothesized that awareness of being observed and receiving baseline health information would lead to increased healthcare utilization, consistent with the Hawthorne effect.
Methods
Study Design and Setting
This cohort study is a secondary analysis of a 12-month RCT conducted at a non-federally funded, nonprofit community clinic in Greater Houston, Texas, serving low-income populations. 16 To qualify for clinic services, patients were required to be uninsured and have an annual income of ≤150% of the federal poverty level. More than 50% of the clinic’s patients were undocumented. The study was approved by our institutional review board, and all participants provided informed consent.
Participants
Potential participants were identified through a database by coding for the following inclusion criteria: Hispanic/Latino(a), type 2 diabetes (ICD-10 E11.X), and adult (≥18 years). During the consent process, research staff confirmed that individuals spoke Spanish. We excluded individuals who did not attend any clinic appointments during the 12-month study, were pregnant during the pre-period and/or study-period, or had conditions that may alter HbA1c levels, such as intermittent steroid use.
Intervention
The intervention was described in the primary study. 16 Briefly, individuals were randomized to a 12-month multidimensional diabetes program, called TIME (Telehealth-supported, Integrated Community Health Workers (CHWs), Medication Access, and group Education), as the intervention group, or to usual care in the clinic as the control group. Usual care consisted of routine provider encounters for chronic disease management, typically scheduled monthly to semi-annually depending on HbA1c control. Usual care also included opportunities for food assistance, nutrition education, children’s programs, and pastoral care. Intervention participants received communication from CHWs through text and phone calls 2 to 4 times per month, monthly diabetes education videos created by the research team on YouTube, and 8 group visits that included CHW-led diabetes education and provider encounters for disease management.
Measures
We gathered monthly clinic usage data for the control arm for the pre-period and the study-period through chart review in the electronic medical record. We included the total number of visits per month and categorized them into the following visit types: provider (MD, DO, or Advanced Practice Provider), nurse (eg, blood pressure checks), medication (eg, refills), and other (eg, lab orders and eligibility paperwork).
Statistical Analyses
All analyses were performed using SAS 9.4 (SAS Institute, Cary, NC). Time series data of the total number of monthly visits were generated for each participant. Summary statistics, including means and standard deviations, were calculated for each month and across the year for both the pre-period and study-period. Interrupted time series models were used to compare the level (intercept) and changes over time (slope) in visits between the pre-period and study-period. Testing differences in intercepts and slopes indicates whether there are significant changes over time.
To account for within-subject correlation, we used a generalized estimating equations (GEE) model with a Poisson distribution to model monthly visit counts. The estimated monthly visit numbers, along with 95% CIs, were presented to visualize the model-estimated trend for the 2 periods. Due to the sparse number of visits for subtypes, we used binary indicators (Yes vs No) for the monthly visits. The GEE model with a binomial distribution was then used, along with the interrupted time series model, to display predicted probabilities of monthly visits with 95% CIs. To compare the number of visits in a year between the 2 periods, the Wilcoxon signed-rank test was used. All tests were 2-sided, with a significance level of P < .05.
Results
A total of 26 participants were enrolled in the control arm of the study. Baseline demographic and clinical information is reported in Table 1. The average age was 52.9 years (SD = 7.4). The group included 16 females (61.5%) and 10 males (38.5%). Most participants were employed, primarily in domestic (34.6%) and manual labor (23.1%) occupations. Thirteen participants (43.3%) had hypertension, and 1 participant (3.9%) had chronic kidney disease. None had a history of coronary artery disease or cerebrovascular accident. The mean HbA1c was 7.65% (SD = 1.83), and the average duration of diabetes was 6.9 years (SD = 6.0). Of the 26 participants, 24 (92.4%) received oral glucose-lowering medications only.
Baseline Demographics and Clinical Information for the Participants (n = 26).
Mean laboratory values were as follows: total cholesterol, 178.3 mg/dL (SD = 43.4); HDL cholesterol, 38.9 mg/dL (SD = 10.3); LDL cholesterol, 106.0 mg/dL (SD = 32.9); and triglycerides, 157.4 mg/dL (SD = 131.0). Mean systolic blood pressure was 129.4 mmHg (SD = 13.8), and mean diastolic blood pressure was 80.1 mmHg (SD = 8.1). The average BMI was 31.7 kg/m2 (SD = 4.8), and the mean weight was 82.1 kg (SD = 12.9).
Figure 1 presents the monthly visit counts per study participant during the pre-period and study-period. Mean monthly visits ranged from 0.65 to 1.69 visits/month in the pre-period and from 0.58 to 1.35 visits/month in the study-period. The mean change in monthly visits from the pre- to the study-period ranged from −1.04 to 0.42. The Supplemental Table provides a monthly breakdown of total visits for all participants, comparing the pre-period to the study-period.

Individual monthly total visits by period.
Figure 2 displays the estimated number of monthly visits with 95% CIs from the interrupted time series model. There was no significant difference in the fitted mean number of visits in the first month of each period (pre/study-period ratio = 0.98; 95% CI = 0.66, 1.46; P = .93). There was no significant difference in changes during the 12 months for the pre-period and study-period, as both curves remained flat with a monthly increment ratio close to 1. The slope comparison also showed no significant difference (pre-/study-period slope ratio = 1.00; 95% CI = 0.93, 1.06; P = .89). These results indicate that there was no significant difference in trends, either at baseline or in monthly changes, between the 2 periods.

Interrupted time series model depicting the fitted total monthly visits for pre-period and study-period groups (pre: months 1-12, study: months 13-24).
Figure 3 presents the results of an interrupted time series model, describing the predicted probabilities by month for provider, nurse, non-provider (medication), and non-provider (other) visits. There were no significant differences in the predicted probability of a visit at month 1 between the pre-period and the study-period for provider visits (OR = 0.85; 95% CI = 0.53, 1.38), nurse visits (OR = 1.40; 95% CI = 0.32, 6.17), non-provider (medication) visits (OR = 0.67; 95% CI = 0.19, 2.35), or non-provider (other) visits (OR = 1.05; 95% CI = 0.62, 1.78). Additionally, the slope ORs for both periods showed no significant trends in the probability of visits over time for any visit type. There were no significant differences in the slope OR for provider visits (OR = 1.01; 95% CI = 0.93, 1.09), nurse visits (OR = 0.84; 95% CI = 0.67, 1.05), non-provider (medication) visits (OR = 1.11; 95% CI = 0.90, 1.36), or non-provider (other) visits (OR = 1.02; 95% CI = 0.93, 1.12).

Interrupted time series model for binary outcomes, demonstrating the predicted probabilities by month (pre-period: months 1-12, study-period: months 13-24) for provider, nurse, and non-provider visits with 95% confidence intervals (CIs).
Table 2 summarizes yearly totals for all visit types in the pre-period and study-period. Median total yearly visits increased from 10.5 (IQR = 8, 18) in the pre-period to 12 (interquartile range [IQR] = 6, 16) in the study-period, though the median change was −0.5 visits (IQR = −5, 3). For provider, nurse, and non-provider (medication) visits, the median annual change was 0: provider: 0 (IQR = −2, 1); nurse: 0 (IQR = −1, 0); non-provider (medication): 0 (IQR = 0, 1). Non-provider (other) visits showed a slight increase, with a median change in annual visits of 0.5 (IQR = −1, 2).
Yearly Summations for Total, Provider, Nurse, Non-provider (Other), and Non-provider (Medication) Visits.
Discussion
We conducted a secondary exploratory analysis of healthcare utilization among control participants in a 12-month RCT conducted in a low-income setting. Based on prior literature on the Hawthorne effect,2,12 we hypothesized that control participants would increase healthcare utilization during the study compared to the year prior, due to their awareness of study involvement and receipt of clinical information. However, this hypothesis was not supported; no significant differences in utilization were observed. Despite gaining knowledge of their clinical values, control participants did not alter their healthcare utilization. This finding is important for investigators working with low-income populations, where concerns may arise that increased clinical contact or access in the control arm could inflate healthcare utilization and mask true intervention effects.
Several explanations could account for the lack of behavioral changes observed in the control arm. One possibility is that participants did not fully understand their clinical values and therefore did not find it necessary to follow-up. While this is plausible, it is important to note that values were entered into the electronic medical record for provider follow-up, and urgent values were addressed immediately. Another possibility is that the Hawthorne effect may not be as potent as initially suggested. A systematic review of 19 studies revealed some evidence of the effect; however, the high degree of study heterogeneity made it challenging to determine the degree or conditions under which it occurred, and observed alterations within the study setup could have been influenced by factors such as the novelty of the study or various forms of bias. 12
Furthermore, it is possible that manifestations of the Hawthorne effect differ in low-income settings. Healthcare in these settings differs significantly from that in higher-income populations due to limited access to care, reduced ability to implement lifestyle and other changes (eg, modifications in medications or diet) in the context of restricted income, low health literacy, transportation barriers, and various social and economic determinants of health.17,18 After receiving their baseline measurements, control participants may have been unable to schedule follow-up appointments due to limitations in clinic infrastructure, lack of internet access, inflexible work schedules, and associated costs. In higher-income settings, individuals who learn of a poor clinical outcome are more likely to secure follow-up appointments, given the lower likelihood of encountering these barriers. Thus, access to care may remain the primary obstacle to healthcare for low-income individuals, rather than mere knowledge of their health conditions.
Strengths, Limitations, and Future Directions
The study contributes valuable data on healthcare behaviors in the control arm within a low-income setting, a population often underrepresented in clinical trials despite being disproportionately affected by chronic diseases. 19 By examining outcomes in the absence of an active intervention, the study provides important context for understanding baseline utilization patterns in low-income settings. The transparent discussion of null findings further underscores how structural barriers, such as limited access to care and competing socioeconomic demands, may mitigate the behavioral effects of study participation. This perspective highlights the critical role of contextual factors in shaping intervention outcomes and adds to the growing literature on health disparities and implementation science.
The study findings are limited by their exploratory nature and the small sample size of Hispanic participants from a single site, which may reduce generalizability and the ability to detect subtle behavioral changes. Several potential confounding factors may have influenced the observed patterns of healthcare utilization, independent of the intervention itself. For instance, individuals with higher baseline HbA1c levels or more complex medical needs may naturally require more frequent clinic visits, regardless of study participation. Similarly, variations in health literacy could affect both participants’ understanding of their health status and their likelihood of engaging in behavior change, potentially influencing utilization outcomes. Unmeasured comorbidities may also contribute to increased healthcare needs, while external factors, such as seasonal fluctuations in clinic operations or concurrent community health initiatives, could have impacted visit frequency. Recognizing these potential confounders is important for interpreting the findings and highlights the challenges of evaluating behavioral interventions in real-world, low-income settings.
The study findings highlight the need for future mixed-methods studies that combine qualitative interviews with quantitative data, particularly in households classified as ALICE (Asset Limited, Income Constrained, and Employed).20,21 Such research could better identify actionable barriers to optimal diabetes care in resource-restricted communities, ultimately informing policy and practice.
Conclusions
This study explored behavioral changes among control participants in low-income settings, testing the hypothesis that awareness of baseline health measurements would increase healthcare utilization in the control arm. This would have supported the Hawthorne effect, but no such effect was observed. This finding may reflect the structural and socioeconomic barriers faced by individuals in low-income communities, which can limit their ability to translate awareness into action. Although limited by small sample size, these exploratory findings offer preliminary insights into how structural barriers, more than patient knowledge, may influence healthcare behaviors in low-income populations, underscoring critical considerations for future interventions.
Supplemental Material
sj-docx-1-jpc-10.1177_21501319251379740 – Supplemental material for Healthcare Utilization Unchanged in the Control Arm of a Randomized Clinical Trial
Supplemental material, sj-docx-1-jpc-10.1177_21501319251379740 for Healthcare Utilization Unchanged in the Control Arm of a Randomized Clinical Trial by Pratik Gongloor, Saad Nadeem, Xiaoying Yu, Mukaila Raji, Kristina D. Mena and Elizabeth M. Vaughan in Journal of Primary Care & Community Health
Footnotes
Ethical Considerations
The study was approved by the Baylor College of Medicine Institutional Review Board (#H-49672).
Consent to Participate
Informed consent was obtained from all individual participants included in the study.
Consent for Publication
The authors consent to publish.
Author Contributions
EV, PG, and XY contributed to the study conception, design, and data collection. XY contributed to the statistical analysis. PG, SN, XY, MR, KM, and EV contributed to the manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the by the National Institutes of Health, National Institute of Diabetes, Digestive, and Kidney Diseases Federal Award, grant number DK129474 (Principal Investigator: Vaughan).
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Deidentified participant data are available from the corresponding author upon reasonable request.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
