Abstract
The Short-Form-36 health survey (SF-36) is a factor-analytically derived measure of health-related quality of life (HRQOL; Ware & Sherbourne, 1992) created from a subset of items from the Medical Outcome Study Questionnaire (MOS; Stewart & Ware, 1992). The SF-36 items aggregate into eight scales (first-order factors): physical functioning (PF), bodily pain (BP), role limitations due to physical health problems (RP), general health (GH), role limitations due to emotional problems (RE), social functioning (SF), emotional well-being/mental health (MH), and energy/fatigue or vitality (VT). The eight subscales yield two second-order factors: the Physical Component Summary (PCS; PF, BP, RP, and GH) and the Mental Component Summary (MCS; RE, SF, MH, and VT; cf. Ware & Kosinski, 2001; Ware & Sherbourne, 1992).
Initially developed in the United States, the SF-36 was adapted and validated for international use by the International Quality of Life Assessment project (IQOLA; Ware & Gandek, 1998). The current version of the SF-36 (Version 2.0) was improved in 1996 with better instructions, scales of measurement, and item words while maintaining the composition of the scales (Ware, 1999). The improvements also included greater comparability to commonly used translations of the instrument and cultural adaptations that were based on Version 1.0, allowing for ease of comparison between the two versions to obtain information on HRQOL. The SF-36 is used in over 40 countries worldwide, including many Latin American countries (LAC; Ware & Gandek, 1998).
The Spanish version of the SF-36 was originally adapted by Alonso, Prieto, and Anto (1995) in general adult population from Spain and has been widely used in this country to study HRQOL in different diseases (Alonso et al., 1998; Failde & Ramos, 2000; Sánchez et al., 2012). Further research examining the psychometric properties of this instrument have been conducted in LAC, such as Brazil (Assumpcao et al., 2010; Gomes-Teixeira et al., 2015; Oliveira, Costa, Manzoni, & Cabral, 2014), Argentina (Augustovski, Lewin, Garcia-Elorrio, & Rubinstein, 2008), Mexico (López-Pérez et al., 2014; Martínez-Hernández, Segura-Méndez, Antonio-Ocampo, Torres-Salazar, & Murillo-Gómez, 2010; Zuniga, Carrillo-Jimenez, Fos, Gandek, & Medina-Moreno, 1999), Chile (Lera, Fuentes-Garcia, Sanchez, & Albala, 2013), Peru (Salazar & Bernabe, 2015), and Colombia (Arango-Lasprilla, Krch, Drew, De Los Reyes Aragon, & Stevens, 2012; Garcia, Vera, & Lugo, 2014). These studies have examined the psychometric properties of this instrument with either descriptive or theory-driven (i.e., a priori) statistical procedures.
Studies using descriptive statistical procedures have mostly focused on analyzing the validity and reliability of the SF-36 by testing its internal consistency with Cronbach’s alpha (Assumpcao et al., 2010; Augustovski et al., 2008; Garcia et al., 2014; Jordan-Marsh, Cody, Silverstein, & Garcia, 2008; Martínez-Hernández et al., 2010; Zuniga et al., 1999). The internal consistency varied among these studies, with coefficients ranging from low (Jordan-Marsh et al., 2008) to moderate and strong in adult samples of Argentina (Augustovski et al., 2008), Mexico (Martínez-Hernández et al., 2010; Zuniga et al., 1999), and Peru (Salazar & Bernabe, 2015). One study focused more on examining the predictive validity of the instrument in Spanish-speaking individuals (Assumpcao et al., 2010). Other studies did not assess the psychometric properties of the SF-36 in their samples (see Arango-Lasprilla et al., 2012; Garcia et al., 2014; López-Pérez et al., 2014). The cumulative findings from these studies suggest that the instrument overall has a moderate to strong internal consistency. However, Cronbach’s alpha is only an indicator of the closeness of the items in a scale but does not inform about its dimensionality or structure (T. A. Brown, 2014), that is, this measure of internal consistency does not allow for examining the structural validity of the SF-36 within each country.
Studies using theory-driven statistical procedures, such as confirmatory factor analyses (CFA), examined the measurement invariance of the instrument by looking at the relationship between the observed measures (i.e., behavioral ratings) and latent variables (i.e., construct measured), and testing how well measurement models generalize across groups (T. A. Brown, 2014). The original scoring algorithm for the SF-36 proposes an orthogonal (uncorrelated) model indicating that the two higher order constructs are independent (Ware & Sherbourne, 1992). This algorithm has been found problematic as it produces negative coefficients in some scales and negatively weighted scales of one component explaining the variance of the other component, suggesting an interdependence between PCS and MCS (Taft, Karlson, & Sullivan, 2001). Other studies have found better models using an oblique solution (Hann & Reeves, 2008; Laucis, Hays, & Bhattacharyya, 2015; Simon, Revicki, Grothaus, & Vonkorff, 1998). Thus, continued theory-driven research is important on this instrument to gain an accurate understanding of measurement invariance and cross-cultural factors that contribute to these differential findings.
Theory-driven statistical procedures to analyze the psychometric properties of the SF-36 have been conducted worldwide (Ware & Gandek, 1998). Although limited, the studies from Spain and LAC have been informative to our understanding of cross-cultural factors of HRQOL in Spanish-speaking populations. To our knowledge, five studies have used CFA statistical procedures to examine the psychometric properties of the SF-36 and found mixed results (Ayuso-Mateos, Lasa, & Vazquez-Barquero, 1999; Failde & Ramos, 2000; Lera et al., 2013; Peek, Ray, Patel, Stoebner-May, & Ottenbacher, 2004; Salazar & Bernabe, 2015). Most studies found that the first-order domains demonstrate some problems comparatively with the original factor structure (Ware & Sherbourne, 1992). For example, Ayuso-Mateos et al. (1999) found that five first-order factors best fit the model in their adult sample in Spain, while a similar study in Spain found that all eight first-order factors load except for some problematic items on the mental health domain (Failde & Ramos, 2000). Similarly, Peek et al. (2004) found that the RE domain was not loading as proposed in the original factor structure in their sample of Mexican Americans. The Peruvian study used CFA and found the model with eight first-order factors and two second-order factors had acceptable fit (Salazar & Bernabe, 2015). The Chilean study by Lera and colleagues (2013) used principal components factor analysis, which forces items to load onto a construct, with an orthogonal solution. Their findings supported two higher order factors (MCS and PCS) with GH loading on MCS (Lera et al., 2013). The Chilean study is the only study in LAC focusing on an older adult sample. The factor structure of the SF-36 should be examined to explore its measurement invariance and test its construct validity in each country interested in its clinical application. Of interest in this study is to examine the factor structure of the SF-36 in a sample of Costa Rican older adults.
Although most LAC share cultural-related characteristics and Spanish as their primary language, the differences between countries in older adults’ health status and self-reported health found in the Health, Well-Being, and Aging studies (SABE), led us to believe older adults’ HRQOL should be examined within each country’s cultural world (see Palloni & McEniry, 2007; Zunzunegui, Alvarado, Beland, & Vissandjee, 2009). To our knowledge, the present study is the first one in the Central America region using the SF-36 and examining its factor structure in a sample of cognitively healthy older adults. There is no research to date that has used theory-driven statistical approaches to study perception of health and well-being as measured by the SF-36 in Costa Rica. The purpose of this study is to examine the factor structure of the SF-36 in a sample of Costa Rican older adults followed by a model that evaluates the effects of personal characteristics in their HRQOL.
Method
The Epidemiology and the Development of Alzheimer’s Disease in Costa Rica (EDAD) is a large study of Latino cognitive aging conducted in collaboration with the Universidad de Costa Rica (UCR), the University of Kansas (KU), and the University of California at Davis School of Medicine (UCD). The EDAD was designed to assess functionality in domains of cognition, personality, psychosocial behaviors, physical health, and diet in older adults residing in rural and urban areas of Costa Rica. The overarching goal of this research project was to compare functional abilities of older adults in rural and urban residences on the various domains. Participants were recruited from the Institutional Program for the Older Adults of the University of Costa Rica, the Costa Rican Gerontology Association, the Golden Citizen Program of the Costa Rican Social Security Bureau, as well as community-dwelling older adults. Both the UCR and KU institutional review boards approved the study. All participants consented prior to taking part in this research study.
Participants
Participants included 250 Costa Rican older adults between the ages of 60 and 85 from the Greater Metropolitan Area of San Jose (urban region) and the Guanacaste province (rural region). As shown in Table 1, 74% of the sample were females and the ratio of female to male was relatively similar in urban and rural regions (~3:1 female to male ratio). Participants had a mean age of 68.28 years (SD = 5.98). In addition, 61% endorsed hypertension and 18% endorsed a diagnosis of diabetes; both conditions were reported to be well managed with medication. Eligibility requirements for all participants included normal to corrected vision and hearing, no present cognitive impairment as measured by the Mini-Mental Status Examination (MMSE > 23; Blesa et al., 2001; Folstein, Folstein, & McHugh, 1975), and regimented adherence to medication (within 30 days of testing). Participants were excluded for a past or current history of clinically significant major psychiatric disorder or significant psychiatric symptoms, clinically evident stroke, brain trauma, neurocognitive disorder, clinically significant infection within the last 30 days, history of drug or alcohol abuse or dependence within the past 2 years, and significant pain or musculoskeletal disorder.
Participant Descriptive Statistics.
Note. 11 respondents did not specify geographical region.
Procedure
Each participant completed the Spanish version of the SF-36 Health Questionnaire as part of a larger survey on health behaviors. The administered version is in coordination with the version adapted and translated by Alonso, Prieto, and Anto (1995) to measure HRQOL. Statistical analyses of the Spanish version indicated an acceptable internal consistency, with Cronbach’s alpha ranging from .70 to .90 (Alonso et al., 1995; Ayuso-Mateos et al., 1999). The Spanish version of this scale has been shown to be valid in Hispanic populations (Arango-Lasprilla et al., 2012; Augustovski et al., 2008; Garcia et al., 2014; López-Pérez et al., 2014; Martínez-Hernández et al., 2010; Salazar & Bernabe, 2015; Zuniga et al., 1999).
Data were collected with three-form planned missing design, where each subject answered two thirds of the items of the SF-36 scale (Graham, Hofer, & MacKinnon, 1996; Graham, Taylor, Olchowski, & Cumsille, 2006). The three-form planned missing design reduces subject burden and fatigue, increasing reliability and power. Due to the planned missing data design, multiple imputation was used to properly handle missing data (Enders, 2010). The data were imputed as ordinal keeping the categorical scale intact.
Statistical Analyses
We examined the underlying dimensions (i.e., factors) and the item–factor relationship (i.e., factor loadings) of the SF-36 Questionnaire in a sample of Costa Rican older adults using CFA. Relevant items were reverse scored and multiple imputation was implemented using the Bayes estimator in Mplus 7.2 (Muthén & Muthén, 1998-2012) to generate 100 imputed data sets. Three alternative factor structures of the SF-36 were tested. The first model was with eight first-order factors corresponding to the eight scales (PF, RP, BP, GH, VT, SF, RE, MH) and no higher order factors. The second model added to Model 1 two higher order factors represented by the PCS (PF, RP, GH, VT) and the MCS (VT, SF, RE, MH). The third model added to Model 1 one higher order factor representing overall health (PF, RP, BP, GH, VT, SF, RE, MH).
All factor models were identified and evaluated using the parameters described in this section. The response items on the SF-36 Questionnaire are categorical, so we used the robust weighted least squares (WLSMV) in Mplus 7.2 (Muthén & Muthén, 1998-2012) to estimate model parameters as this is the most appropriate estimator for categorical data (Curran, West, & Finch, 1996; Hu, Bentler, & Kano, 1992). Latent variables were defined by the corresponding item of the SF-36 Questionnaire. Model identification was set using a fixed factor variance method so that the latent mean is fixed to 0 and latent variance is fixed to 1. The higher order models utilized marker variable for the lower order factors and fixed factor for the higher order factor(s). Marker variable fixes the first item to 1 and freely estimates the factor variance. Factor loadings between measured indicators and between latent variables were allowed to vary without constraint.
Model fit was evaluated on overall goodness of fit. Chi-square analyses were conducted to evaluate overall goodness of model fit (χ²) with nonsignificant values indicating poor model fit. Additional model fit indices, such as comparative fit indices (CFIs) and root mean square error of approximation (RMSEA) were also evaluated for greatest specification of overall model fit. CFI values equal to greater than 0.95 and RMSEA values equal to or smaller than 0.05 indicate good model fit (Hu & Bentler, 1999), with CFI greater than 0.90 and RMSEA less than 0.08 considered acceptable (T. A. Brown, 2014). We did not conduct model modifications to assess for model improvement fit to data because the goal of this project was to determine if the factor structure of the SF-36 could be replicated in our sample of cognitively healthy older adults in Costa Rica.
In subsequent analyses, region, age, and education were added as covariates and the a priori α was set to .05. Region was implemented as a dummy variable predictor of latent constructs that were scaled to a mean of 0 and SD 1, and the coefficients were automatically interpretable as latent d effect sizes. Urban was set as the reference group, with differences estimated for rural participants. Covariate for education was included in the analyses as the highest level of completed education with five dummy variables (elementary school, incomplete high school, high school, incomplete university, and university) compared with incomplete elementary school. Furthermore, age was mean-centered and then entered as a covariate.
Results
Preliminary Analyses
Based on the proposed factor structure, all combinations of items within a factor were placed in contingency tables. Responses were inspected to look for sparseness of responses in some answer choice categories and lack of variance in pairs of responses. From the item inspection, we decided to omit Item 12 (limits in “bathing or dressing yourself”) due to a large percentage of participants answering “no, not limited at all,” and Item 15 (“were limited in the kind of work or other activities”) for having identical answers to Item 13 (“cut down the amount of time you spent on work or other activities”). Furthermore, the first two response choices were collapsed into a single choice for certain questions due to too few selected responses across the two choices. On the GH domain, Items 33 to 36, response choices “definitely true” and “mostly true” were collapsed. On the Vitality and Mental Health domain, Items 23 to 31, response choices “all of the time” and “most of the time” were collapsed.
After removing Items 12 and 15, and collapsing categories, CFA model fitting was conducted (Bentler & Mooijaart, 1989; Jöreskog & Sörbom, 1996; MacCallum, Browne, & Cai, 2006). Each of the eight factors proposed in the original model of the SF-36 fit well, but their loading onto higher order latent constructs was highly correlated, introducing instability into the CFA, nested model testing procedure. The first pair of problem factors was RP and RE. Each factor is composed of questions following a prompt about problems in the last 4 weeks, and the first two questions in each set have similar meaning (i.e., “cut down the amount of time” and “accomplished less”). As shown in Figure 1, correlation residuals were added between Questions 13 and 17 and between Questions 14 and 18. This minor change reduced the correlation between RP and RE to acceptable levels. The second pair of problem factors was BP and SF, so each model was reestimated twice, once without BP and once without SF. Dropping either factor added stability to the model but did not change the general results, as shown in Tables 2 and 3.

Final model with factor loadings.
Fit Indices for the Three Factor Structures and All Models at Each Stage.
Note. Because robust weighted least squares estimator was used with imputed data, available model fit statistic are available as a point estimate and standard deviation across successful estimations. RMSEA = root mean square error of approximation; CFI = comparative fit index; TLI = Tucker–Lewis index; BP = bodily pain; SF = social functioning.
Correlations Between Constructs for Model 1 Based on All or Subset of First-Order Factors.
For all correlations, p < .001.
Note. Bolded values indicate latent correlations > 1.00, a condition that produced a non-positive definite result.
As shown in Table 1, participants from the urban region (n = 168) had a mean age of 67.54 years (SD = 5.67) and those residing in the rural area (n = 71) a mean age of 69.85 years (SD = 6.25), being statistically different, t(237) = −2.79, p < .001. The mean level of education for participants in the urban area (M = 14.18, SD = 6.21) was significantly higher than that of their counterparts in the rural region (M = 10.27, SD = 5.82), t(234) = 4.51, p < .01. These results suggest that age and level of education could be potential extraneous variables and therefore were added as covariates in our CFA models. Additional chi-square analyses did not show a significant difference between rural and urban participants for hypertension, χ²(1, N = 235) = .320, p = .57, φ = .037, and diabetes, χ²(1, N = 235) = 1.41, p = .24, φ = .07.
Model Comparison
Our results show that the observed data fit the eight first-order factors model well and that the inclusion of higher order factors did not significantly improve fit. Neither the two- nor the one higher order factor model fits the observed data appreciable better. Summary of model fit statistics are in Table 2 for the three alternate factor structure models of the SF-36 Questionnaire from our analyses. As shown in Table 2, factor loadings of individual indicators (i.e., factors or domains) changed very little or not at all whether the domain of BP or SF was omitted or not for all models. The largest observed difference from one model to the next was within the confidence interval of the same estimate in the other two models. In reference to size of factor loadings, ideally all standardized factor loadings (λ) should exceed 0.70, indicating that more than half of the variance for each item is due to the factor. As shown in Figure 1 (depiction of Model 1), the factor loading for Question 35 (“I expect my health to get worse”) was 0.37, well below 0.70, indicating that only 13.7% of its variance for this question was explained by GH. Examination of sample statistics showed that this item correlated highly (r = .53) with only one other question on the factor: Question 33 (“I seem to get sick a little easier than other people”). The other correlations ranged from .05 to .24. Factor loadings for three other items were less than 0.70, but the 95% confidence interval for those items contained 0.70. Those questions were 33 and 34 on GH and Question 24 on MH.
When the two higher order factors were introduced in Model 2, there was no impact on the factor loadings for the lower order factors with the exception of four domains (BP, SF, RP, and RE). The factor loadings for BP and SF changed slightly because these items were constrained to equality for identification purposes in the other models. However, as shown in Table 3, removal of either BP or SF did not change factor correlations for the lower order factors. Furthermore, PF was correlated at the same level with all other factors (those loading on PCS and MCS), except MH, and RP was highly correlated (r = .94) with RE. These statistical changes suggest cross loading between PCS domains and MCS domains. In order for higher order factors to improve model fit, the correlations between the Physical Health domains should be higher than with any Mental Health domain. This is not the case in these Latin American older adults. Due to the high cross loading of first-order factors when the higher order models were analyzed, Model 1, with no higher order factors, best represents HRQOL in our sample.
Furthermore, the estimates in Model 2 with eight first-order factors and two higher order factors produced a correlation between PCS and MCS that exceeded .84. Although the exact statistic differed slightly for each of the three models, the estimates for each model remained within the confidence interval of the other two models (i.e., correlation was still within confidence interval). In this sample, the correlation in the model with all eight first-order factors was .88 (SE = 0.03). The correlation was .85 (SE = 0.04) in the model without BP. The model without SF estimated a latent correlation of .86 (SE = 0.04). Regardless of model, the correlation between PCS and MCS was large.
Effects of Covariates (Region, Age, and Education)
Given the results from the independent samples t test, we examined the effects of region, age, and education as covariates in our models. These variables were added to the model selected as the better of the higher order models (Model 2, two higher order factors) and the first-order model with no higher order factor (Model 1) to estimate change in overall health.
Region
Region (urban or rural) was added first to Model 1. Overall health on all factors was lower for rural participants with standardized coefficients that ranged from −0.23 SDs on RP to −0.77 SDs for SF. The effects of region on RP and RE were not statistically significant based on Wald statistics with probability that exceeded .05, which indicated that there was no statistical difference between urban and rural participants on these two factors. All other first-order factors showed that there were statistically significant differences by region. Two models were estimated to determine if omission of a factor would change results, and the effects of region did not differ in models in which BP or SF were omitted. The full list of coefficients for models is provided in Table 4, which also contains the coefficients for the two higher order factors model (with all eight factors) to determine if there were PCS and MCS differences based on region.
Effect of Age and Rural Geographic Location With Urban Participants as Reference Group.
p < .05. **p < .01. ***p < .001.
In Model 2, the effect of region was quite large, similar to the results in Model 1. In the model with all eight first-order factors, participants’ PCS in rural settings was 0.68 (SE = 0.18) standard deviations lower than their urban peers. MCS was 0.47 (SE = 0.17) standard deviations lower as well.
Age
In Model 1, statistically significant results were observed in PF, VT, and MH. The effect for age was negative, which indicated that, for every year increase in age, health ratings decreased with changes interpreted in terms of standard deviations on the factor scores. The largest effect was for PF (β = −.04, SE = 0.01). Both VT and MH decreased 0.03 standard deviations (SE = 0.01) for every 1-year increase in age. In Model 2 (two higher order factors), the effects of age were significant in PCS (β = −.03, SE = 0.01) but not MCS (β = −.02, SE = 0.01), as shown in Table 4.
Education
The results listed in Table 5 indicated mean differences on SF-36 responses changed little based on level of completed education, with one exception. In Model 1, participants with incomplete high school rated their health lower on VT (β = −.59, SE = 0.30), SF (β = −.95, SE = 0.41), and MH (β = −.71, SE = 0.29), all emotional health domains. Likewise, in Model 2, MCS rating were significantly different for those same participants (β = −.74, SE = 0.29). All of these statistically significant results had p < .05. These findings suggest that education does not have overall significant impact on perception of HRQOL, although some slight differences on emotion-based domains were noted.
Mean Health Differences and Standard Errors for Highest Completed Education Level as Compared With Incomplete Elementary School.
Note. Standard errors are listed below the estimate and enclosed in parentheses.
p < .05.
Summary of Results
Analyses identified two questions that could not be included in the models, namely, Items 12 and 15, as well as questions with sparse responses at one end of response choices. After removal of these indicators and collapsing of categories, high latent correlations were problematic for role limits (RP and RE) though correlated residual added due to similar wording and closeness of questions on the measure reduced the correlation to acceptable levels for estimation. All models were estimated 3 times to examine the factor structure with all eight factors and then without SF or BP. Estimates were affected very little by the omission of one of those factors, and final CFA results indicated that the best fit model was the model without higher order factors. The health ratings differed by region, with lower physical and mental health reported by participants from rural settings. Older age also influenced participants’ perception of their HRQOL. Finally, no health differences were found in the completed educational levels.
Discussion
The purpose of the present study was to assess the factor structure of the SF-36 in a sample of cognitively healthy older adults from Costa Rica. Based on modification indices, we revised the current model and evaluated alternative factor structures. Our findings show that the best fitting model of the SF-36 was the one with the eight first-order factor structure, indicating that the same first-order factor structure found to be valid for English (Ware & Sherbourne, 1992) and Spanish speakers (Spain: Alonso et al., 1995; Peru: Salazar & Bernabe, 2015; Chile: Lera et al., 2013) was also valid in this sample of Central Americans. The original empirical model of the SF-36 with eight first-order factors is supported in our sample of Costa Rican older adults.
As part of the model revision of the confirmatory procedure, we tested the seven first-order models that omitted BP or SF, which did not result in a superior factor model to the one with eight factors. We found that the eight-factor model was the most robust model, even when the two higher order factors (i.e., MCS and PCS) were included. These findings suggest that Costa Rican older adults’ HRQOL is better characterized by the scores of the eight domain scales than the two higher order component scores.
The Eight-Factor Structure of the SF-36
As we explained in the “Results” section, the specified model resulted in omitting some items, collapsing answer choices, and testing alternative models to add stability to the model. The source of instability came from the low variability in some item responses (i.e., Item 12), multicollinearity found on pairs of items (i.e., Items 13 and 15), and first-order factors (i.e., BP and SF). Recall that factors are best identified when multiple items are used in composite to define them. In the case of BP and SF, there were only two items that defined each factor. Thus, the item density of these factors was low. Furthermore, each contributing item had similarly worded descriptions. The item conceptual and grammatical overlap was not distinct enough for our Costa Rican sample to make fine discerned judgments leading to the items’ multicollinearity. Other studies have similarly found that these same constructs (BP and SF) have poor psychometric properties (i.e., low Cronbach’s alpha) and they do not clearly discriminate on which second-order factor they load (see Alonso et al., 1998; Ayuso-Mateos et al., 1999; Jordan-Marsh et al., 2008; Keller et al., 1998; Kelly et al., 2015; Oliveira et al., 2014; Peek et al., 2004; Y. Zhang, Qu, Lun, Guo, & Liu, 2012).
An alternative explanation to Costa Rican older adults’ perception of their HRQOL can be formulated from a cultural perspective. Our findings of multicollinearity between the BP and SF factors could be indicative of the cultural construction Costa Rican older adults have on pain perception and social functioning. Based on Yiengprugsawan, Welsh, and Kendig’s (2018) finding of a positive association between social connectedness and good self-rated health in mid- and later life, and on Biesanz, Biensanz, and Biesanz’s (1999) description of older generations of Costa Ricans as work-oriented and community-involved citizens, it is possible that Costa Rican older adults’ social and work-related activities positively influence their views on pain, physical, and/or emotional health–related problems. This interpretation would support findings from experimental studies in which distractibility, cognitive reappraisal (Wiech, 2016), and social support (J. Brown, Sheffield, Leary, & Robinson, 2003) reduces pain perception.
HRQOL as a Unidimensional Construct
Costa Rican older adults perceive health as a unitary construct, and their HRQOL is better characterized by analyzing the eight first-order factor scores only. The high correlation between PCS and MCS advises there was low discriminant validity between the physical and mental health composites, indicating that these constructs are not discriminated in this sample of Latin Americans (T. A. Brown, 2014). The cross-loadings of high correlation between domains of PCS and of those that load on to MCS suggest that even the relationship between the first-order domains was not clearly differentiated as purely physical or mental. These results converge with other reports in LAC, where high correlations between the SF-36 components have also been reported (see Salazar & Bernabe, 2015; Tucker, Adams, & Wilson, 2013).
The concepts of health and HRQOL are social constructions (see Olafsdottir, 2013) that are embedded within a particular social context. This perspective of the social construction of health could explain the nondistinction between a mental and a physical component of the SF-36 in our sample. Costa Rican older adults perceive their health as an overlapping single construct, not as two Cartesian discrete constructs in which the body and the mind are separate entities. Costa Ricans’ perception of their HRQOL is consistent with the dynamic relationship between mental and physical health found in aging studies (Ohrnberger, Fichera, & Sutton, 2017); and with the World Health Organization’s (WHO) view on health: “there is no health without mental health” (WHO, 2004, p. 10). The joint effect of social, physical, and mental health on HRQOL has been evidenced in studies conducted outside the Latin American region. For example, X. Zhang and colleagues (2018) found that different lifestyle factors (e.g., diet, sleep, exercise, labor, and leisure time) contributed to the perception of HRQOL in Chinese older adults.
HRQOL as a Function of Sociodemographic Characteristics
HRQOL of Costa Rican older adults differed by region of residency. Those residing in rural Costa Rica reported overall lower HRQOL compared with those living in urban Costa Rica. These geographical differences in health perception highlight that impact of social determinants of health as a fundamental cause of health inequalities. According to the WHO (n.d.), the social determinants of health consider the “conditions in which people are born, grow, live, work and age.” The fundamental cause of health inequalities is more specific in explaining that socioeconomic status (SES) can open or limit the access to resources that can prevent diseases or minimize its consequences (see Phelan, Link, & Tehranifar, 2010). These theories explain the effect of region on Costa Rican’s HRQOL we found in this study.
Costa Rica has superior health indicators compared with its Central American neighbors; however, significant health disparities within the country exist. Costa Rica is an LAC with a superior human development in the region (see the human development index from the United Nations Developmental Program [UNDP]), with a public health system (in Spanish: seguridad social) implemented on principles of social participation, solidarity, equity, and universality, which are financially mandatory, and with rural health programs such as the Community and Rural Health Program in Nicoya (Salas-Chavez, 2013). However, the inequalities within the country are moderately increasing (see UNDP), making health access difficult to reach in those provinces with lower social development (Castillo-Martinez, 2013). Rural Costa Rican older adults’ lower perception of their HRQOL is a reflection of the health and sociocultural disparities in their surroundings. Health and quality of life are not dependent on individual effort.
From the SABE studies, we know that differences in self-reported health among Latin American older adults are associated with education and income inequalities (Palloni & McEniry, 2007). Specific to Costa Rica, Rosero-Bixby and Dow (2009) found that smoking and sedentary behaviors were more common among low-SES older adults, compared with those in the highest SES bracket. These studies support the view that the HRQOL disparities found in our study can be explained by the fundamental cause of health inequalities.
As expected, we found that age is related to Costa Ricans’ view of HRQOL. Older participants reported having poorer PF, less energy, and lower emotional well-being. As age increased, reported health on these domains specifically was rated lower. It is important to note that age did not influence ratings on all domains of the SF-36. The association between age and HRQOL has been well documented and our results are consistent with those found in other studies (see Der-Martirosian, Kritz-Silverstein, & Barrett-Connor, 2010; Sullivan, Kempen, Van Sonderen, & Ormel, 2000). We want to highlight that even though poorer ratings of HRQOL on those three factors were related to participants’ age, domains associated with perception of pain, social functioning, role limitations, and general health were not significantly rated lower as a function of age. Age should be considered when evaluating HRQOL in older adults.
Finally, our results showed no mean differences in HRQOL between the highest and lowest completed level of education (i.e., university completed vs. incomplete elementary). This finding is suggesting that the differences in HRQOL between the urban and rural regions are the result of other sociocultural disparities but not of education. Future studies should aim at exploring the characteristics that explain the urban and rural differences found in the Costa Rican older adult population.
Conclusion
Our study examined the factor structure of the SF-36 in a sample of Costa Rican cognitively healthy older adults and found that the eight-factor model describes better this population’s HRQOL. In the model with two higher order factors, our findings show a high correlation between mental component and physical component scores, suggesting that these scores are not clearly distinguished as two separate constructs. Given the popularized use of the SF-36 in research and clinical settings, we strongly encourage consideration of the sociocultural context of Latin American older adults. We believe that researchers and clinicians should always be mindful of the underlying cultural values of health-related concepts (see Ngo-Metzger, Sorkin, Mangione, Gandek, & Hays, 2008). Those who use the SF-36 with Latin American older adults should use the mental and PCS scores with caution (see, Simon et al., 1998).
Validity studies of the SF-36 in Latin American older adult population are necessary for understanding of health perception of older adults within the region, and for developing country-specific scoring norms. In this effort, we recommend (a) testing measurement invariance with the specific populations of interest, (b) conducting preliminary analyses of items and distribution of responses to adjust properly the model specification step of the confirmatory process (see MacCallum, 2003), and (c) testing if changes on the structure of the BP and SF factors (i.e., expanding the items on each subscale) would result in improved model fit.
Overall, we examined the structural validity of the SF-36 and found the eight scales are valid constructs in our sample of cognitively healthy Costa Rican older adults. Our results support the use of the SF-36 in Latin American older adults. The interpretation of composite measures of physical health and mental health should be conducted with caution as they may not be distinctive in the way Latin Americans view health.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by Fogarty International Center (FIC) Epidemiology and Development of Alzheimer’s Disease in Costa Rica (R21TW009665), National Institute on Aging (NIA) Alzheimer’s Disease Core Center (P30AG010129), NIA Center to advance cognitive health and health care in older Latinos (Resource Centers for Minority Aging Research [RCMAR]; P30AG043097).
