Abstract
Research has demonstrated that adequate parenting is an important determinant of a healthy social-emotional development in children. There is a need for valid assessment tools for measuring parenting quality, particularly in clinical settings. The Caregiver Wish List (CWL) is a new 53-item interview-based scale for assessing parenting practices. We examined the CWL’s factor structure in a sample of 348 parents of children (4-11 years), 220 were drawn from the general population and 128 from a clinical setting. Exploratory factor analysis revealed five factors, which did not fully correspond with the hypothesized, original factor structure. Nonetheless, the extracted factors were meaningful and could be labeled as follows:
Ineffective parenting is the most widely studied and empirically established risk factor for externalizing behavior problems in children (DeBaryshe, Patterson, & Capaldi, 1993; Forgatch, Bullock, & Patterson, 2004; McCoy, Frick, Loney, & Ellis, 1999; Nix et al., 1999; Oliver, Guerin, & Coffman, 2009; Patterson, Forgatch, Yoerger, & Stoolmiller, 1998; Sharma & Sandhu, 2006; Williams et al., 2009). In particular, harsh and inconsistent discipline, poor monitoring and supervision, and low positive involvement have all been demonstrated to be significantly associated with young people’s oppositional-defiant and conduct problems (Bierman & Smoot, 1991; Cunningham & Boyle, 2002; Ehrensaft et al., 2003; Frick et al., 1992; Nicholson, Fox, & Johnson, 2005). Many interventions for childhood behavior problems focus on increasing effective parenting and decreasing the use of ineffective rearing practices. Research generally supports the theoretical notion that improving parenting practices is the most important mechanism contributing to a decrease in children’s behavior problems (DeGarmo & Forgatch, 2005, 2007; Forgatch & DeGarmo, 1999; Kazdin, 2007; Martinez & Forgatch, 2001; Ogden & Amlund-Hagen, 2008). Improvements in the behavior problems of the children also tend to reduce stress and depression in parents, which in turn will increase the likelihood that the effective parenting practices will be maintained, thereby promoting the long-term effects of therapy (DeGarmo, Patterson, & Forgatch, 2004).
Many parent training programs are based on the Social Interaction Learning (SIL) model (Forgatch et al., 2004). The SIL model assumes that the psychosocial development of the child is directly influenced by the parents and their parenting strategies. Contextual factors, however, can have a negative impact on the parenting quality, and thus, indirectly on the child. For example, when parents are going through a divorce, this puts pressure on the parenting quality (Conger, Patterson, & Ge, 1995; Hetherington, Bridges, & Insabella, 1998). According to the SIL model, there are five core effective parenting practices: skill encouragement, discipline, monitoring, problem solving, and positive involvement.
Many studies rely on observations of parent–child interactions as a method to assess parenting skills (Martinez & Forgatch, 2001; Ogden & Amlund-Hagen, 2008; Patterson, DeGarmo, & Forgatch, 2004). For instance, parent and child can be observed during a structured playroom session or in a natural situation at home or in school. Self-report measures can be used in addition to observation measures of parenting. In fact, there are indications that a multi-method assessment of parenting possesses incremental validity over using only one measurement method (Harvey, Danforth, Ulaszek, & Eberhardt, 2001; Meyer et al., 2001).
There are many self-report measures that can be used for the assessment of parenting, although there is a wide variety in terms of definition and theoretical framework. The Caregiver Wish List (CWL; Hodges, 2002) is a structured interview-based instrument that enables parents and other caregivers to reflect on their own parenting practices and talk about their experiences with parenting. Interestingly, the CWL was designed to measure the core parenting practices of the SIL model and seems to be particularly useful in clinical practice for a number of reasons. First, the CWL may help clinicians to gather information on the strengths and weaknesses in parenting practices as perceived by parents themselves. Furthermore, the CWL can be expected to enhance the therapeutic alliance and the positive engagement between parents and the clinician. Finally, the CWL may assist in identifying and clarifying the goals of treatment. The goal of the CWL is to empower parents by encouraging them to see themselves as the main agent of change for their child (Hodges, 2005). The CWL contains six a priori domains of parenting skills: providing direction and following up, encouraging good behavior, discouraging undesirable behavior, monitoring activities, connecting positively with the child, and problem solving orientation.
In the Netherlands, the CWL is being used as an intake assessment for Parent Management Training–Oregon model (PMTO), a parent training program that teaches parents effective parenting skills according to the SIL model with the aim of reducing externalizing behavior problems in children aged 4 to 12 years. This makes sense as the CWL directly measures the parenting skills that are the main focus of change in PMTO. A Dutch translation of the CWL was also employed in a large-scale study evaluating the effectiveness of PMTO in The Netherlands (Thijssen, Vink, Muris, & de Ruiter, 2017). However, in that study, it was found that the reliability of the CWL was rather poor, with only one of the six original parenting domains showing adequate internal consistency. In fact, to our knowledge, no study can be found that examined the psychometric properties of the CWL. The aim of the present study was a first attempt to fill this gap. The CWL was administered in a large sample of parents of children from the community as well as a clinical setting visited by parents of whom children displayed disruptive behavior problems. In this way, it became possible to explore the factor structure of the instrument as well as its reliability (internal consistency and correlations between factors) and validity (relations between parenting scores and children’s psychopathological symptoms).
Method
Participants
The total sample consisted of 348 parents (92.1% mothers) of 4- to 11-year-old children (
Demographic Data and Significant Differences for the Community and Clinical Sample.
Procedure
The parents of the community sample were recruited in elementary schools where information letters explaining the nature of the study and consent forms were distributed, together with a brief survey to obtain background information on the parents (e.g., ethnicity, educational level, employment status). When parents agreed to participate, they signed the consent form, filled out the demographic characteristics survey, and returned these materials to the researchers via the child’s teacher at school. The parent who spent most time with the child (main caregiver) was contacted to make an appointment for the CWL interview. The CWL interview was administered at the child’s school and lasted for about 30 min. As a reward, parents received a €10 gift voucher. This study was approved by the Medical Ethical Committee of the Academic Hospital of Maastricht University (NL19855.068.07/MEC 07-3-091).
The parents of the clinical sample were recruited as part of a study on the effectiveness of PMTO in children with disruptive behavior problems. As soon as families were referred to the child service agency, it was checked whether they met the inclusion criteria for the study (for details, see Thijssen et al., 2017). Families who met the criteria received information about the study and its procedure and were invited to participate. When parents agreed, they were asked to give their written consent. The CWL interview was completed by the main caregiver and mostly took place at the child service agency. For the present study, only the CWL data that were obtained prior to the treatment were employed.
Instruments
As noted in the introduction, the CWL (Hodges, 2005; Hodges, de Ruiter, & Thijssen, 2009) is an interview-based instrument consisting of 53 items questioning the parent about his or her parenting skills. The interviewer reads the questions to the parent, who has to indicate the most applicable response option using a 5-point Likert-type scale. The response options are specific to the question, although they mostly refer to the frequency of behaviors. The parent also has the option to respond with “not applicable” when the described situation does not apply to their family. Twenty-nine items are positively formulated, whereas 24 items are negatively phrased. The latter items are reversed so that higher CWL scores reflect higher levels of effective parenting. On the basis of conceptual work by Hodges (2002), items can be allocated to six a priori domains of parenting skills: providing direction and following up (four items), encouraging good behavior (five items), discouraging undesirable behavior (24 items), monitoring activities (13 items), connecting positively with child (three items), and problem solving orientation (four items). Each domain score can be regarded as a dimension with weak parenting skills on one end and strong parenting skills on the other.
The
Data Analyses
As no research on the factor structure of the CWL has been published, a confirmatory factor analysis was considered as less appropriate. Instead, we performed an exploratory factor analysis (EFA), using the preimposed conceptual structure as a guide in the process of exploration. Exploratory factor analyses were conducted with Mplus (version 5.21), using robust maximum likelihood estimation (MLE) to correct for the nonnormality in the distribution of the items. An oblique rotation method (geomin) was applied, as it seemed plausible that the extracted factors of positive parenting would be correlated. Unlike principal components analysis, EFA does not permit the quantification of the proportion of common variance explained. For an alternative indication of the quality of our factor solution, we decided to examine a few model fit indices that are usually reported in the context of confirmatory factor analysis: the Tucker–Lewis Index (TLI), the root mean square error of approximation (RMSEA), and the standardized root mean square residual (SRMR). These fit indices give complementary information. Although they should be considered with caution when conducting EFA, they provide a useful impression on the quality of the factor solution. Fit values that would indicate an accurate description of the original correlation matrix would be a TLI value larger than 0.95, a RMSEA value smaller than 0.05, and an SRMR value smaller than 0.08 (Hu & Bentler, 1999).
Even though the separate samples were relatively small in comparison with the number of items to be processed in the factor analysis, we decided to initially explore the factor structure for each of the samples separately. Using smaller samples in the factor analysis will increase the instability of the solution, but prevents possible confounding because of using very heterogeneous populations. Only in case of sufficient similarity in the factor solutions for the separate samples, it seemed justified to interpret the analysis for both samples combined.
To examine whether the CWL factors were significantly able to discriminate between the community and clinical sample, independent-samples
Results
Preliminary Considerations
Before conducting the factor analysis, some preliminary decisions had to be made. First, the issue of “not applicable” responses in the CWL had to be resolved. As noted earlier, parents had the option to select this response in case a given situation did not occur in daily life. We decided to treat these “not applicable” responses as missing values. As we used robust maximum likelihood estimation for our explorative factor analysis, all cases with missing values on some of the items could be retained in the analysis. Meanwhile, items that generated a relatively high (>10%) percentage of “not applicable” responses were removed from the analysis. The threshold value of 10% was chosen because until this percentage is reached, most of the traditional methods for dealing with missing values produce fairly similar results (e.g., see Barzi & Woodward, 2004). Based on this criterion, three items were removed.
Second, we screened all item distributions to identify items that were excessively skewed. Items for which a score of 4 or 5 was given in 95% of the cases in one sample and at least 90% of the cases in the other were excluded from the analyses, because it can be assumed that these items had little discriminatory power. There were six items showing such extreme distributions in both samples, which were, therefore, excluded. Furthermore, we noticed a number of items that had conspicuously low mean scores, indicating that the pertinent behaviors were hardly practiced by the parents. One would expect that the mean scores on parenting skills in the community sample would at least be “neutral” (i.e., somewhere around the rating of 3 or higher). Bearing this in mind, we excluded all items with a mean value lower than 2.50 in the community sample (Items 19, 21, 23, 24, 38, and 40). A mean below 2.50 indicates a clear majority of “almost never” or “now and then” responses (with “now and then” reflecting a lower frequency than “sometimes,” which is coded as 3 in the questionnaire). Of course, a relatively low item mean for the general community sample need not by itself invalidate the item as a diagnostic marker, as it is conceivable that the corresponding item mean might still be significantly lower in the clinical sample. However, with one exception (Item 21), the items that were singled out for exclusion on the basis of this first criterion all had higher means in the clinical sample than in the general sample. This indicates that parents from the clinical sample show the desired behavior more often than parents from the general population sample. Therefore, these items cannot be used for the desired diagnostic purpose. Item 21 had a lower mean in the clinical sample, but it did not differ significantly from that in the general population sample. We therefore decided to exclude these six items, because they do not differentiate between general population and clinical population samples. Using these three criteria, 13 items were ultimately removed before conducting the EFA.
EFA
The Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy showed acceptable values for both the community sample (KMO = 0.67) and the clinical sample (KMO = 0.64). The initial attempt to conduct EFA for the community sample failed to reach convergence in the estimation of parameter values. Inspection of the output provided, showed conspicuous estimates of means and covariances involving four items. Although failure to reach a convergent solution may have several causes, the most likely explanation seemed related to low initial communality values for these four items. We therefore decided to remove these items from the analysis, after which no further problems in convergence occurred. The EFA in the community sample was then rerun with the remaining 36 items and also performed for the clinical sample. Based on the Kaiser criterion (eigenvalues > 1), 12 factors were identified for both the community and clinical sample. Obviously, this was not a useful criterion. Inspection of the scree plot suggested a five-, six-, or a seven-factor solution. Inspection of the fit indices for both the community and clinical sample pointed in the direction of either a five- or a six-factor solution for both samples. The sixth factor was difficult to interpret and, therefore, a five-factor solution seemed most appropriate. This resulted in well-interpretable factors with acceptable RMSEA and SRMR values, but rather low TLI values (see Table 2). As the pattern of factor loadings was quite comparable in both samples, we decided to rerun the analysis for both samples combined. The resulting five-factor solution produced good fit values for the RMSEA and the SRMR measures, whereas the TLI improved but was still below the acceptable limit.
Fit Indices of the Five-Factor Solution for the Community, Clinical, Total Sample, and Final Factor Solution Based on the Included Items of the Total Sample.
In the five-factor solution for the total sample, 13 items had nonsignificant factor loadings or factor loadings less than .40 on all factors and were, therefore, not included in the final solution. The final factor loadings for each included item in the community, clinical, and total sample are shown in Table 3. Factor 1 consisted of five items and was named
Factor Loading for Items on the Caregiver Wish List for the Community, Clinical, and Total Sample.
Reliability
The internal consistency coefficients of the five factors were calculated for the community, clinical, and total sample. As can be seen in Table 4, all Cronbach’s alphas were in the .61 to .82 range. When adopting an alpha of .70 as cutoff point, it can be concluded that in the clinical and total sample, four of the five factors displayed good internal consistency. The CWL total score had a good internal consistency in both the community and clinical sample, as well as in the total sample. In the community sample, Cronbach’s alphas were in general somewhat lower: only the factors
Internal Consistencies (Cronbach’s α) of the Extracted Factors of the Community, Clinical, and Total Sample.
Pearson’s product–moment correlations among the five factors were calculated for the two samples separately and for the total sample (Table 5). Overall, the correlations were quite low. Two correlations were significant in all three samples, namely the correlation between
Pearson’s Correlations Among the Five Extracted Factors for the Community, Clinical, and Total Sample.
External Validity
As stated before, the pattern of factor loadings was roughly similar across the two samples, motivating us to reanalyze the data on the basis of the total sample, thus reducing instability in the parameter estimates. Table 6 shows the mean scores and standard deviations on the factors for both the community and clinical sample. The difference between both samples was significant for all factors and the CWL total score (with 23 items included in the factors). On all factors, parents from the community sample had a significantly higher mean than parents from the clinical sample, indicating that the parenting skills of parents in the community sample were better than those of parents in the clinical sample.
Means (
Pearson’s product–moment correlations between the five CWL factors and CBCL Internalizing and Externalizing subscales were calculated. As can be seen in Table 7, all CWL factors and the total score were significantly and negatively correlated with both CBCL Internalizing and Externalizing scales. Relatively low, but still significant, negative correlations were found for the CWL factor
Pearson’s Correlations Between the Five Extracted CWL Factors and CBCL Internalizing and Externalizing Scores and the Results of Tests Comparing the Strength of These Correlations.
Discussion
This study was a first attempt to investigate the psychometric qualities of the CWL, an interview-based scale for measuring parenting behaviors as defined in the SIL model (Forgatch et al., 2004). EFA was performed to examine the factor structure of the CWL using the data of parents from both a community and a clinical sample of elementary school–age children. Furthermore, the internal reliability of the extracted factors was investigated and we examined whether the CWL factors showed theoretically meaningful associations with behavior problems of the parents’ children and whether they were able to differentiate between the community and clinical sample.
The EFA revealed five factors, which were named
The extracted factors are only partially in line with the domains proposed by the developer of the CWL. Hodges (2002) had in mind that the CWL would be composed of six domains of parenting, whereas in the present study only five factors were found. Three of these five factors, namely,
Only 23 of the 53 original CWL items loaded on at least one of the five factors, indicating that the original CWL can probably be reduced in length without losing critical content. Despite the limited number of items included in the five factors, the factors showed adequate internal consistencies, with Cronbach’s alphas ranging between .61 and .77. The CWL total score, based on the items included in the factors, had good internal consistency coefficients in the samples (.71-.82). Only two correlations among CWL factors were consistently found in the community, clinical, and total sample, namely those between
The factor structure of the community and clinical samples was highly similar, and thus, the extracted CWL factor structure seemed to be applicable to parents in both populations. Meanwhile, it was found that the two samples differed in terms of absolute scores on the five extracted CWL factors and the total score in the expected direction: Parents in the community sample exhibited higher scores and hence better parenting skills than parents in the clinical sample. Finally, all factors and the CWL total score were significantly negatively related to internalizing and externalizing behavior of the child: The better the parenting practices of the parents, the lower the behavior problems. The factors
A few limitations of our study have to be noted. First, the majority of the families consisted of two biological Dutch parents with steady employment, so it is unknown to what extent our findings can be generalized to, for example, single parents, stepfamilies, or parents from other ethnic and lower socioeconomic backgrounds. Second, the sample size was quite small for factor analysis when considering the number of items included in the original CWL. The criterion for factor analysis is that there should be at least five cases per item (Hair, Black, Babin, & Anderson, 2009). The CWL contains 53 items, which means that at least 246 participants were needed to meet this criterion. However, before the final factor analysis was run, 17 items needed to be excluded based on various criteria, which means that only 36 items were used in the analysis. Thus, a sample of 180 participants was sufficient. The community sample and the total sample met this criterion, but the clinical sample (
As the CWL in its current form seems unsuitable for research purposes, it would probably be better to rely on another, better-validated self-report measure for assessing parenting skills; for example, the Alabama Parenting Questionnaire (APQ; Essau, Sasagawa, & Frick, 2006), which has been examined in various studies (e.g., Dadds, Maujean, & Fraser, 2003; Escribano, Aniorte, & Orgilés, 2013; Shelton, Frick, & Wootton, 1996; Zlomke, Lamport, Bauman, Garland, & Talbot, 2014). However, the APQ does not measure all the parenting skills defined by the SIL model, whereas the CWL was especially developed to measure those parenting dimensions. Also, for clinical purposes, the CWL would be more suitable than the APQ. In contrast to the APQ, the CWL is an interview and was designed to help therapists gather information on strengths and weaknesses in parenting practices as seen by parents themselves, and to enhance the therapeutic alliance and positive engagement between parents and therapists (Hodges, 2005). For this purpose, an interview is more appropriate than a questionnaire. Therefore, the CWL could be a useful instrument and it may be worthwhile to further examine its psychometric properties to improve this instrument for research purposes. Items could be added to some of the factors to improve their internal consistency. Also, the factor structure found in the present study should be replicated before more definitive conclusions about the instrument’s structural validity can be drawn. Furthermore, it needs to be examined if the extracted factors of the CWL are sensitive to change. In our study evaluating the effectiveness of PMTO, in which the CWL was used as an outcome measure, only the original “discouraging undesirable behavior” domain and the CWL total score had adequate internal consistencies and these scales showed significant change in response to parent training (Thijssen et al., 2017). In addition, future research should explore whether the parenting practices assessed by the CWL predict the development or maintenance of externalizing behavior problems in children. The association between CWL self-reported parenting practices and parenting assessed on the basis of actual parent–child interactions studied in real life or in the lab should also be examined.
In conclusion, findings from our study suggest that the CWL in its current form needs to be amended to render it more suitable for research purposes. The a priori factor structure could not be replicated and several of the extracted factors need additional items to improve the CWL’s psychometric properties. Furthermore, although the CWL was designed to assess parenting skills defined by SIL theory, it failed to assess problem solving as one of the main SIL skills. Notwithstanding, the CWL’s interview format provides opportunities for rapport building between parent and interviewer, and for reflection on parenting skills in terms of strengths and vulnerabilities.
Footnotes
Authors’ Note
Jill Thijssen is now at Social and Behavioral Sciences, Department of Developmental Psychology, Tilburg University.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by The Netherlands Organization for Health Research and Development (ZonMw 157001022), Innovatiefonds Zorgverzekeraars (11-038), VSB fonds (20052185), and Kinderpostzegels Nederland (25937).
