Abstract
Moral emotions are experienced in daily life and are crucial for mediating appropriate social behaviors, as they prevent individuals from committing transgressions. In this study, caregivers of 377 children aged between 2.5 and 6.5 years old completed the Moral Emotions Questionnaire (MEQ), a parent report aimed to separately identify the presence of shame, guilt, and pride behaviors in early childhood. To validate this newly developed questionnaire, a confirmatory factor analysis and measurement invariance were conducted, and internal consistency, and concurrent validity were tested. Outcomes confirmed that the three moral emotions can be individually identified through the MEQ, even at such an early age. The MEQ scales showed acceptable internal consistencies and the associations between the three moral emotions and externalizing behaviors, internalizing behaviors, and social competence were in accordance with previous research, therefore confirming concurrent validity.
Keywords
Introduction
Moral emotions arise when people feel judged, or expect to be judged by others, or judge their own behavior in terms of right and wrong. Within the spectrum of moral emotions, shame, guilt, and pride, are considered self-conscious emotions. Self-conscious emotions are those which require a self-evaluative process, in which the individual continuously evolves according to the social norms (Sznycer, 2019; Tracy & Robins, 2004a). The experience—or even just the anticipation—of these emotions serves as a behavior regulation mechanism, urging people to abide by the rules and to avoid committing moral transgressions (Blasi, 1999; Tangney et al., 2007). Moral emotions also serve a communicative function: when people express moral emotions, they openly acknowledge that their behavior was incongruent (as in guilt or in shame) or congruent (as in pride) with the group’s rules, norms, and values, and that they take responsibility for their behavior. This helps them to be accepted—or reaccepted—as a valuable group member (Shariff & Tracy, 2011; Stearns & Parrott, 2012).
Shame, guilt, and pride have been appointed as the more focal self-conscious emotions, their development and relation with other aspects of psychosocial functioning have long been studied (Sznycer, 2019; Tracy & Robins, 2004a). Lower levels of moral emotions have been associated with externalizing behaviors (e.g., bullying, aggression, delinquency, psychopathy) in early adolescence and adulthood (Blair et al., 2001; Holmqvist, 2008; Menesini & Camodeca, 2008; Olthof, 2012). These findings point to the importance of examining the development of moral emotions at the youngest possible age.
Moral understanding and morally guided behaviors can be observed early in life. For example, 2-year-old children are able to stop themselves from doing something that is prohibited, and then also hesitate after disobeying a rule (Kochanska & Aksan, 2006). Studies have confirmed that children as young as 2 or 3 years old already show behaviors that suggest the experience of moral emotions, such as signs of distress, avoiding eye contact, confessions, and/or reparative behaviors after a transgression (Barrett et al., 1993; Kochanska et al., 1995), as well as behaviors that suggest an attempt to attract attention when they achieved something on their own (Stipek et al., 1992). This is in line with the development of self-concept, which has been suggested to emerge around the 30 months of age (Bullock & Lutkenhaus, 1990). The development of moral emotions accompanies self-concept development, because to be able to evaluate whether their actions are congruent or incongruent with societal norms and values, children first need to be able to distinguish themselves from others and then focus the attention on their individual actions (Hart & Matsuba, 2007). This learning process, which seems to be a major importance on the first 4 years of life, occurs through the daily interactions that children have (Dahl & Killen, 2018a, 2018b). Since they are born, children experience and observe behaviors of aid and harm from their families toward them, and also between other people (Dahl, 2015; Hammond et al., 2017). These experiences help them shape their knowledge about right and wrong, and together with a constant development of their self-concept, allow them to guide their behaviors toward others (Dahl & Killen, 2018a, 2018b). To the best of our knowledge, no instrument is yet available that can facilitate the systematic study of the different moral emotions in the preschool years, when children first show signs of these emotions. Therefore, our aim was to develop and validate a questionnaire that can uniquely identify different moral emotions (shame, guilt, and pride) in early childhood.
Distinguishing Moral Emotions
Different moral emotions can be distinguished according to their associated social goals and behaviors. Shame and guilt could both arise from the same antecedent, namely, when a transgression is committed. In fact, the same transgression can evoke shame in one person and guilt in another (Lewis, 2014; Tangney et al., 2007). Shame arises when a transgressor thinks that the harm is irreparable, as it reflects a failure of the whole self, and/or that others will attribute to him or her a negative, unwanted identity (Barrett et al., 1993; Olthof, 2012). Shame can be observed through submissive behaviors (e.g., making oneself appear smaller, avoiding eye contact) and the avoidance of others’ attention by the transgressor. Although the action tendencies of trying to avoid the attention after children commit transgressions or cause harm to others are reported to already be exhibited by 2-year-old children (Barrett et al., 1993), research suggests that shame emerges after age 3, when children become more aware of themselves, and their behaviors in relation to social norms and rules (Kagan, 2005; Lewis, 1998). Shame prevents the individual from acting against social norms, thereby preventing negative judgment from others (Barrett, 1998a; Bedford & Hwang, 2003; Cole et al., 2006; Dempsey, 2017; Fung, 1999; Midlarsky et al., 2006; Yoshioka & Choi, 2005). Previous studies have shown that in older children, adolescents, and adults, excessive shame often leads to increased levels of anger and to persistent feelings of inferiority or failure, which can contribute to anxiety and depression (Bennett et al., 2005; Broekhof et al., 2018; Harper & Arias, 2004; Tangney et al., 2007).
In contrast, guilt arises if the transgressor has a sense of responsibility over the transgression. In this case, the damage only reflects his or her behavior in this specific situation and not in relation to the whole self (Barrett et al., 1993; Ferguson et al., 2000; Tracy & Robins, 2004a). Guilt elicits a need to repair harm done and restore the relationship. Thus, guilt serves as a prosocial behavior motivator (Dempsey, 2017). Guilt behaviors are usually aimed at drawing attention to the wrongdoing and trying to make up for it, as in confessing, making apologies, or trying to repair the damage (Tangney et al., 2007). A study conducted by Zahn-Waxler et al. (1992) has shown that action tendencies related to guilt start to manifest in children as young as 15 months of age, with children showing intent to repair the harm they caused to others. However, most studies report that these reparative behaviors became more prominent after 2 years of age (Barrett, 1998b; Cole et al., 1992; Kochanska et al., 1994, 2002), suggesting that guilt is developed throughout the second year of life. Previous studies have shown that children who more frequently experience guilt are more attentive and competent when dealing with others, show more prosocial behaviors, and have better quality friendships and relationships (Baumeister et al., 1994; Estrada-hollenbeck & Heatherton, 1995; Kochanska & Aksan, 2006). Conversely, children who show fewer guilt behaviors tend to exhibit more disruptive behaviors and show more aggression and conduct problems, which may negatively affect their social relationships (Frick & Morris, 2004; Kochanska & Aksan, 2006).
Pride arises when one feels responsible for accomplishing something that exceeds the expectations of others in a positive way, while also feeling internally positive about oneself (Orth et al., 2010; Tracy & Robins, 2004b, 2007). Pride may play an important role when establishing and maintaining social interactions with peers, as it not only promotes the value of one person in the eyes of others (Mauro et al., 1992), but is also internally rewarding to the individual. As such, it motivates one to act according to social norms to feel valued by others (Cheng et al., 2010; Sznycer et al., 2017, 2018; Williams & DeSteno, 2008). The behavioral components of pride (e.g., head held high, making eye contact) are aimed at attracting attention from significant others. Still, in this case, the attention is focused on oneself or one’s accomplishment (Tracy & Robins, 2004b). Children, who are 2 years of age, already manifest action tendencies related to pride, like calling for attention after achieving something by themselves (Hart & Matsuba, 2007; Stipek et al., 1992), which continuously developed throughout childhood. Previous studies have shown that pride increases children’s confidence in initiating interactions and resolving conflicts (Cheng et al., 2010; Mauro et al., 1992; Sznycer et al., 2017, 2018; Williams & DeSteno, 2008).
Assessment of Moral Emotions in Early Childhood
To date, not many instruments for measuring moral emotions in early childhood are available, despite the crucial role of moral emotions in the development of psychosocial functioning. Observational studies that measure responses to emotion-evoking events are the most commonly used method for assessing moral emotions in young children. In these studies, children are instructed to complete tasks that are either designed to allow them to succeed or to set them up to fail (Belsky et al., 1997; Kelley et al., 2000; Ketelaar et al., 2015). Observing children in staged situations or in field observations is very informative, but can be time consuming. Questionnaires allow researchers to obtain information from a large number of participants in a short amount of time. Moreover, given that young children spend a lot of time in the vicinity of their parents, parents can be a valuable source of information about their child’s moral behaviors across a variety of settings and situations with different people.
To the best of our knowledge, the only questionnaire to date that focuses on young children’s moral emotions is “The Conscience Measure Questionnaire” (also referred to as the “My Child” measure) by Kochanska et al. (1994). The “My Child” questionnaire is a parental report on children’s awareness of wrongdoing and their willingness to stop or repair an incorrect behavior (Kochanska et al., 1994). However, the “My Child” questionnaire does not capture pride, and seems mostly oriented toward guilt, not shame. Therefore, we aimed to develop a new parent report questionnaire addressing the three major moral emotions (i.e., guilt, shame, pride) in young children, while focusing on behavioral indices related to each emotion separately.
Current Study
The development of moral emotions is crucial for psychosocial development, with shame, guilt, and pride being related to distinct outcomes regarding externalizing and internalizing symptoms and social competence. However, no instruments that distinguish between these three moral emotions for children at their emergent age have been developed. Therefore, this study aimed to establish the factor structure, internal consistency, and validity of a newly developed parent report questionnaire for assessing shame, guilt, and pride in young children: the Moral Emotions Questionnaire (MEQ).
First, we examined whether the hypothesized three-factor structure of the MEQ was confirmed (shame, guilt, and pride) by conducting a confirmatory factor analysis (CFA). Second, we examined the internal consistencies of the individual scales from the MEQ. The third aim of this study was to establish the concurrent validity of the MEQ by examining relations with other aspects of social–emotional functioning (i.e., internalizing behaviors, externalizing behaviors, and social competence). Finally, measurement invariance was assessed across gender, and two age groups (under 4 years of age and 4 or older) to verify whether the responses of caregivers to the items were equivalent considering those factors.
In this questionnaire, shame is operationalized as the tendency to show submissive behavior or to withdraw or escape from a situation. Based on the literature, we hypothesized that higher levels of shame would be associated with higher levels of internalizing and externalizing problems (Ferguson et al., 1999; Tangney et al., 1992, 2014; Thomaes et al., 2011). Guilt was operationalized as an other-oriented, adaptive emotion. This emotion is associated with the urge to confess, apologize for, or repair a wrongdoing as a sign of remorse. Therefore, guilt was expected to be positively related to social competence and negatively related to externalizing problems (Ferguson et al., 1999; Roos et al., 2014; Stuewig et al., 2010; Tangney et al., 1996). Pride is operationalized as a tendency to draw the attention of others toward an individual accomplishment, and exhibit enjoyment in regards to others’ appraisals of oneself. Based on literature, behavioral indices for pride were expected to be associated with higher levels of social competence (Hooge et al., 2011; Kluwin et al., 2002; Mascolo & Fischer, 1995).
Method
Participants and Procedure
Caregivers of a total of 377 children aged between 2.5 and 6.5 years old (mean age = 54 months, SD = 13 months; 55% of boys) participated in the study. Mothers completed the questionnaire for 301 children; fathers completed the questionnaire for 37 children; for 11 children, both parents completed the questionnaires together; and for another 11 children, the questionnaire was completed by other caregivers. Mothers aged between 25 and 47 years old (mean age = 37 years, SD = 4 years), while fathers aged between 24 and 61 years old (mean age = 39, SD = 6 years). Concerning the socioeconomic status of the participating families, 22% did not report this information. From the families who replied the majority belonged to the middle level of socioeconomic status (44%), followed by families in the high level of economic status (25%), and finally by families with low economic status (9%). The participants were recruited via daycare centers, preschools, and elementary schools in the Netherlands. Children were excluded from the study if they had any apparent developmental delays or mental health disorders, such as attention-deficit hyperactivity disorder (ADHD) or autism spectrum disorders. Parents were informed about the goals and execution of the study, how data were to be handled and stored to guarantee their privacy, and about the voluntary nature of their participation. All parents provided written consent to participate in the study. Parents filled in the questionnaires either on paper or via a website. Approval for the study was obtained from the ethics committee of the Medical Ethics Committee of the Leiden University Medical Center (ethics approval number: P08.140/SH/sh).
Outcome Variables and Materials
Moral Emotions Questionnaire
The MEQ (Table 1) aims to assess behavioral responses associated with three distinct moral emotions: shame (eight items), guilt (eight items), and pride (nine items). This initial 25-item version of the MEQ was developed by a team of developmental psychologists and psychology students, some of whom had children within the intended age range, in a multi-step procedure.
Items of the MEQ.
Note. MEQ: Moral Emotions Questionnaire. (R) denotes a reverse scored item.
In Step 1, each member of the team formulated items to measure guilt, shame, and/or pride, based on their experience with young children, their knowledge of the literature on moral emotions, and their experience with conducting observational studies on moral emotions.
In Step 2, the combined list of items was discussed in the team, and inappropriate items were deleted (e.g., only suitable for older children, not involving observable behavior, or overlapping content with other items) or revised (e.g., when an item was not specific enough). At this stage, 25 items passed this selection.
In Step 3, parents of 106 children were asked to fill out the first version of the MEQ consisting of 25 items, and additional questionnaires. Parents rated the degree to which each item represented their child’s behavior in the last 2 months on a 3-point scale (0 = never, 1 = sometimes, and 2 = often). For these parents, an extra response category for “not applicable” (NA) was available. Parents were instructed to choose this option only when their child had not been in that particular situation in the past 2 months. Frequencies of the response categories based on these first 106 participants showed that, for any of the items, no more than 25% of the parents had selected the option NA, and for 21 out of 25 items, this option was selected by less than 10% of parents.
In Step 4, based on the low use of the category NA, this response category was now removed from the questionnaire. No items were revised or removed at this stage. The questionnaire was now administered to another 271 parents, who were instructed to answer all items by selecting one response from the following options: “never,” “sometimes,” or “often.”
In Step 5, data from Steps 3 (n = 106) and 4 (n = 271) were analyzed collectively to examine the psychometric properties of the MEQ (variance, factor structure, internal consistency, and inter-item correlation). NA answers were analyzed in two ways: recoded as 0 and recoded as missing values. Both methods of data analyses showed similar outcomes. Therefore, in the final data analyses, NA answers from the first sample of 106 parents were recoded into 0 because, most likely, parents who did not have the NA option would have opted for “never” when their child had not been in that particular situation.
Internalizing and Externalizing Behaviors
To obtain information about the prevalence of internalizing and externalizing behaviors, the Early Childhood Inventory-4 (ECI-4; Sprafkin et al., 2002) parent checklist was used. This checklist measures the behavioral symptoms of the most prevalent disorders in the Diagnostic and Statistical Manual of Mental Disorders (4th ed.; DSM-IV) among preschool children. The ECI-4 parent checklist contains 108 items that screen for 15 emotional and behavioral disorders. Parents rated the extent to which their child showed each behavior on a 4-point scale (0 = never, 1 = sometimes, 2 = often, and 3 = very often).
For the scale for internalizing behaviors (25 items), we combined items that belonged to the following scales: major depressive disorder (10 items), separation anxiety (8 items), social phobia (3 items), and generalized anxiety (4 items).
For the scale for externalizing behaviors (18 items), we combined items belonging to the oppositional defiant disorder (8 items) and conduct disorder (10 items) scales.
The psychometric properties of these scales are shown in Table 2. The internalizing and externalizing behavior scales showed good reliability with Cronbach’s alpha values of .77 and .80, respectively. The inter-item correlation of the internalizing behaviors scale (.13) was lower than desired, but close to acceptable values, and the value for the externalizing behaviors scale was acceptable (.18).
Internal Consistencies of the Indices for Internalizing Behaviors, Externalizing Behaviors, and Social Competence.
Note. SD: standard deviation; CI: confidence interval. aFour-point scale (0 = never, 1 = sometimes, 2 = often, and 3 = very often); b3-point scale (0 = not true, 1 = somewhat true, and 2 = certainly true).
Social Competence
To obtain information about social competence, we followed the procedure of Veiga et al. (2017) and used the prosocial behaviors scale, and positive items from the peer problems scale in the Strengths and Difficulties Questionnaire (SDQ; Goodman, 1997; Muri et al., 2003). The SDQ was administered to parents, who were asked about their child’s peer relationships (two items; i.e., “Has at least one good friend,” “Generally liked by other children”); and prosocial behavior (five items). Parents rated on a 3-point scale (0 = not true, 1 = somewhat true, and 2 = certainly true), the degree to which each item represented their child’s behavior in the last 3 months. As reported in Table 2, this scale showed acceptable reliability with Cronbach’s alpha value of .64, the inter-item correlation value was also acceptable (.20).
Statistical Analyses
First, to assess construct validity, we conducted a CFA, where the proposed 25-item measurement model (Table 1) was tested. Due to non-normality of the data, the CFA was performed with maximum likelihood estimation with Satorra & Bentler’s (1994) correction, to safeguard against deviations. Goodness of fit of the model was evaluated using the root mean square error of approximation (RMSEA < .06; Hu & Bentler, 1999), the comparative fit index (CFI > .90; Bollen, 1989), and the standardized root mean square residual (SRMR < .08; Little, 2013). Furthermore, the value for the Akaike information criterion (AIC) was also used for comparison between the models. The AIC does not have an absolute norm. When comparing between models, the lower AIC value indicated the better fit (Pho et al., 2019).
To evaluate whether measurement properties of the MEQ were invariant across genders and age groups (under 4 years of age and 4 or older), a multigroup CFA was conducted. Following the procedure suggested by Milfont and Fischer (2010), testing three levels of measurement invariance sequentially: configural, metric, and scalar. Configural invariance is meant to confirm whether the model in equivalent for the groups that are being compared. In this step, the model structure in both genders and age groups was analyzed without any constraints. Metric invariance is meant to confirm whether the meaning of the items of the scale is similar for the groups being compared. In this step, the model structure in both genders and age groups was analyzed constraining all the factor loadings. In the case that metric invariance was not met, partial invariance was analyzed, after freeing the invariant items (Byrne et al., 1989). If metric (partial) invariance is met, scalar invariance can be tested, which allows to confirm that members of each group being compared are rated similarly when using the scale. In this step, the model structure in both genders and age groups was analyzed constraining the item intercepts. To test the metric and scalar invariance, three model fit indices variations were considered: the decrease of the CFI value should not be more than .01 in comparison to the previous model; the variation of the RMSEA between models should be less than .015; and the variation of the SRMR between models should be less than .030 (Chen, 2007; Cheung & Rensvold, 2002). Items to be freed for partial invariance analyses were chosen based on their univariate modification indices, and also on the Lagrange multiplier test. This test shows the effect of releasing an equality constraint simultaneously between groups (Martín-Puga et al., 2020; Rosseel, 2012). Gender and age group differences were assessed if at least 50% of the items comprising a given factor were invariant (Steenkamp & Baumgartner, 1998; Vandenberg & Lance, 2000). As significant correlations were found between age and the scale guilt (r = .18; p < .05) and also with the scale pride (r = .15; p < .05), further analyses were conducted with partial correlations corrected for age.
Second, considering the ordered categorical nature of the items, we assessed the internal consistencies of the obtained MEQ scales using McDonald’s omega and inter-item correlations (Crutzen & Peters, 2017). Third, Pearson’s correlations (with Bonferroni correction for multiple comparisons) with the internalizing and externalizing behaviors of the ECI-4, and the social competence scale of the SDQ, were conducted to examine concurrent validity. Prior inspection to the scatterplots of the associations between the three moral emotions and the concurrent measures, indicated only linear trends, therefore, only linear associations were studied. The CFA in this study was conducted using R’s (version 4.0.2) lavaan package (version 0.6-6; Rosseel, 2012); and the lavTestScore function of the same package was used to conduct the Lagrange multiplier test. All the other statistical analyses were performed with the IBM SPSS (version 21).
Results
Confirmatory Factor Analysis
Items on the MEQ that were negatively formulated (see Table 1) were reverse coded, so that, higher scores represented more expression of the moral emotion. The original 25-item MEQ (Table 1) was fitted with the hypothesized three-factor structure and yielded a poor fit (Model 1 in Table 3). To improve model fit, factor loadings and modification indices were analyzed. That is, items with low factor loadings (< .30), and high modification indices (> 10) were further analyzed in terms of their content. The content of each item was considered before exclusion, so that, any deletion was not solely based on statistical outcomes, but also on the theoretical appropriateness of the item on the intended factor. Items who showed conceptual overlap with another scale or concept were removed. This procedure resulted in the deletion of six items (7, 20, 12, 19r, 5, and 16) and resulted in Model 2. Although this model showed an improved fit, two more items were removed resulting in Model 3 (Figure 1): Item 10 was excluded due to high loading on the non-intended factor; Item 24 was removed due to theoretical overlap with Item 13. The values for goodness of fit for the final model (Model 3) showed appropriate levels. The SRMR and the RMSEA were below the desired cut-off points. The AIC of Model 3 was the lowest, therefore indicating a better fit. Although the CFI did not reach the cut-off point of > .90, our CFI value (.871) was within the acceptable range, given that the RMSEA of the null model value for Model 3 was .119 (Veiga et al., 2019). The CFI is a comparative index that ranges from 0 to 1, in which the proposed model is compared to a null model wherein all measured variables are uncorrelated. The null model should have a poor fit, with a very large chi square (Ching et al., 2014). Kenny et al. (2014) have suggested that if the value of the RMSEA of the null model is < .158, the CFI is less informative. Thus, a threshold of > .90 could be too strict for our model. Correlations between error terms of items were analyzed. No significant correlations were found, indicating that there was no overlap between items from different latent variables.
Results of the Confirmatory Factor Analysis.
Note. RMSEA: root mean square error of approximation; CFI: comparative fit index; SRMR: standardized root mean square residual; AIC: Akaike information criterion. N = 377. *p < .001.

CFA of the Three-Factor Structure of the MEQ.
Measurement Invariance Across Gender
To test the invariance of the model across gender, a multigroup CFA was performed to the final model (Table 4). First, the fit statistics obtained from the configural (baseline) model showed an acceptable fit, χ2 (232, N = 306) = 296.162, p < .01; CFI = .891; RMSEA = .040; SRMR = .065. In the next step, the testing of the metric invariance showed a significant change in the model fit (ΔCFI = −.020; ΔRMSEA = .003; ΔSRMR = .007), indicating that the model was variant across genders, and metric invariance could not be assumed. Partial metric invariance was then tested by freeing the constraint on Item 11, and a nonsignificant change in the model fit was obtained, as compared to the configural model (ΔCFI = .005; ΔRMSEA = −.002; ΔSRMR = .004). This indicates that except for Item 11, the factor loadings were invariant across genders, and partial metric invariance could be assumed. Next, the testing of the scalar invariance showed a significant change in the model fit (ΔCFI = −.052; ΔRMSEA = .008; ΔSRMR = .004). After freeing the equality constraints on the intercepts of Items 15 and 18, partial scalar invariance could be achieved (ΔCFI = .002; ΔRMSEA = −.001; ΔSRMR = .001).
Fit Indices for Measurement Invariance Models Across Gender and Age Group (< 4 yo; ⩾ 4 yo).
Note. CFI: comparative fit index; RMSEA: root mean square error of approximation; CI: confidence interval; SRMR: standardized root mean square residual. N = 377. *p < .01.
Equality constraint on the factor loading of Item 11 was freed from the model.
Equality constraints on the intercepts of Items 11, 15, and 18 were freed from the model.
Equality constraints on the intercepts of Items 8, 9, 14, 17, and 23 were freed from the model.
Because partial scalar invariance could be assumed, and more than 50% of the items for each factor were invariant, the means of the two gender groups can be compared for the three moral emotions. As shown in Table 6, caregivers of boys acknowledged their children to show less guilt, t(375) = −2.75, p = .006, and pride, t(373) = −4.17, p < .001, compared to girls. An additional inspection on the latent means showed that boys and girls differed in pride, E(girls–boys) = .074 (Var(girls–boys) = 0.022); p = .001, but not in guilt and shame.
Measurement Invariance Across Age Groups
To test the invariance of the model across age groups, a multigroup CFA was performed to the final model (Table 4), with one group comprising participant younger than 4 years of age, and the other group comprising participants who were 4 years or older. First, the fit statistics obtained from the configural (baseline) model showed an acceptable fit, χ2 (264, N = 306) = 316.517, p < .001; CFI = .869; RMSEA = .046; SRMR = .065. In the next step, the testing of the metric invariance showed a nonsignificant change in the model fit (ΔCFI = .005; ΔRMSEA = −.002; ΔSRMR = .003), indicating that the model was invariant across the two age groups, and metric invariance could be assumed. Next, the testing of the scalar invariance showed a significant change in the model fit (ΔCFI =−.121; ΔRMSEA = .016; ΔSRMR = .008). Further analyses indicated that the equality constraints on the intercepts of Items 8, 9, 14, 17, and 23 should be freed, and partial scalar invariance could be achieved afterwards (ΔCFI = −.008; ΔRMSEA = .001; ΔSRMR = .001).
The assumption of partial scalar invariance (with at least 50% of the items for each factor were invariant) allows the means of the two age groups to be compared for the three moral emotions. As shown in Table 6, caregivers of the children younger than 4 years of age acknowledged their children to show less guilt, t(375) = −2.77, p = .006, and pride, t(220) = −2.83, p = .005, compared to children who are 4 years or older. An additional inspection on the latent means also showed that children younger than 4 years of age and children who were 4 years or older, differed in guilt, E(⩾ 4 yo – < 4 yo) = .153 (Var(⩾ 4 yo – < 4yo) = .046); p = .001, and pride, E(⩾ 4 yo – < 4 yo) = .040 (Var(⩾ 4 yo – < 4 yo) = .020); p = .043, but not in shame.
Reliability
Table 5 shows partial correlations corrected for age between the MEQ scales. Guilt was positively associated with pride, yet not to a degree that suggests collinearity. No other significant correlations between the MEQ scales were found.
Correlations Between Indices for Moral Emotions (Corrected for Age).
Note. MEQ: Moral Emotions Questionnaire. N = 377. *p < .05; **p < .006 (after Bonferroni correction).
McDonald’s omega and inter-item correlation coefficients for the three MEQ scales are reported in Table 6. The outcomes show that the internal consistency per scale is acceptable for shame (.62), guilt (.62), and pride (.63).
Mean Total Mean by Age Category, and Internal Consistencies of the MEQ Scales.
Note. Ω: McDonald’s omega; IIC: inter-item correlation; CI: confidence interval; d = Cohen’s d. The MEQ was scored on a 3-point scale (0 = never, 1 = sometimes, and 2 = often). Total N = 377 (n = 127 for < 4 years; n = 250 for ⩾ 4 years; n = 208 for boys; n = 169 for girls). *p < .05. Age category and gender comparisons showed that for both groups differences were found for the guilt and pride scales.
Concurrent Validity
As shown in Table 7, shame was positively associated with externalizing and internalizing behaviors. Guilt was negatively associated with externalizing behaviors and positively associated with social competence. Pride was positively associated with social competence. No other significant correlations were observed.
Correlations of MEQ Scales With Indices for Internalizing and Externalizing Problems, and With Social Competence (Corrected for Age).
Note. N = 205. *p < .006 (after Bonferroni correction).
Discussion
The outcomes of this study suggest that the three moral emotions, that is, guilt, shame, and pride, can be identified separately in early childhood through the MEQ. The originally proposed 25-item model was not confirmed. However, after extracting eight items due to theoretical–statistical reasons (e.g., low factor loadings; high loading on the non-intended factor; theoretical appropriateness of the item; overlap with other 17-item model based on the hypothesized three-factor structure, with a satisfactory goodness of fit). Although eight items were deleted, the final 17-item model still represents the intended constructs, considering that besides the statistical results, the appropriateness of each item in its intended scale was considered in each step. This resulted in the deletion of items from each scale that referred to more general behaviors, and therefore lacked an action tendency related to its intended construct (e.g., in shame—“My child is afraid of making mistakes,” “My child gets upset when he/she has done something wrong”). Furthermore, only items that clearly reflected the action tendencies of its intended scale were kept. For example, in our questionnaire, shame was operationalized as the tendency to show submissive behavior or to withdraw or escape from a situation. Therefore, looking at the remaining four items from the final model, we see that they reflect these action tendencies (e.g., “My child hides when he/she has done something wrong,” “My child quickly walks away when he/she has done something he/she is not allowed to do”).
Measurement invariance analysis across gender showed that the factor loading of one item from pride differed across gender. Furthermore, the intercepts of three items (one item from guilt and two items from pride) were variant across gender. While previous studies have shown that female participants tend to report significantly more action tendencies of shame, guilt, and pride than male participants (Beißert & Hasselhorn, 2016; Else-quest et al., 2012; Etxebarria et al., 2019; Kushnir et al., 2016), our results seem to further show that the action tendencies of pride could be different in girls and boys. As for measurement invariance across age groups (< 4 or ⩾ 4 years), the analysis showed that all the items have similar meanings for caregivers. Yet, intercept invariance across age groups was only achieved after releasing the equality constraints on five items (one item from guilt, two items from shame, and two items from pride). This indicates that the younger group in this study was at the early stages of developing moral emotions, and therefore was expected to show less action tendencies related to shame, guilt, and pride, compared to the older age group. Although only partial invariance was achieved, the proportion of invariant items on each factor was above the required level (i.e., ⩾ 50%). Therefore the constructs can be considered as equally calibrated across groups, and group means could be compared (Steenkamp & Baumgartner, 1998; Vandenberg & Lance, 2000). Although acceptable, all three scales showed low internal consistencies. However, the concurrent validity of these three scales was further confirmed by the relationships of the three moral emotions with externalizing behaviors, internalizing behaviors, and social competence. These correlations conformed exactly with our hypotheses, and were in line with the literature on this topic (Ferguson et al., 1999; Hooge et al., 2011; Kluwin et al., 2002; Mascolo & Fischer, 1995; Roos et al., 2014; Stuewig et al., 2010; Tangney et al., 1996; Thomaes et al., 2011).
As mentioned earlier, previous studies focusing on young children have not yet distinguished between the three moral emotions included in our newly developed questionnaire. Not only did this study show that shame, guilt, and pride can be distinctly observed in preschool children; it also showed that these emotions turned out to have distinct relationships with other domains of social–emotional functioning.
First, relations found in our results confirm previous studies that characterize guilt as an adaptive emotion (Baumeister et al., 1994; Broekhof et al., 2018; Estrada-hollenbeck & Heatherton, 1995; Frick & Morris, 2004; Kochanska & Aksan, 2006). Parents reported that children who expressed more guilt behaviors after a transgression showed fewer externalizing behaviors (e.g., aggression or rule-breaking) and higher levels of social competence. These findings confirm outcomes from previous studies that were focused on older children and adolescents (Baumeister et al., 1994; Broekhof et al., 2018; Estrada-hollenbeck & Heatherton, 1995; Frick & Morris, 2004; Kochanska & Aksan, 2006).
Second, we found that pride, much like guilt, also serves an adaptive purpose. In line with other studies (Hooge et al., 2011; Kluwin et al., 2002; Mascolo & Fischer, 1995), our results showed that pride was related to better social competence. This suggests that pride allows children to feel confident enough to interact with peers, and that showing others that you are a valuable asset to the group indeed helps you to be evaluated positively by others.
Third, shame seems to serve a different function, in line with previous studies, our study portrays shame as a maladaptive emotion; higher levels of shame behaviors were related to more internalizing and externalizing behaviors (Tangney et al., 1992; Thomaes et al., 2011). The social context and content of transgressions in this study may help explain these outcomes, for example, the shame items in the questionnaire all involved wrongdoing, for example, breaking something or disobeying a rule. In those cases, children were commonly expected to make eye contact and admit their transgression, thus showing their guilt. Avoidant behaviors were usually perceived as trying to avoid the blame or punishment (Barrett et al., 1993; Estrada-hollenbeck & Heatherton, 1995; Stuewig et al., 2010), which might explain the maladaptive function of shame behaviors in those contexts. Yet an unwanted identity (a core feature of shame) does not necessarily involve harm or wrongdoing, as formulated in the items in the questionnaire. Instead, shame can also arise from walking around with a bad haircut or accidentally tripping over a carpet, or other clumsy behaviors. These shame-only occasions with no guilt involved, where no explicit harm was done to another, were not included in this questionnaire. However, including such items could provide a more adaptive function of shame. This other context for shame might be a valuable addition in future studies.
This study did have some limitations that need to be highlighted. First, parents were our only informants in regard to their children’s manifestations of morally guided behavior and psychosocial functioning. Second, future studies should compare these parent reports with reports from other informants (e.g., preschool teachers) and field observations, to further confirm the validity of the MEQ. Third, cross-cultural studies with the MEQ could be informative, because the literature on these emotions often discusses the different functions of moral emotions in Western, individualistic-oriented cultures versus Eastern, collectivistic-oriented cultures (Bedford & Hwang, 2003; Cole et al., 2006; Fung, 1999; Midlarsky et al., 2006; Yoshioka & Choi, 2005). For individuals from Eastern cultures, the collective harmony (group) is more important than individual independence. In these societies, failing to achieve the group demands causes the individual to feel as a failure. This sense of failure is extended to their families, which even puts a higher pressure on the individual to behave well within the norms and values of that group (Bedford & Hwang, 2003). Shame, in collectivistic-oriented cultures, seems to have a self-regulative function, preventing individuals from acting against social norms (Bedford & Hwang, 2003; Cole et al., 2006; Fung, 1999; Midlarsky et al., 2006; Yoshioka & Choi, 2005). Guilt, seems to have the same functions cross-culturally (Bedford & Hwang, 2003; Merolla et al., 2013). As for pride, previous studies have shown that in collectivistic cultures it is emphasized that a person should feel pride in situations that positively contribute to others (e.g., helping someone else; Stipek, 1998; Stipek et al., 1989). Also, it is reported that Eastern individuals do not emphasize the value of self (Heine et al., 1999), and therefore individual pride, is less prominent and even discouraged in these societies. Therefore, cross-cultural studies with the MEQ are needed as the results obtained may only hold true in Western populations. Fourth, the cross-sectional nature of this study prevents us from drawing conclusions about the directionality of the relationships we found between the three moral emotions and externalizing behaviors, internalizing behaviors, and social competence. We assume that it is the anticipation of the moral emotions that has an effect on the way children behave toward others, and not the other way around. Yet, this must be confirmed in longitudinal studies. Fifth, future longitudinal studies should endeavor to identify which factors underlie the development of moral emotions. Insight into which factors support the development of moral emotions will improve our understanding of how to develop effective preemptive interventions to stimulate the development of these crucial emotions. Sixth, future studies should also look into the relation between the MEQ scales and observational data, to further contribute for its validity. Finally, for this study, the participants were evaluated in a single time point. Future studies should assess test re-test reliability, not only to improve internal validity, but also to understand the stability of this measure over time.
In sum, the MEQ appears to be a promising and reliable instrument for evaluating the extent to which young children experience and display distinct moral emotions, through the parents’ perspective. The MEQ does offer some important advantages: first, the MEQ is an easy-to-administer instrument. It is far less time consuming and more cost-effective than observational methods. Second, it provides an ecologically valid way to assess moral emotions in early childhood, as parents are able to report on their children’s moral behavior over time and across situations. Furthermore, the MEQ gives us the opportunity to examine new research questions, including those that concern early impairments in moral development and their underlying causes, or how moral development further affects other social–emotional functioning domains. Answers to these questions could help professionals understand the basis and consequences of possible impairments in moral development, and create strategies that promote children’s moral, emotional, and social development.
Footnotes
Acknowledgements
The authors thank the parents and children who participated in this study and the students who helped collect the data. They also thank Shannon Yuen for her English language editing and review.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was financially supported by the Fundação para a Ciência e Tecnologia (FCT), Portugal (grant number: SFRH/BD/144768/2019), and the Care for the Young: Innovation and Development program by ZonMw (grant number: 80-82430-98-8025).
