Abstract
Pragmatics refers to the ability to effectively use and interpret language in different contexts. Pragmatic abilities develop and refine through childhood, and they are essential for socialization, academic achievement and wellbeing. The scarcity of assessment tools in this field makes it challenging to provide a comprehensive assessment of pragmatic abilities. The Pragma test, originally developed for Finnish and also adapted into Italian, consists of a battery of tasks assessing children’s pragmatic abilities. In this study, a first evaluation of the psychometric qualities of a Norwegian adaptation of this test is presented. In addition, we investigated pragmatic development between ages 5 and 7, and explored possible gender-based differences. Altogether 119 Norwegian-speaking children participated in the study. The children were tested with the Pragma test and parents completed the Children’s Communication Checklist-2. The children were allocated into three groups: 5-year-olds, 6-year-olds and 7-year-olds. The psychometric qualities of this Norwegian adaptation supported its use as a tool for assessing pragmatics in children aged 5–7. Strong and significant growth in pragmatic competence was observed from age 5–6, subsequently flattening out between age 6 and 7, and gender differences in favour of girls were identified. These findings indicate that pragmatic ability, as measured by the Pragma test, shows similar age effects in a Norwegian setting as in Finnish and Italian contexts, paving the way for further cross-cultural and cross-linguistic studies.
Introduction
Pragmatic ability is essential for children’s socialization, academic achievement and wellbeing as successful communication requires one to go beyond the literal meaning of words and utterances and utilize linguistic, cognitive and social skills to construct meaning (Loukusa et al., 2018; Norbury, 2014). Pragmatics refers to the ability to effectively use and interpret language in different contexts (Levinson, 1983; Turkstra et al., 2017) and the terms pragmatics and social communication are often used interchangeably, referring to similar skills (Murphy et al., 2019). A broad definition of pragmatic competence includes the ability to make inference, initiate conversation, follow the rules for politeness and conversational exchange, provide relevant responses, produce coherent narratives, understand non-literal language and convey information through gesture, facial expressions and prosody (Fujiki & Brinton, 2009; Matthews et al., 2018; Murphy et al., 2019). In this study, the term pragmatics, referring to aspects of verbal as well as non-verbal language, is adopted.
Pragmatic communication skills emerge long before children start talking; as they can follow the gaze of another person and establish joint attention by 3–4 months and start using gestures and vocalizations for requesting, greeting and protesting before their first birthday. Gradually single words and two-word utterances are used to replace or supplement gestures; and by age 2, basic communicative use of language is commonly mastered (Alduais et al., 2022; Turkstra et al., 2017). Between the ages of 4 and 8, the ability to utilize contextually relevant information accelerates (Bosco et al., 2004; Gabbatore et al., 2021). Pragmatic competence continues to develop and refine through childhood and adolescence, making the individual increasingly able to use language as an effective tool for social interaction (Alduais et al., 2022; O’Neill, 2007). Successful communication presupposes the ability to take the perspective of others, and Theory of Mind (ToM), the ability to attribute mental status to other people and behave accordingly, appears to link to pragmatic development, although this relationship is still not fully understood (Bosco et al., 2018; Bosco & Gabbatore, 2017; Hyter, 2017; Loukusa et al., 2017). Traditionally, it has been argued that ToM emerges around the age of 4 and keeps developing during childhood and adolescence (Bosco et al., 2014). However, recently it has been found that even before the age of 2, children may display action expectations congruent with others’ belief when they are tested non-verbally (Grosse Wiesmann et al., 2020).
Being able to reflect upon and provide relevant explanations of one’s own answers is an ability that is connected to pragmatic abilities and that develops over time (Donaldson, 1986; Loukusa et al., 2017). In a study of Finnish children, Loukusa and colleagues (2017) found that even at the age of 8, children were not able to correctly explain more than 80% of their answers on questions targeting pragmatic ability. Similar findings were reported by Gabbatore and colleagues (2021) for Italian children; they found that while 5-year-olds were able to correctly explain 48% of their answers, the corresponding number for 8-year-olds was 78%.
To be able to identify pragmatic difficulties or delays, knowledge of typical pragmatic development is essential. It is well-established that pragmatic problems are core characteristics of a variety of clinical populations, for example, attention deficit hyperactivity disorder (ADHD; Carruthers et al., 2022; W. A. Helland, Helland, & Heimann, 2014), autism spectrum disorder (ASD; Félix et al., 2024; Loukusa et al., 2018), behavioural problems (W. A. Helland, Lundervold, et al., 2014) and hearing impairment (Yoshinaga-Itano et al., 2020). By investigating typical developmental patterns and collecting normative data, better support and individually targeted interventions can be provided for children who struggle (Gabbatore et al., 2019, 2021).
Studies exploring gender-based differences in emerging language skills have commonly reported girls scoring consistently better than boys (Bornstein et al., 2004; Eriksson et al., 2012; T. Helland et al., 2017; Nouraey et al., 2021; Stolt, 2023). However, studies focusing on the role of gender in pragmatic ability are sparse and mixed findings are reported. In a recent Italian study with 4–8-year-old children, Gabbatore and colleagues (2021) found no gender-based pragmatic differences as measured by the Pragma test (Loukusa et al., 2017), confirming previous findings by Longobardi and colleagues (2017) based on parental reports. Likewise, in studies with the Language Use Inventory (LUI; O’Neill, 2007) assessing children aged 18 to 47 months, pragmatic competence did not differ between boys and girls neither in a French (Pesco & O’Neill, 2016) nor in a Norwegian sample (W. A. Helland & Møllerhaug, 2020). However, significant differences in favour of girls have been reported in English (O’Neill, 2007), American (Ash et al., 2017), Polish (Białecka-Pikul et al., 2019) and Dutch (Ketelaars et al., 2010) samples.
The scarcity of assessment tools in this field makes it challenging to provide an accurate and comprehensive assessment of pragmatic abilities (Gabbatore et al., 2019). In Norway, only three tools for the assessment of pragmatics in children exist. These are all based on parent/teacher reports; the Norwegian versions of the Children’s Communication Checklist Second Edition (CCC-2; Bishop, 2011), The LUI (W. A. Helland & Møllerhaug, 2020; O’Neill, 2007) and the pragmatic profile of the Clinical Evaluation of Language Fundamentals, Fourth Edition (CELF-4; Semel et al., 2013). These assessment tools have all been developed in English and adapted to Norwegian. They have been found to provide reliable observations of children’s daily communication; however, the subjectivity of the observer is an obvious limitation. Thus, a structured test, not mediated by parent/teacher observations, presents as a valuable part of a comprehensive assessment of pragmatic competence.
The Pragma test (Loukusa et al., 2017) was originally developed in Finnish for use with children in the age range 4–8 years. Subsequently, an English version has also been developed. The test focuses on children’s abilities to understand others’ intentions and utilize contextual cues, areas that are not fully covered by other instruments. Furthermore, the test reflects real-life-pragmatic demands in a structured test situation and provides quantifiable results of strengths as well as weaknesses (Gabbatore et al., 2021; Loukusa et al., 2017). The Pragma test contains five categories of questions, each targeting different aspects of pragmatic abilities. The categories are termed (1) contextual interference with ToM demand, (2) contextual inference without ToM demand, (3) feelings recognition, (4) relevant language use and (5) false belief.
It is plausible that children growing up with different cultural backgrounds and native languages may have different communicative styles. However, mainly due to lack of appropriate assessment tools, research on the development of pragmatic abilities across different languages is scarce (Gabbatore et al., 2019, 2023). The Pragma test was recently translated into Italian by Gabbatore and colleagues (2021), and their results indicate that the test is also relevant in an Italian cultural context. Translating and adapting an existing assessment tool for use in different languages enables cross-cultural investigation of abilities assessed by the same instrument. To the best of our knowledge, no formal direct test of pragmatic ability is presently available in Norway. Thus, the aims of this study were (1) to make a first evaluation of the psychometric qualities of a Norwegian adaptation of the Pragma test (2) to investigate pragmatic development in Norwegian children aged 5–7 years, (3) to investigate responses by question type and (4) to explore possible gender-based differences in pragmatic abilities.
To assess the concurrent validity of the Pragma test, CCC-2 scores were obtained from a sub-set of children.
Method
Participants
Altogether, 119 Norwegian-speaking 5–7-year-old children (60 girls and 59 boys) took part in this study. The participants were allocated into three groups: 5-year-olds (
Participants’ characteristics.
Procedure
The study was approved by the Norwegian Centre for Research Data. The participants were recruited from seven kindergartens (last year before starting school) and six elementary schools representing urban as well as rural districts in Vestland county, Norway. The kindergartens and schools distributed information about the study together with a consent form to parents of all children in the selected age range. Those who wanted their children to take part in the study returned their written consent, whereupon they received a copy of the CCC-2 to complete. The children were tested individually by students in speech-language therapy in a separate room in their kindergartens and schools. The test sessions were video and audio recorded for scoring. Initially, we intended to include only children who had Norwegian as their first language. Information on first language is collected as part of the CCC-2. However, checklists were missing for 33 out of the 119 children who were administered the Pragma test, meaning that information on first language was unavailable for a substantial part of the children. In response to this, we ran group comparisons (Student’s independent samples
Instruments
The Pragma test
The Pragma test (Loukusa et al., 2017) consists of a battery of tasks assessing children’s ability to understand contextual meaning, intentions, and relevant use of language. Short scenarios, supported by pictures, figurines and plastic animals, are presented verbally to the children. The Pragma test includes 39 questions in total, classified into five categories (see Table 2 for examples).
Examples of different question types, scenarios, and correct answers.
Source: Adapted from Loukusa et al. (2017) with permission. [ ] = Norwegian version.
The explanation question was only asked if a correct answer was given.
Contextual interference with ToM demand
These questions (
Contextual inference without ToM demand
These questions (
Feelings recognition
These questions (
Relevant language use
These questions (
False belief
These questions (
To answer correctly, the children need to understand the implied meaning of the utterances. Each question is scored as either 1 or 0. A score of 1 requires that the answer reflects an accurate understanding of the situation and addresses the targeted pragmatic aspect, failing this; a score of 0 is assigned. The maximum score equals the number of questions, leading to a Pragma maximum score of 39. In addition, 13 of the questions require the children to explain how they arrived at their correct answer. Guidelines for scoring the answers are presented in the manual (Loukusa, 2019). Depending on the performance of the individual child, it took approximately 30–45 minutes to administer the Pragma test. The test presents with good psychometric properties. Internal consistency values with Cronbach’s alpha of 0.927 and 0.794 are reported for answers and explanations, respectively. Interrater reliability (intra-class correlation coefficient, ICC) of 0.985 (answers) and 0.984 (explanations) indicates highly reliable scoring (Gabbatore et al., 2021; Loukusa et al., 2017).
Norwegian translation and adaptation of the Pragma test
Written permission for translating and adapting the English version of the Pragma test into Norwegian was given by the developer, Soile Loukusa. To provide an equivalent version of a test developed in one language into a different language, merely translating the test would not be sufficient; a more extensive adaptation process would usually be required. A two-way translation procedure in line with the guidelines given by the ‘ITC Guidelines for Translating and Adapting Tests’ (2018) was followed. A panel consisting of students in speech-language therapy (who were also educated teachers and linguists) and professors in speech-language therapy collaborated on the Norwegian translation. Only minor modifications were made to adapt the Pragma test to a Norwegian context; item 1 referring to
The CCC-2
The CCC-2 (Bishop, 2011; Norwegian version) was used as a gold-standard in analyses of concurrent validity. The CCC-2 is a widely used parent-completed checklist, designed to screen for children at risk for language impairment and to give a quantitative estimate of pragmatic/social interactional problems in children. The CCC-2 assesses children’s communicative abilities in everyday contexts, which may differ compared to formal test situations. The instrument is standardized for the age range 4–16 years. Internal consistency for the Norwegian version ranging from 0.73 to 0.89 (Cronbach’s α) and inter-rater reliability ranging from 0.44 to 0.76 are reported (W. A. Helland et al., 2009). The checklist consists of 70 items organized into 10 subscales: (a) Speech, (b) Syntax, (c) Semantics, (d) Coherence, (e) Inappropriate initiation, (f) Stereotyped language, (g) Use of context, (h) Non-verbal Communication, (i) Social relationships and (j) Interests. Scale ‘a’ through ‘d’ assess mainly structural aspects of language (however scale ‘d’ might be considered to measure both structural and pragmatic aspects of language), scale ‘e’ through ‘h’ assess pragmatics, and scales ‘i’ and ‘j’ assess autistic traits. The raw scores are converted into scaled scores with a mean of 10 and an SD of 3 with higher scores indicating better competence. A General Communication Composite (GCC), encompassing both structural and pragmatic language difficulties, is derived by summing the scaled scores of subscales ‘a’ through ‘h’. In addition, although not included as a standard part of the CCC-2, a pragmatic composite (PC) may be calculated by summing the scaled scores of scales ‘d’ through ‘h’ (W. A. Helland & Helland, 2017).
Statistical analyses
The internal consistency of the Pragma test was measured by Cronbach’s alpha. Alpha scores between 0.70 and 0.80 are considered acceptable, with scores above 0.80 indicating a highly internally consistent scale (Tavakol & Dennic, 2011). To check the reliability of scoring, the ICC between scores assigned by two independent raters was calculated on a subsample of 24 randomly selected participants (20.2% of the total sample). An ICC between 0.50 and 0.75, between 0.75 and 0.90 and greater than 0.90 are considered moderate, good and excellent reliability, respectively (Koo & Li, 2016). Two-tailed correlation analysis (Pearson’s
Results
Internal consistency and inter-rater reliability
The Norwegian version of the Pragma test showed a Cronbach’s alpha of 0.857 for the Pragma total score. The ICC between two independent raters measured on a subsample (
Concurrent validity
The Pragma total score correlated significantly with the composite scores of the CCC-2; GCC and PC with
Correlations between Pragma and CCC-2.
CCC-2: Children’s Communication Checklist Second Edition.
Performance across age groups on the Pragma test
One-way ANOVA revealed a significant effect of age (

Mean Score of Pragma Total in the Age Groups 5, 6 and 7 Years.
Comparison of question types
One-way ANOVA showed a significant effect of age on four out of five question types: contextual interference with ToM demand (
Mean, SD and correct answers (%) for the different question types by age groups.
ConToM: contextual inference with theory of mind demand; ConNoToM: contextual inference without theory of mind demand; Feelings: feelings recognition; Relevant: relevant language use; False: false belief. Maximum raw score equals the number of items in each question type.
Performance on the explanation tasks
As mentioned, 13 questions included a follow-up question (given only if the child provided a correct answer) where the children were asked to explain how they arrived at their answer. Analysis of the relative frequency (number of correct explanations/total number of explanations given × 100%) showed that 5-year-olds could successfully explain 21.5% of their correct answers, 6-year-olds 39.2% and 7-year-olds 43.4%. One-way ANOVA showed a significant main effect of age group in the ability to explain answers (
Gender-based differences
In the overall sample, independent samples
Discussion
In sum, the main findings of this study indicate that the Norwegian adaptation of the Pragma test, which was originally developed in Finland, presents with acceptable psychometric qualities confirming its potentials to serve as a valid tool for assessing children’s pragmatic abilities. Strong and significant growth in pragmatic competence was observed from age 5 to 6, subsequently flattening out between age 6 and 7. Furthermore, gender differences in favour of girls were identified.
The alpha value of the Pragma total score showed a highly internally consistent scale comparable to the original Finnish version (Loukusa et al., 2017), thus confirming the reliability of the Norwegian adaptation. To check the reliability of scoring, an ICC between two independent raters was calculated on a subsample of 24 children. The ICC for the Pragma total score and the explanation tasks was 0.988 and 0.919, respectively, indicating that the scoring was highly reliable. These values are also in line with those reported by Loukusa and colleagues (2017).
To derive a measure of concurrent validity for the children’s performance on the Pragma test, the CCC-2, was included in the study as a gold standard. Pearson’s correlations coefficient showed a significant and moderate correlation between the Pragma total score and the GCC of the CCC-2. Furthermore, a significant correlation was also found between the Pragma total score and the PC of the CCC-2. Better performance on the Pragma test corresponded to better performance (or fewer communicative problems) as measured by the CCC-2. These results are in line with the findings reported for the Italian adaptation (Gabbatore et al., 2021) suggesting good sensitivity of the Pragma test in identifying pragmatic difficulties in different native languages and cultural contexts. It should still be mentioned that the Pragma test showed a clearer pattern of correlation with the CCC-2 subscales foremost assessing structural aspects of language than with the subscales assessing pragmatics more exclusively. This could in part be due to the morphosyntactic and semantic demands represented by the Pragma questions. As opposed to the CCC-2, which is filled out by parents, understanding and responding to the items of Pragma requires the child to process the questions themselves. This would logically mean that Pragma should show a direct relationship with the language processing abilities of the child. Another potential reason could be that both the GCC and the Pragma total score essentially conflate subtests which assess different abilities. That is, the GCC combines measures of structural (grammatical and semantic) language and pragmatics, whereas the Pragma total score combines scores on different aspects of pragmatics – which do not necessarily measure the same construct. Nonetheless, our results indicate that the Norwegian version of Pragma is highly internally consistent, which should justify treating it as a single scale. Furthermore, the significant correlation between the PC and the Pragma total score indicates that both are measures of pragmatic language abilities. Importantly, CCC-2 and the Pragma test should be considered complimentary, and not necessarily interchangeable, instruments as they focus on similar but not entirely overlapping pragmatic competencies, and because they assess pragmatic ability from different perspectives (parents’ assessment and formal test performance).
Our results showed a significant effect of age on pragmatic abilities with performance improving with age. When comparing adjacent age groups, the youngest group (5-year-olds) performed significantly poorer than 6-year-olds. A slower but still detectable improvement in pragmatic abilities, with 7-year-olds descriptively performing better than 6-year-olds, was found; however, this improvement did not reach significance. This aligns with the results of Gabbatore et al. (2021) who did not find any significant differences when comparing 6- and 7-year-old Italian children, but contrasts with the findings of Loukusa et al. (2017) who reported significant differences between age 6 and 7 in their Finnish sample. The Pragma total score (mean) of Norwegian children is somewhat below that reported for Finnish (Loukusa et al., 2017) and Italian children (Gabbatore et al., 2021). One might only speculate what could be the reason for this. It might be due to slightly different inclusion criteria; shortcomings in translation, a stricter scoring may have been applied, or it may of course reflect cross-cultural differences in pragmatic development. In a study comparing Finnish, Italian and Canadian children, Mäkinen et al. (2020) found that despite largely similar trends in development across the native languages some differences were observed, for example, in productivity with Italian children being more talkative. Furthermore, at an early age, Finnish children seem to show better performance than English children on some pragmatic tasks (Loukusa et al., 2007). However, the same pattern of performance related to age was found in our Norwegian sample as in the Finnish and Italian samples, which is also consistent with results of previous studies of children’s pragmatic development in different cultural contexts (Angeleri & Airenti, 2014; Białecka-Pikul et al., 2019; Bosco et al., 2018; Glenwright & Pexman, 2010; W. A. Helland & Møllerhaug, 2020; Loukusa & Leinonen, 2008; O’Neill, 2007).
When exploring the effect of age in relation to each question type; contextual interference with ToM demand, contextual inference without ToM demand, feelings recognition, relevant language use and false belief, the same pattern as for the Pragma total score emerged. Except for the false belief task, where no significant differences were found between the age groups, 5-year-olds performed significantly poorer than 6- and 7-year-olds on all question types. The finding of no differences between the age groups in the false belief task is consistent with studies indicating that such tasks are commonly mastered by the age of 4 or even earlier (Grosse Wiesmann et al., 2020; Milligan et al., 2007). Regarding performance in relation to question type, the false belief task also turned out to be the easiest one to understand, followed by feelings recognition and contextual inference without ToM demand, while relevant language use and contextual inference with ToM demand were the most challenging. To perform well on the two latter tasks, children must understand social norms, draw conclusions by combining information derived from different sources, as well as understand and consider the mental states of others. ToM abilities have been found to keep developing during childhood and into adolescence and thus it should be expected that questions requiring these abilities are among the most difficult ones (Bosco et al., 2014). These findings align well with those reported by Loukusa et al. (2017) and Gabbatore et al. (2021) for Finnish and Italian children respectively. Some further adjustments should probably be made to the three final items (all with ToM demand) of the Norwegian adaptation as the meaning of these stood out as challenging to grasp.
In addition to answering pragmatically complex questions, the children’s awareness of how they arrived at their correct answers was also investigated. Five-year-olds explained 21.5%, 6-year-olds 39.5% and 7-year-olds explained 43.3% of their answers correctly. There was a significant difference between 5- and 6-year-old children’s ability to provide appropriate explanations for their answers. Although 7-year-olds were able to explain more correct answers compared to 6-year-olds, this difference did not reach significance. These findings indicate that to verbally formulate how one arrives at an answer requires competence that takes a long time to master (Letts & Leinonen, 2001; Loukusa & Leinonen, 2008). The children in our Norwegian sample successfully explained their correct answers to a lesser extent than Finnish (Loukusa et al., 2017) and Italian (Gabbatore et al., 2021) children. However, the same pattern, showing increasingly better competence as a function of age, especially between the ages of 5 and 6, emerged.
In this study, the exploration of gender-based differences in pragmatic abilities showed that girls outperformed boys. This contrasts with the results of Gabbatore et al. (2021) who found that girls and boys performed equally well on the Pragma test in their Italian sample. However, our findings are in line with other cross-cultural studies reporting significant differences in pragmatic abilities in favour of girls (Ash et al., 2017; Białecka-Pikul et al., 2019; Ketelaars et al., 2010). When inspecting the separate age groups in our sample, it was found that while 5- and 7-year-old girls performed better than boys, no gender-based differences in pragmatic abilities were evident in 6-year-olds.
Some methodological limitations should be considered when evaluating the results of this study. As an inclusive approach was adopted, our sample may be more diverse than the Finnish (Loukusa et al., 2017) and Italian (Gabbatore et al., 2021) ones. Furthermore, our study included children in the age range 5–7 years while the age of children in the aforementioned studies ranged from 4 to 8 years. Thus, future research collecting data from Norwegian 4- and 8-year-olds should be carried out. Preferably, studies should also be carried out to investigate the ability of the Norwegian adaptation to distinguish between typically developing children and clinical samples. Information of parental education level or socioeconomic status (SES) was not collected in this study. However, in Norway, a relatively egalitarian income distribution and a universal social security system contribute to low poverty levels, and consequently significant differences in SES were not expected. Furthermore, pragmatic abilities have not been found to correlate with children’s SES (Gabbatore et al., 2021). The fact that the test manual was not formally translated from Finnish into Norwegian may have led to uncertainties of how to interpret and score some of the answers. This study applied a cross-sectional design, however, to assess within-subject growth in pragmatic competence over time, longitudinal studies are required.
The Norwegian adaptation of the Pragma test derived from the lack of assessment tools for individual assessment of pragmatic ability in children. Combining assessment tools focusing on similar but not fully overlapping aspects, a questionnaire to be completed by parents and a direct test protocol, should contribute to a more accurate and comprehensive assessment of pragmatic abilities in children. In sum, the results of this study support the psychometric qualities of the adaptation as a tool for assessing pragmatics in Norwegian children aged 5–7 years. Furthermore, our findings indicate that pragmatic ability, as measured by the Pragma test, shows similar evidence of age effects in a Norwegian setting as in Finnish and Italian contexts, paving the way for further cross-cultural and cross-linguistic studies. Although further research with larger samples is needed to validate our results and to look further into possible gender-differences, this study contributes with new knowledge of pragmatic development in Norwegian children. Subsequently, this should lead to better support for clinical populations in line with their actual developmental level.
Footnotes
Acknowledgements
First, we want to thank Soile Loukusa for her generous assistance and for giving us permission to adapt the Pragma test into Norwegian. We also thank all children and their parents, kindergartens and schools who participated in the study.
Author contributions
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by Norsk Logopedlag (NLL) and the University of Bergen.
Ethical approval
The study was given ethical approval by the Norwegian Centre for Research Data.
