Abstract
Language education policies contain discourses that have language ideologies embedded within them. This study explores the language ideologies in official language education policy documents of Pakistan from 2000 to 2020. Using Corpus Assisted Critical Discourse Analysis techniques, a 1.28 million-word-corpus was generated from 32 policy documents collected from official websites of UNESCO and the Ministry of Federal Education and Professional Training, Pakistan. The corpus analysis of frequency, collocation, and MI significance values were utilized to find out the typicality of topics and semantic preferences, while the concordance analysis revealed discourse prosody around the languages mentioned in the corpus. Moreover, seven documents were downsampled through dispersion plot. Discourse Historical Analysis was carried out on these documents based on discursive strategies of argumentation, predication, and perspectivization with reference to intertextuality and interdiscursivity. The findings show that English and to some extent Urdu occupy the ideological space related to education. The local languages were discussed in positive light, but they were marginalized in educational ideological space. The analysis also showed that the policies had underlying monoglossic ideology that can be seen having tolerance-oriented and expediency-oriented policy ideologies for local languages. This implies that the official policies of Pakistan marginalized the local languages in the discourse of language education. These findings invite policymakers to bring the local languages in the ideological space of education and introduce high level of ideological uniformity, consistency and goal setting for English, Urdu, and local/indigenous languages in the language-in-education policies of Pakistan that matches, captures and promotes the linguistic diversity in Pakistan.
Keywords
Introduction
Language policy and planning is one of the most critical domains due to the power of the language policy that makes some ideologies official and prevalent (Canese, 2018). For the subfield of language-in-education policy, Tollefson (2002, p.6) identifies two important arenas for language ideology research: (1) how language ideologies shape policies and how these are naturalized through discursive practices, and (2) the interplay of contesting language ideologies in language-in-education policies. Schools are the primary sites for promoting language ideologies through language-in-education policies (Canese, 2018; Cushing, 2021). Cushing (2021) calls the language policy-related texts, including curriculum documents, tests, linguistic landscapes, and rules and regulations, as a “mechanism” to implement those ideologies into practice. Through these mechanisms, the language ideologies become de facto language practices in centralized education systems (Shohamy, 2006).
The language-in-education in Pakistan has witnessed the power of ideologies implemented through 10 education or language-in-education policies over the last seven decades since the country’s independence in 1947, shown in Table 1. The table has been adapted from Siddiqui (2016) by adding 2017 policy as the education policy (2017) was not captured in the book. Besides, two columns have been added to show the ideological trends related to language-in-education policy in the government education policies. Siddiqui (2016) believes that most policies were led by the ideology of the ruler of the time instead of a democratic consultative process. He argued that it is essential to look at these policies from the ideological standpoint of the ruler (Siddiqui, 2016, p. xxi). de Jong et al. (2016) compared the language policies of Taiwan, Pakistan and China. They argue that historically different languages have been treated from different ideological stances. In pre-partition India, Urdu was considered a resource as the language of intellectuals and literature. During British rule, English did not immediately replace Urdu as the language of formal communication. However, it ripped Urdu off its heritage of being a resource for intellectuals, administration, economy, political and social mobility. After the independence, the Bengali language emerged from the language-as-right orientation in response to Urdu that was declared the national language. Bengalis in East Pakistan felt it was challenging to communicate in English or Urdu as they had limited proficiency in both languages. However, their study is based on the selected review of literature which traces its history in a concise space and lacks any systematic reference and comparison to the language policies. Shamim (2008) traced the ideological trends in Pakistan’s language-in-education policy up to National Education Policy (2009) She broadly divides them into four phases. The phase from 1947 to 1977 was identified as a trend toward one-nation one-language ideology, that is, Urdu as the symbol of national identity and unity. 1978 to 1988 was the phase of the revitalization of Urdu under the Islamization efforts by General Zia ul Haq. The democratization of English ideology marked the phase from 1989 to 1999. Lastly, in the period from 1999 to 2009, the ideology of English as the language of development was dominant. Mahboob (2002) discussed the connection between political shift and language policy till 1999, as indicated in Table 1. It may be noted that these studies show varying ideologies during those political eras. Shamim (2008) believes that the one-nation-one-language ideology remained dominantly consistent over a long period, whereas, Mahboob (2002) shows a fluctuation or shift. Both agree that from 1979 until 1992, the policy reflected pro-Urdu ideology, which changed into pro-English post-1992.
Summary of History of Language-in-Education-Policies in Pakistan and Their Ideological Trends.
The above studies show the investigation of language ideologies underlying in language-in-education policies till 2009, primarily based on randomly selected policies and their analysis. Moreover, mostly researchers selected national-level policy documents only. It suggests a selective presentation of ideologies in official documents related to language-in-education without including the provincial governments’ documents. We argue that after the 18th Constitutional Amendment of the 1973 Constitution of Pakistan, the provincial governments have autonomous educational decision-making, including language-in-education. Consequently, the national and provincial language-in-education policy documents would reflect a broader ideological representation of different languages. Therefore, the purpose of this study is to explore the language ideologies embedded in the discourse of language-in-education policies not only in the selected national education language policies but across all national and provincial government education policies. The study is significant because it addresses the limitation of contextual studies discussed earlier and adopts a structured linguistic analysis of policy documents from 2000 to 2020 guided by Corpus Assisted Critical Discourse Analysis (Baker et al., 2008). The research question for the study is:
What are the ideologies related to the languages present in the education policy texts of Pakistan?
Literature Review and Theoretical Framework
Language Policy and Language Ideologies
Ideology, discourse and language are interconnected phenomena. It is problematic to capture ideology with a single agreed-upon definition of the term (Tollefson, 2011). For Blommaert (1999), language is the leading site of ideological debate. The language beyond sentence level is discourse (Lin, 2014); this “Discourse” reflects ideologies or the way of thinking and represents certain phenomena under discussion (Gee, 1996). Krzyzanowski and Wodak (2001) show the inter-relatedness between language policy and language ideologies. Language ideologies are “cultural ideas, presumptions and presuppositions with which different social groups name, frame and evaluate linguistic practices” (Gal, 2006 cited in Krzyżanowski & Wodak, 2011). These ideologies are reconstructed and perpetuated in discussions where language is the main topic of debate in public domains. Language-in-education policy is one such example of public sphere discourse that reflects the ideologies of institutional social actors who define how languages should co-exist and what roles they should perform in social, political and economic contexts (Krzyżanowski & Wodak, 2011). The process of language and planning takes place at several levels. Ricento and Hornberger (1996) identified those levels of language policy and planning with three layers of an onion. They consider that the outer-most layer or macro level policy is regulated by the state that decides which language receives national status, which receives educational support, which stay repressed, and which stay ignored. The second inner layer as meso-level policy is concerned with institutions such as schools, religious organization, media, civic and other private and public organizations, and business communities involved in language policy and planning development. The inner-most-layer or micro level policy is where teachers are the main people, but it is debatable that whether they can influence or have direct contribution to language policy and planning process.
The powerful groups such as authors and politicians who control significant public discourses influence and help reproduce dominant ideologies (Baker et al., 2008). Tollefson (2000) maintains that language ideology present in the language policy is so strong that it makes school systems translate the language ideology as something necessary that is in favor of the society. He argues that language ideology has a direct impact on language education policies. In countries like the United States, England, and Australia, the standard language ideology is enacted through language policies of those countries. Spolsky (2004) believes there is no difference between language ideology and language policy as he puts down “language policy is language ideology with the manager left out, what people think should be done. Language practices, on the other hand, are what people actually do” (p.14). For the “legitimization” (Canese, 2018) and “concretization” (Cushing, 2021) of language ideologies symbolized in language-in-education policies, they need to be practiced in educational institutions and in the communities outside those institutions. The four categories of actors with different roles for implementation of language policy and planning are people-with-power, people-with-expertise, people-with-influence and people-with-interest (Zhao, 2011; Zhao & Baldauf, 2012). The people-with-power are the policymakers at the macro level with official role of deciding the status of languages. The other three groups are described as the professionals such as linguists, social elites who influence the masses with their work such as scholars and celebrities, and the communities and language users at the micro level. According to Zhao (2011), the postmodernist perspective emphasizes the roles of agency and actors from micro levels, and on individual agency from the four categories on the effectiveness of language policy and planning efforts. These micro level actors lack the power that the macro level policymakers possess as they have the privilege to take part in policy formation and often these policies benefit themselves (Shao & Gao, 2019, pp. 257–258). Although the process of successful language-in-education policy and planning takes place at macro to micro levels with a variety of policy actors, the textual analysis of language policy documents is an important arena. A rigorous policy textual analysis can reveal the underlying hegemonic ideologies that perpetuate inequalities in and through language-in-education policies without being questioned. It is aptly stated by Shao and Gao (2019), p. 261), “A critical-discursive perspective is able to deconstruct ideology laden statements and taken-for-granted positions.” The critical discourse analysis of policy texts has the power to make the implicit ideologies explicitly visible for a variety of actors at different levels for them to address the language ideologies in alignment with the needs of the rapidly changing society and the world.
Language Ideology as a Theoretical Framework
Tollefson (2006) presents Fairclough’s (1989) definitions of hegemony and ideology together within the realm of Critical Theory in language policy as both are interconnected. Hegemony is a process through which institutions help reside and maintain power within the control of a few people, supported by the ideology that is “naturalized” unconscious belief and assumptions in society. The focus of critical theory in language policy remains on uncovering the implicit and explicit policies. These policies act as tools for creating and sustaining inequalities through hegemony portrayed as “natural” via social institutions (Tollefson, 2006, pp. 47–48). More specifically, certain language ideologies are presented as natural by language-in-education policies. For example, Ricento (2013, p. 530) refers to the ideology of English monolingualism prevalent in English dominant countries, initially introduced by Wiley and Lukes (1996) and Silverstein (1996). Wiley and Lukes (1996) believe that monolingual ideology considers that immigrants introduced linguistic diversity in those originally English dominant societies. Silverstein (1996) terms it as “monoglot” ideology which disregards the sociolinguistic reality replete with practices of multilingualism and diversity. The monoglot ideology influences language-in-education practices, social life, identities, and academic life (Blommaert 2006, cited in Ricento, 2013). García (2009) separates monoglossic ideology from heteroglossic ideology. Monoglossic ideology embedded in bilingual programs considers command on two languages independent of each other, resulting in “double monolingualism.” Heteroglossic ideology maintains that languages are inseparable and uncountable, and reflects a fluid use of multiple languages by multilingual speakers known as translanguaging (Flores & Baetens Beardsmore, 2015, p. 213). Similarly, Wiley and Lukes (1996) identify standard language ideology. A specific variety of a language of a powerful social group is given a higher status while other varieties are attached with a lower status. The political and social influential groups make a particular variety standard through codification against which all other varieties are judged and made as a norm through schooling gradually (Wiley & Lukes, 1996). Busch (2011) notes that schools in Europe have actively repressed other languages and language varieties by implementing standard languages as a nation-building process under monolingual and homoglossic ideologies. Despite prevalent multilingualism, schools in Belgium and Switzerland still have several monolingually oriented systems. Learners are assessed in the standard academic language against a standard based on an ideal native speaker, creating barriers for equal access and opportunities for other language speakers (Busch, 2011, pp. 545–546).
Ideologies are embedded in language policies. Kloss (1998) studied official language policies from an ideology perspective and developed a schema that helps categorize official language policies’ orientations or ideologies. He identified four types of language policies: (1) promotion-oriented, (2) expediency-oriented, (3) tolerance-oriented, (4) and restrictive-oriented. Wiley and Tollefson (2013) elaborates that promotion-oriented policies are those in which the governmental resources are utilized to support a language or languages in official domains. Expediency-oriented policies are a “weaker version” of promotion-oriented policies. They do not target the enhanced use of minority languages. Tolerance-oriented policies leave the responsibility of maintaining minority languages on language communities with no governmental resources and support. Restriction-oriented policies “make social, political, and economic benefits, rights, and opportunities conditional on knowing or using the dominant language. Language restrictions usually target communication in work-related or official domains” (p. 326). Wiley and Tollefson (2013) expanded Kloss’s (1998) framework by adding two more types: null- and repression-oriented policies. Repression-oriented policies are those that deliberately try to eliminate minority languages. Null policies are those where there is an apparent absence of policy recognizing minority languages or language varieties. Historically, language policies of the US can be categorized across all of these types (Wiley & García, 2016). Similarly, Ruíz (1984, p.16) introduced three orientations or ideologies to language policy: language-as-resource, language-as-right, and language-as-problem. According to him, these orientations indicate “a complex disposition towards languages and their roles,” and language attitudes are formed within these ideologies. Johnson (2013, p.36) further elaborated that language-as-problem ideology views minority languages as a problem in acquiring the dominant language. Therefore, transitional bilingual policies hold this ideology. Language-as-right ideology is a unidirectional bilingual program in which the minority language students are required to learn the dominant languages and maintain their language simultaneously. Language–as-resource ideology values linguistic diversity through bidirectional programs aiming at additive bilingualism. Students from both the dominant and minority language learn in both languages. The above literature shows that language ideology is an important phenomenon that is inseparable from and strongly influences language policies and has been studied from different perspectives. This study contributes to the existing literature on language ideology by expanding the application to language-in-education policy text from Pakistan. It also contributes to the methodologically growing field of corpus assisted critical discourse analysis studies (CACDA) by applying it to the corpus of language-in-education documents. Other studies have applied CACDA mainly in language ideologies presented in mainstream, social media, and language-in-education policy (see Doolan, 2011; Vessey, 2017).
Language Ideologies, CDA and Corpus Linguistics
Critical Discourse Analysis (CDA) has three dominant approaches with different foci: discourse-historical, cognitive, and dialectical-relational-dialectic (see Fairclough, 2013; Van Dijk, 2009; Wodak, 2001). Although all three dominant approaches of CDA have been widely used in language ideology research, CDA is often criticized for its random data collection (Cheng, 2013). Similarly, frequency, collocation and concordance techniques under Corpus Linguistic (CL) have been applied to ideology research (e.g., Cheng, 2013; Vessey, 2017). Nevertheless, the focus on textual analysis of data in corpus linguistics is considered a disadvantage in language ideology research since useful “contextual” information beyond the texts is usually absent (Vessey, 2017). Texts are significant instances for witnessing the embedded language ideology (Vessey, 2017); nonetheless, the socio-political context and discourses from which those ideologies emerge are equally important (Milani & Johnson, 2008 cited in Vessey, 2017). Within texts or documents, the explicit discursive representation or metalanguage shows the explicit language ideologies, whereas the implied assumptions about the role of languages show implicit language ideologies (Vessey, 2017). Combining corpus tools with other methods can enhance research on language ideology (Cheng, 2013; Vessey, 2017) as demonstrated by CACDA (Baker, 2006; Haider et al., 2021; Partington, 2010). A valuable combination of CDA and CL has been utilized by Baker et al. (2008), combining corpus linguistics approaches with Wodak’s (2001) discourse-historical studies (DHS). This combination helps address the disadvantages associated with both CDA and CL approaches noted above and enable triangulation of findings across methods (Baker et al., 2008, p. 295).
Wodak’s DHS provides a robust systematic methodological framework enabling researchers to carry out systematic data collection around a macro topic over a defined period to investigate changes in discourse practices (Reisigl & Wodak, 2009). Catalano and Waugh (2020) argue that DHS is one of the approaches of CDA with the greatest emphasis on wide-ranging historical data. DHS targets three types of critiques: (1) exploration of inconsistencies and contradictions within text or discourse; (2) exposition of manipulative discursive practices; and (3) improvement in communication for the future. The analytical strategies include intertextuality and interdiscursivity across a range of texts, genres, and discourses around a macro topic that helps identify a set of ideologies and inequalities that are perpetuated and reinforced by different texts and sites over time (Reisigl & Wodak, 2009). Five discursive strategies or intentional plans that authors use to attain specific goals through a text are investigated (see Reisigl & Wodak, 2009). These strategies are nomination, predication, argumentation, perspectivization and mitigation. The analysis of relevant strategies is guided by the nature and focus of a study (Mulderrig, 2014).
Method
This study incorporated Baker et al. (2008) methodology of combining CL with DHS. Incorporating Baker et al. (2008) and Vessey’s (2017) strategies, the word frequency, collocation and concordance analysis was carried out, followed by a detailed qualitative analysis of downsampled policy documents.
Data collection
For collection of language-in-education policy documents, provincial and federal government websites, offline government archives through government personnel, and officials in Higher Education Commission of Pakistan were approached. However, very few documents were available. Therefore, “Portal of Education Plans and Policies” (UNESCO, 2021.) website was selected because it provided the most comprehensive list of documents on language education in a single place. It also reduced the researcher’s bias for the selection of the documents. A few drafts of the recent Single National Curriculum (SNC) were gathered from the Federal government website (Ministry of Federal Education and Professional Training, n.d.) as these were not updated on the UNESCO’s portal yet.
Compilation of corpus
Based on the selected portal, a “purpose-built corpus” was developed from policy texts/documents on “particular topic” of language ideologies in language-in-education policy documents as suggested by Mautner (2016, p. 165). Total 71 documents were collected (54 from UNESCO and 17 from Ministry of Federal Education and Professional Training website). Using Vessey’s (2013) strategy of core query term (CQT), documents were manually searched for CQT “language”. This step ensured that only the documents that potentially had a discourse on language-in-education were selected for the corpus, while documents that did not contain the word “language” were excluded. This process yielded 26 documents from the UNESCO website, whereas 06 documents from the Ministry of Federal Education and Professional Training website. Other CQTs frequently used in the literature were also searched through those documents to ensure that no potential document was ignored. These terms included “tongue” (for mother tongue), “medium,” “instruction” (for medium of instruction), “linguistic” “Urdu,” “English,” “local” (for local language), “regional” (for regional language), minority (language). Based on this process, 32 PDF document files, shown in Table 2, were selected for analysis and converted to (.txt).
Distribution of Compiled Corpus.
Downsampling
Downsampling is a common technique adopted in CACDA as a strategy to counter the challenge that it is too “labour-intensive” to perform CDA on a number of texts (see Baker et al., 2008; Vessey, 2017). In CACDA, samples of texts are selected based on high frequency of the CQTs. For detailed CDA analysis of 07 downsampled documents shown in Figure 1, a combination of strategies used by Baker et al. (2008), and Vessey (2017) including high frequency, dispersion plot (see appendix) and data relevance were applied.

frequency of LANGUAGE in downsampled policy documents.
Data analysis
The research question is answered by combining the corpus and DHS analysis of data. The micro analysis of corpus data is carried out through frequency, collocation, and concordance analysis. These three analyses respectively reveal the typicality of topics/CQTs, strength of relationship with other words found in close environment, and the collocational environment of the words. In other words, the ideologies of the policymakers are revealed through what they discuss most often in the policy documents, what other words are most often used in discussing those topics, and whether negative or positive terms are used by the policymakers while discussing those topics/CQTs. The DHS analysis reveals the ideologies of the policymakers in macro discussion of the languages.
Corpus software packages such as WordSmith Tools (Scott, 2013) allows researchers to generate frequency lists, and explore measures of statistical significance such as Mutual Information (MI value/scores) and
Similarly, Mutual Information (MI) test scores show co-occurrence of words in a corpus that indicates the strength of collocation relationship between the words (Baker, 2006). Researchers have used various MI scores. For example, Vessey (2017) suggests MI score higher than 3.0 is considered to have statistically significant collocation, whereas Baker (2006) used an MI score of 4.0 and above for a significant relationship between words. Following Vessey’s work on language ideology, an MI score of 3.0 and higher was considered significant. WordSmith also allows researchers to examine the “collocational environment” of search words through patterns of concordance lines that show “evaluative polarity” (Mautner, 2016). In other words, whether the search word is found repeatedly in negative or positive collocational environment. This evaluative polarity is known as semantic prosody or discourse prosody. Similarly, “semantic preference” reveals with which category of other words a search word frequently occurs (Mautner, 2016; Vessey, 2013). For example, Baker (2006) concluded that the term
The DHS techniques guided the macro and detailed textual analysis. The focal strategies from Reisigl and Wodak (2009) used for this study were predication, argumentation, and perspectivization along with intertextual and interdiscursive analysis. The authors of texts use different discursive strategies to persuade the reader or audience. Through predication strategy, they assign positive or negative evaluative traits through adjectives, phrases, and clauses. Therefore, the predication techniques revealed what other semantic terms the authors attributed with the languages. Argumentation strategy involves the claims and how they are justified through culture-specific topoi or loci that are the underlying assumptions for concluding the argument. The arguments with the underlying topos revealed the policy authors’ use of different arguments to support their views. Perspectivization strategy, to some extent, relies on intertextuality to other texts and interdiscursivity by drawing upon references to other discourses for legitimization and support for the author’s perspective (Reisigl & Wodak, 2009). The perspectivization techniques found in the text showed the use of intertextual and interdiscursive references by the policymakers to support their perspectives.
The corpus techniques and analysis of DHS techniques together revealed the dominant ideologies through analysis of typicality, significance, attribution and discursive strategies in the language-in-education policy documents of Pakistan.
Findings and Discussion
This section answers the research question revealing the ideologies related to the languages of Pakistan through highlighting the typicality of topics/CQTs, significant words around the CQTs, and the collocational environment of the CQTs through expanded concordance lines. These techniques capture the ideologies of the policymakers through corpus analysis.
Frequency Analysis: The Typicality of Topics/CQTs
One of the key findings that surfaced through the distribution plot was that the CQT LANGUAGE had low frequency in National Education Policy (2009) and (2017) documents, while it occurred more frequently in other recent policy-related documents. Nevertheless, it occurred in 100% of the documents (32) as it was the main CQT for corpus development. This reflects the validity of the corpus data as the study aims to explore discourse and ideology in language-in-education policy. The wordlist showed that LANGUAGE occurred 1,009 times in the corpus. In contrast, the CQT EDUCATION appeared as the first most frequent content word ranked 8 with a frequency of 17,131, contributing 1.34% in the entire corpus across 32 texts. This is because the issue of language is a subject discussed in small portions in language-in-education policy documents, whereas the issue of education dominates the discourse. The next word of interest ENGLISH occurred 617 times in 26 texts. Compared to English, the CQT URDU had a frequency of 243 in 23 texts of the corpus. It reflects that ENGLISH has been discussed roughly three times more frequently than URDU. CQT MOTHER occurred 118 times across 17 texts and the phrase MOTHER TONGUE 75 times in 11 documents. This is the term commonly used for local languages the learners speak at home and has been heavily discussed in the domain literature. Other adjectives as CQTs such as “local” and “indigenous” showed even lower frequency. However, they are not reported here due to limited space. Overall, the low frequency of adjectives commonly used to name the local languages implies that the discussion of local languages is very limited in the education document corpus. Besides the significant CQT from literature in the domain of language-in-education, it was essential to find out which languages are frequently discussed in the corpus and the discourse around them. For this reason, the latest language data from census report of Pakistan 2017 (Pakistan Bureau of Statistics, 2017) served as the starting point. It showed 09 local languages, including six major languages Urdu, Punjabi, Sindhi, Pushto, Balochi, Saraiki, and 03 minor languages, including Kashmiri, Hindko, and Brahvi. Table 3 shows frequency analysis from the corpus for 09 languages identified in Pakistan Census (2017) (Figure 2).
Major Language Frequencies in Primary Corpus.
English presented for comparison of languages present in the corpus.

Mother tongues and number of speakers Pakistan.
Urdu and Sindhi appeared to be the most frequently occurring local languages in the corpus, besides Balochi and Punjabi 9 and 8 occurrences, respectively. English stands as the most frequently discussed language in the entire corpus. It is pertinent to compare the representation of languages in the discourse in language-in-education documents and the number of speakers. In Figure 1 above, the most prominent Pakistani language is Punjabi with around 80 million speakers, followed by Pushto roughly (40 million), Sindhi (30 million), Saraiki (25 million), and Urdu (15 million). The representation of languages in the corpus shows almost the reverse trend. Punjabi and Pushto are the two major languages according to the latest Pakistan Population Census (2017). The reference to these languages occurred only eight and three times respectively in five and three documents each. It means that the discourse of major local languages has been largely absent in language-in-education policies of Pakistan. Given this situation, it is hard to expect that other minor or smaller indigenous languages would receive any representation in language-in-education policy discourse.
The frequency analysis shows that English dominates as the typical topic among the languages discussed in the corpus. Urdu (the national language of Pakistan) is the second most discussed topic while the regional and local languages of Pakistan are least discussed topics. Nevertheless, frequency analysis has its limitation as sometimes frequency analysis in isolation could be misleading (Archer, 2009). These frequencies do not show the context of discourse. Other tools within CL such as collocation and concordance (Baker et al., 2008; Vessey, 2017) show the contexts of the topics or CQTs.
Collocation Analysis: The Immediate Context of CQTs
The analysis of statistically significant collocates started with the search of the CQT LANGUAGE showing the strength of the relationship with other words. LANGUAGE and EDUCATION showed a low MI value (0.5); therefore, not even a weak relationship was indicated between the words. This implies that discourse on the specific term “language-in-education” in the educational documents is minimal.
The selected significant collocates of CQT LANGUAGE/ES in Table 4 shows that it mostly co-occurred with words associated with education or school (learning, school, teaching, instructions, medium), besides three languages (Urdu, English, and Sindhi) and a few hinting at the context of discourse about language issues (need, mother (tongue), policies, and problem). Thus collocation analysis reveals that CQT LANGUAGE has semantic preferences with international, national, and local languages and policy issues in the discourse of language-in-education policy community. These results further helped identify CQT MOTHER TONGUE analyzed later in this section.
Education Related Significant Collocates of LANGUAGE.
URDU collocates with a range of words with various perspectives related to education with MI values between 5 and 8 (see Table 5). URDU collocates with ENGLISH and SINDHI (56 and 7 occurrences, respectively). There is frequent collocation between URDU and adjectives NATIONAL, REGIONAL and PROVINCIAL (12, 11, and 5 occurrences, respectively). The lemma URDU also occurs six times with MOTHER and five times with CHILD and CLASS.
Significant Collocations of URDU.
The collocation analysis of URDU shows that it had the semantic preference of being a language for a range of contexts, as a language of education, and discussed with other languages. In short, Urdu was discussed in context of educational aspects discussed by the policymakers. Similarly, with the significance value of 3 to 10 in Table 6, ENGLISH collocates with a range of words with a variety of perspectives related to education, including 39 collocations with the only other language URDU. English collocates with SECOND (12 occurrences), FOREIGN (10 occurrences).
Significant Collocations of English.
English seems to have a semantic preference for educational purposes, however, more strongly than Urdu as the collocational-context range with education-related words is much more extensive. English is also discussed in proximity with the national language but not with other local languages.
SINDHI collocated with URDU 17 times and with ENGLISH 8 times (MI value 8–11). MOTHER occurs with TONGUE, TONGUES, and LOCAL 77, 14, and 6 times and with education-related terms MEDIUM and INSTRUCTION 7 and 5 times respectively (MI value 6–13.41). This means unlike English and Urdu, mother tongues were largely absent in proximity of educational context showing the policymakers sidelined the local languages in educational domain. There was no significant collocation relationship between the CQT PUNJABI/PANJABI, PUSHTO/PASHTO, BALOCHI, KASHMIRI, SARAIKI, HINDKO, and BRAHVI with any other word. However, they were found in the word frequency analysis. During the analysis, the corpus showed that the use of the term mother tongue(s) for local language was more common, whereas very few local languages were discussed by their names. Therefore, the concordance analysis below has used mother tongue rather than the specific local language names. Urdu and English have been explored by names. The following section presents the concordance analysis of selected lines from the corpus related to English, Urdu, and mother tongues.
Concordance Analysis: The Attributes Surrounding the CQTs
The frequency and collocational significance are valuable techniques for finding out the topics’ typicality and the relationship among words. The contextual use of the words can be derived from the frequency of concordance lines showing the examples of the use of a word in context and functions of those words (Baker, 2006).
The pattern of concordance lines in Table 7 reveals that words around English explicitly suggest that English is necessary for “economic advancement” in the “globalised and competitive world” therefore “not just the elite,” but all schools should teach it or use it as a medium of instruction (MoI) as “children with better English language skills tend to have more opportunities.” The repeated use of the adjective “important” positively evaluates it as a language necessary for “advancement,” in the “modern world” that “continues to globalize.” This is a clear example of language-as-resource. The expanded concordance lines in the examples below show the importance of English.
Example 1 Private schools predominantly use English as medium of instruction, with a very strong focus on active use of English language by children, while public sector schools mostly use Urdu and the regional mother tongue as a language of instruction. Although, English is taught in many public-sector schools but because of the low capacity of teachers to teach English as a second language, children are unable to achieve even basic competency levels. The disparity in acquisition of language skills is one of the most divisive factors when it comes to application of education skill for jobs. Children with better English language skills tend to have more opportunities because of its glorified status in society and its use as a de facto official language. Generally, students in high-cost private schools are more advantaged than children studying in public sector schools, low-cost private schools and madrassas. (National Education Policy Framework 2018) Example 2 Furthermore, the predominant medium of instruction in rural areas, Urdu, makes the transition to the English medium higher education system difficult for students. The English language proficiency requirement poses a severe challenge to education equity even in urban areas for students coming from Urdu medium and disadvantaged backgrounds. This makes them more vulnerable amidst fierce competitions in universities’ admission tests. This increasing institutional heterogeneity creates classes of education where the advantaged retain their right to privileges that define their upward social mobility and worsens the development divides.
Selected Concordance Lines of ENGLISH.
Examples 1 and 2 exhibit that while English is considered the language of employment and opportunities, the students taught in Urdu medium schools are at a disadvantage. The negative words “disadvantaged,” “vulnerable,” “worsen” show the negative evaluation associated with Urdu.
Concordance trend in Table 8 indicates that the policy documents establish that English is not only considered necessary by the policymakers; instead, it is also the “demand,” “need,” and “prefer(ence)” of the parents and communities for their children to acquire English language proficiency to “seek better employment” opportunities.
Selected Concordance Lines for ENGLISH.
The concordance trend in Table 9 manifest mixed opinions on the introduction of English in the curriculum discussed as “compulsory subject,” and as a “medium of instruction as early as Class 1” or “Class III” and at “higher education.” These and other examples indicate that the documents argue that children should have good English proficiency for the highly competitive world.
Selected Concordance Lines of ENGLISH.
The concordance lines in Table 10 reveal a trend of Urdu being discussed as a national language and a regional lingua franca.
Selected Concordance Lines of URDU.
Table 11 reveals multiple opinions about the introduction of Urdu in various roles and stages such as in “upper classes,” “as a subject from class 1,” “as medium of instruction at the primary level” and according to the “local decision” and “wishes.”
Selected Concordance Lines of URDU.
The concordance trend on mother tongue in Table 12 reflects it being perceived as a resource that should be capitalized on by using more and more mother tongue-based education. The benefits of mother tongue are positively predicated with the help of several positive lexical items “additional,” “better expressions,” “superior achievement,” “perform better,” “international research supports.” All these words in the discourse around mother tongue are intentionally used as a persuasive technique. In context of the Urdu language, the discourse was positive or neutral. It was positive as emphasized by higher frequency (243 occurrences) in the entire corpus; moreover, it was predicated as a “national” language or “a lingua franca.” MOTHER TONGUE with 75 occurrences, on the other hand, showed lower frequency, however, several positive attributes are carefully used.
Selected Concordance Lines of MOTHER TONGUE.
Table 13 reveals the trend that mother tongue is encouraged at “early” stage of education for students to have a strong foundation in their own language as they “need confidence” to “value” their own language.
Selected Concordance of MOTHER TONGUE.
The concordance analysis of major languages of Pakistan revealed the presence of other Pakistani languages in the corpus, including Shina, Khowar, Burushaski, Wakhi, and Potohari. Concordance lines for the identified languages revealed that the discourse around these local languages only mentioned the regional linguistic profile, similar characteristics with Urdu and other local languages, and their use for television channels. These languages were entirely absent in discourse around education. This triangulates the finding from wordlist and collocation analysis that local languages are greatly ignored in language-in-education policy discourse.
The policymakers discussed English the most compared to other languages of Pakistan. The typicality of topics appeared in a hierarchy where English occupied the top place, followed by Urdu and mother tongues. English occurred in discourse of education, followed by Urdu while mother tongues only very marginally. English, Urdu, and mother tongues are positively attributed shown through concordance analysis. Together these findings reveal that English occupies the center place as being the most important language sharing some grounds with Urdu language in the discourse of the policymakers evident in the policies documents. The mother tongues have been marginalized in the educational domain. They are valued as a resource for the society, but no space is indicated for them in the education. English remains the central of policymakers’ discourses of education in the policy documents.
Discourse Historical Analysis
In this section, discourse historical analysis of the downsampled language-in-education policy documents is presented, highlighting ideologies of policymakers and their discursive strategies of argumentation, predication, and perspectivization under DHS principles in order to answer the research question through detailed document analysis. First, four Federal government documents are discussed, followed by three documents from provincial governments.
One of the downsampled policy documents
The downsampled
The downsampled
The
The Federal government’s
Two of the downsampled provincial government policy documents appear expediency-oriented policies (Wiley, 2013).
Similarly, the
It is doubtless that the ideologies present in the policy documents have significant impact on the stakeholders. It is the actors and stakeholders that make the implementation of any policy successful. Therefore, all the actors identified by Zhao and Baldauf (2012) working at the macro, meso and micro levels (Ricento & Hornberger, 1996) contribute to the implementation of the ideologies embedded in the language-in-education policies.
Conclusion and Implications
This study attempted to answer the research question focusing on the ideologies related to languages through the discourse language-in-education policy documents of Pakistan. The findings revealed that English dominated the ideological space of discourse existing in the language-in-education policy texts explored through typicality, significance and attribution techniques of corpus analysis. The discourse around Urdu and mother tongues had positive prosody generally. Whereas Urdu was associated with educational discourse, mother tongues were completely marginalized from language-in-education related discourse resulting in complete absence from ideological orientation of considering them important for education.
The detailed DHS discursive analysis showed that English, Urdu and mother tongues were discussed in positive light. English and Urdu had their own importance in education. It also indicated agreement on the use of mother tongue or local language in education by the provinces, however, there is variation in the approach of different provincial governments. The analysis confirmed that over all the policies had underlying monoglossic ideology evident from their early-exit transition implementation plans for local languages. The policies were ideologically tolerance-oriented or expediency-oriented for local languages. This means that either no government resources were allocated for development of languages to be used in education or the role of local languages is constrained for first few years of education. These ideologies do not seem to promote the linguistic diversity of the country.
These findings are significant for language policy actors or “people” at macro, meso and micro levels. The people-with-power need to balance their ideological orientations for international, national and local languages as all these languages perform significant roles in lives of people. It has been established the ideologies enacted through policies are given huge importance by schools and those ideologies are translated through schools. People-with-expertise should work for corpus planning of several local languages so that they can be ready for educational domain. People-with-influence such as different NGOs working for local languages should understand that local languages have been largely ignored in the educational space, therefore, they should coordinate with the policymakers and inform provide guidelines for effectively including local languages in education. People-with-interest including communities can benefit from these findings and create pressure from the bottom through building confidence, pride and realization among the local language speakers. More specifically, these findings have implications for the policymakers both in the Federal government and the regional or provincial governments of Pakistan. First, since the Federal government is consistently urging the provinces for uniformity, there should be similarities of discourses and ideologies about the languages across the government documents from these layers of government. The provincial governments’ discourses need some consistent efforts to present the linguistic profile of each province and seek support from all corners given their autonomous powers after 18th Constitutional Amendment. Furthermore, the Federal government should recognize and allow discourse space to more major and minor local languages and provide guidelines to the provinces for the development and projection of the local languages. This study was limited to the analysis of official documents only. Future research may focus on the discourse in the media, both electronic and print, to explore the discourse and ideologies around languages in Pakistan.
Footnotes
Appendix
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
