Abstract
Purpose
Accounts for failures of merit pay in educational settings were stagnated by the “political obstruction hypothesis,” which blamed teacher unions for impeding the merit pay schemes from proper function. It required stronger evidence from both public and private schools to refute this hypothesis.
Design/Approach/Methods
Accountability typology was conceptualized to uncover the power relations underpinning merit pay schemes, and teachers’ motivational status was described from the perspective of Self-Determination Theory (SDT) to analyze the corresponding consequences. One public school and three private schools were purposefully sampled, and data included principal interviews, teacher interviews, school documents, and relevant policy documents.
Findings
Our findings suggested an opposite conclusion to the “political obstruction hypothesis” that the more dependent on market, the more intense the conflict between external accountabilities and the nature of teaching. Thus, we supported the “nature of teaching hypothesis” that merit pay was inapplicable in educational settings due to the inherent contradictions between market accountability and professional accountability.
Originality/Value
Policymakers and school administrators had been misled by the neoliberal initiative of borrowing merit pay into educational settings to promote the quality of teaching. Our findings provided strong evidence to refute this hypothesis and to restore the appreciation of professional autonomy.
Introduction
Merit pay as a compensation increase based on measurement of one's “performance” has been adopted in public administrations in 80% of the Organisation for Economic Co-operation and Development (OECD) countries (Dielmann, 2012; Lah & Perry, 2008; Ruffini et al., 2020). China also gradually embraced merit pay in educational policies to break the egalitarianism in teachers’ renumeration along with the economic reform and opening-up. In 1985, the “structural salary” schedule was rolled out by the government to connect teachers’ renumeration with differed actual workload. In 1993, a professional career ladder was introduced in the “Hierarchical Salary of Professional Technical Post” policy linking teacher salary with professional title appraisal in 1993 (He, 2019). A deepening merit pay reform in compulsory education was later launched in 2009 to compensate teachers based on the results of the performance appraisal and motivate them with pecuniary incentives (Ministry of Education of China [MOE], 2008). These reforms were expected to improve teaching practices, encourage cooperation, and attract talents to the profession (Fan & Fu, 2011).
Since 2009, however, researchers have found relentless complaints among teachers about merit pay. It was criticized for unfair assessment criteria for non-exam subjects (e.g., music, PE), preferential biases toward administrative positions, and the overemphasis on students’ academic performance (X. Li, 2010; K. Li, 2013; Liu, 2012; Liu et al., 2016; Wang et al., 2014; Xiong, 2009; Zhang, 2010). Meanwhile, evidence about its impacts on teacher development was at best inconclusive, if not negative. While some case studies found teacher development possible under merit pay reforms (e.g., Wang, 2007; Zhang & Ng, 2017), the success could hardly be attributed to the pecuniary incentives or the summative performance appraisal, but rather the promotion of collaborative culture or the developmental use of appraisal results. More frequently, researchers found negative impacts of performance-contingent incentives, including worsened test-oriented climate, wide use of the appraisal results for punitive instead of developmental purposes, overreliance on quantifiable indicators at the cost of a qualitative appraisal, and accentuation on individual incentives while overlooking collective stimulation (Fan & Fu, 2011; Meng & Yuan, 2014). The Chinese literature echoes international scholarship, portraying merit pay as a policy failure (Ballou & Podgursky, 1993; Gius, 2013; Goldhaber et al., 2008; Yuan et al., 2013).
There are roughly two schools explaining the failure of merit pay in public schools: the “nature of teaching” hypothesis and the “political obstruction” hypothesis (Goldhaber et al., 2008). The different explanations vary significantly in their practical implicatures: The “nature of teaching” hypothesis rejects merit pay in all educational settings, while the “political obstruction” hypothesis defends merit pay even in public schools with failed examples. However, the extant literature lacks further evidence to bridge these two contesting arguments, leaving the debate stagnated with wavering policy directions. This study attempted to bridge the empirical gap by comparing the practices of merit pay in public and private schools with the analytical framework of accountability typology, trying to figure out what the most essential reason for the failures of teachers’ merit pay is, and how it manifests itself in terms of multiple accountabilities.
Accounts for failures of merit pay
Merit pay may be dysfunction when it lacks a democratic foundation or measurement accuracy. For instance, employees may become discouraged, less productive, or even querulous when they perceive the performance measured by the manager as underrated due to subjective factors (Dessler, 2011), or when individuals are held accountable for performance that depends more on teamwork or other external factors that are beyond one's control (Noe et al., 2011). Yet, these are at most critiques of technical failures that simply call for improvement to the process and instruments.
More essential defect of merit pay lies in its narrow focus on observable behavioral index over unobservable aspects due to its methodological limits. Even when merit pay schemes are implemented as intended to orient employees’ behavior toward prescribed objectives, the narrow focus on observable aspects over unobservable aspects may still decline the overall quality of work, especially when the unobservable aspects are essential to the success of the task (Al-Ubaydli & Lee, 2012). The emphasis on accountability and incentives may also discourage long-term professional development while attaching stakes to the emergent performance goals (Gailmard & Patty, 2007; Hood, 1991).
In addition to the methodological defect, merit pay could also be problematic in public sectors due to the motivational misalignment. Since employees in public sectors are often motivated by intrinsic incentives rather than extrinsic incentives (Cacioppe & Mock, 1984; Crewson, 1997; Frey & Osterloh, 2013), the introduction of and emphasis on behavioral objects and pecuniary incentives may work as “substitutes” rather than “complements” for employees’ pro-social motivations when the treatment indicates “bad news” about the manager, triggers “moral disengagement,” or evokes “control aversion” in the targets, thus causing a “crowding-out” effect (Benabou & Tirole, 2003; Bowles & Polania-Reyes, 2012; Falk & Kosfeld, 2006; Frey, 1997; Frey & Osterloh, 2002; Lidenberg, 2001; Weibel et al., 2007, 2010).
Accounts for failures of merit pay in educational settings are also grounded in the aforementioned defects: the “political obstruction” hypothesis attributes the failure of merit pay to external obstruction from teacher unions, which prevents the schemes from proper functioning (Ballou, 2001), and the “nature of teaching” hypothesis attributes the failure of merit pay to the inherent methodological defects and the unintended “crowding-out” effect brought by motivational misalignment (Murnane & Cohen, 1986).
Researchers holding the political hypothesis argued that merit pay schemes were far more favorable among private school teachers than their public counterparts (Ballou & Podgursky, 1993). They interpretated such difference as that private schools differed from public schools in their management style (more flexible compensation system), political context (less engaged with teacher union), and dependence on market environment (more sensitive to market demands), rendering them in conditions closer to the private sector and more suitable for merit pay schemes (Ballou, 2001; Ballou & Podgursky, 1993). They tended to blame the technical failures on teachers’ obstructions, but avoid further examination of the methodological defects and motivational misalignment.
Researchers advocating the “nature of teaching” hypothesis insisted on teaching as a collective endeavor, depending on professional collaboration for sustained development (Paine & Fang, 2006; Paine & Ma, 1993). They perceived merit pay as flawed by methodological defects because student outcomes, the common-used performance indicator in merit pay appraisal, were the results of multiple interactive influences from family, school, and the larger social environment, far beyond the control of single teachers (Pham et al., 2021), and much of teachers’ daily effort that was critical for improving students’ well-being was invisible to school management and hard to be quantified (Hannaway, 1992; Warren & Ward, 2019).
Furthermore, although the crowding-in effect exists in theory when incentives align with teachers’ altruistic motivation and adhere to self-interested motivation, the crowding-out effect on intrinsic motivation was often more prominent, especially when teachers’ work was alienated under test-oriented or market-oriented climate (Bowles & Polania-Reyes, 2012; Weibel et al., 2010). Teachers’ preference may even change endogenously under the influence of merit pay to be lacking enthusiasm and recognition of educational values, and become more likely to work for better pay rather than the public good (Gius, 2013).
Teachers’ merit pay reform in China was initiated in 2009, intending to attract quality new teachers and facilitate professional in-service development with pecuniary incentives. Yet, researchers have found that regional gap was even enlarged due to the localized fiscal sources for merit pay, and teachers were hardly motivated due to the problematic performance appraisal (Fu & Zeng, 2010; Tang & Hu, 2013). Existent domestic literature mainly attributed the failure of merit pay reform to the defects of fiscal system and distortion of policy design at school levels, lacking reflections on the fundamental hypothesis concerning the relation between pecuniary incentives and teacher motivation (Fu, 2013; Lai, 2013; Zhao & Liu, 2017). Empirical evidence on teacher supply has confirmed that influences of pecuniary incentives would be insignificant when teachers were satisfied with working conditions, implying the inadaptability of merit pay under certain circumstances (Huang et al., 2021). Yet, further studies are needed since current debate has not touched on fundamental hypothesis with empirical data.
Accountability typology of teachers’ merit pay
Essentially, merit pay holds teachers accountable for their “merit” or “performance.” As a form of accountability institution, its values and policy goals vary by different types of accountabilities. Generally, researchers have identified three types of accountabilities relevant to teachers’ merit pay: (1) administrative accountability, or described as organizational and bureaucratic accountability, that sets prescribed goals and procedures by top-down administrative power; (2) professional accountability that expects teachers to uphold professional norms and standards defined by the professional community; (3) market accountability that demands desired services including response to complaints and compliance with consumers’ choice (Darling-Hammond, 2004; Williams & Engel, 2012).
Each type of accountability has correspondent indicators common in teachers’ work: administrative accountability often inspects teachers’ attendance, compliance, workload, and post-duty conducted by administrators; professional accountability embodies professional requirements via teaching contests, class observation, lesson study, and recognition of traditional qualifications as educational attainment, work experience (seniority), and academic publications; market accountability underpins the wide adoption of questionnaire to include parents and students as teacher appraisers, as well as various procedures to take care of parents’ disputes with teachers or individualized appeals (Pang et al., 2019; Wang, 2005; Zhao et al., 2011).
The three accountabilities may influence teacher motivation in different orientations. Merit pay geared toward professional accountability has the potential to advance teacher professional development and encourage intrinsic motivation with incentives working as “complements.” Yet, when teachers are pressured by merit pay plans orienting toward market and administrative purposes, non-educational purposes may function as “substitutes” for pedagogical principles and result in crowding-out effect. Therefore, the accountability typology can be utilized to explain the varying consequences of different merit pay schemes by detailed examination of the relationship between merit pay structure and teachers’ agency.
Self-determined theory and taxonomy of motivation
Although evidence has shown that students of teachers under merit pay are higher in test scores (Cooper & Cohn, 1997; Dee & Keys, 2004; Woessmann, 2011), it is inadequate to judge the effects of teachers’ merit pay since the quality of teaching is beyond mere cognitive development. Meta-analysis of merit pay schemes has found that there was little change in teachers’ instruction, number of working hours, job stress, or collegiality under merit pay schemes (Yuan et al., 2013). To better elaborate the delicate differences in teachers’ motivation, a taxonomy of motivation is borrowed from the self-determination theory (SDT; Ryan & Deci, 2000).
SDT distinguishes different types of motivation based on different reasons that drives action and argues that the motivational types matter in eliciting and sustaining people's efforts (Deci & Ryan, 1985). Basically, motivation can be divided into two categories: intrinsic motivation which rewards people with inherent satisfactions in the task, and extrinsic motivation which relies on separable outcome to encourage efforts. Yet, in actual practice, people's motivational status may shift on a spectrum of internalization where there are several transitional stages between the typical extrinsic and intrinsic motivation as shown in Figure 1 (Ryan & Deci, 2000).

The taxonomy of motivation.
The strength of motivation starts from “amotivation” as not motivated to “intrinsic motivation” as strongly and continually motivated, while “extrinsic motivation” contains several subcategories including “external regulation,” “introjection,” “identification,” and “integration.” The perceived locus of causality, namely the attributional source of drive, shifts from “external” at the “external regulation” stage to “internal” at the “integration” stage. For instance, a teacher at the “identification” stage may recognize the prescribed goals of test scores, which are extrinsic to teaching practices, because she believes the test-oriented strategy is helpful in maximizing students’ chances into good colleges, which reflects somewhat internal locus of causality from her endorsement of goals.
It is noteworthy that teachers’ motivation is likely to move either rightward or leftward as reactions to the dynamics in practice, so the impacts of merit pay on teachers’ motivation can be shown either in the status or in the change of motivation. Empowered by the SDT, the analysis can trace the status and change of motivation back to different accountabilities and line up the data of public and private schools in a comparable framework.
Research setting and data
So far, the debate between the “political obstruction” hypothesis and “nature of teaching” hypothesis is stagnated because the studies were conducted in either public schools or private schools separately and under different frameworks, lacking a bridging perspective to include comparable examples from both public and private schools (Ballou, 2001). In light of this gap, China offers a perfect scenario for this comparison. Its government has been promoting merit pay in public schools since 2009. The country features a strong public school system as well as a booming private educational sector. Public school teachers are members of the National Federation of Trade Unions without a profession-specific union. Therefore, teachers’ union exists only nominally with minimal political power to influence teachers’ work conditions (Gao et al., 2007; Xu & Yang, 2008). Embedded in substantial market influence and marginal union effect, China provides an ideal setting to test the two hypotheses under debate.
Teachers’ remuneration in public schools in China is constituted by four parts since the merit pay reform: post-pay, level pay, merit pay, and subsidies, among which post-pay, level pay, and subsidies constitute the basic salary depending on teachers’ professional title, while merit pay is meant to be the incentive salary depending on teachers’ performance (Pang et al., 2019). Usually, performance appraisal plan that determines teachers’ merit pay is deconcentrated to schools while local governments retain the power to intervene. Thus, school documents will provide detailed data about merit pay plan in public schools. On the contrary, merit pay in private schools is less subject to government policy with greater autonomy. The structure of teachers’ remuneration in private schools may vary significantly depending on school context, using multiple synonyms for merit pay. Thus, investigation of merit pay in private schools shall be anchored in data and informed by the nature of merit pay rather than a specific pattern.
The study was carried out in Shenzhen, Guangdong Province, one of the most developed cities and a pioneer of education reforms in China. Since the exploratory interviews with principals suggested that teachers’ renumeration is largely of identical structure among public schools while private school teachers’ renumeration is highly dependent on school management and school's orientation as for-profit or non-profit, four schools, one public, and three privates, were purposively sampled for comparison. School D was added as a case for the for-profit category since the principal of School C declined further interviews with teachers. Since High School Entrance Examinations (HSEE,
Data were collected through interviews with principals and teachers as well as from school policy documents. Since teachers of different subjects and different working experience may experience different pressures under merit pay, sampling of teacher participants purposefully included teachers of academic and non-academic subjects and different working generations in each case school. Table 1 reports the school sample and data forms, and Table 2 reports the background of teacher participants. All the interview participants were informed of their right to keep personal information confidential and withdraw from the research at any time. A written consent form was signed by interview participants before data collection, and a copy of interview transcript was proofread by interview participants afterward.
Case schools and data forms.
Background of teacher participants.
Data analysis adopted deductive interpretation, which is semi-structured by the accountability typology and the motivation taxonomy. The semi-structured analysis is composed of three key aspects: primary stakeholder, power relation, and practical impacts. The three aspects of analysis are neither clear-cut nor mutually exclusive but closely interlinked to portray the same object from different angles.
First, the analysis of primary beneficiary intends to figure out whose interest is mostly privileged behind a particular arrangement. It examines the purpose behind the selection of indicators, implementation of measuring activities, and application of results. In this way, merit pay schemes are dissected into relatively independent subsystems featured with a particular type of accountability.
Second, the analysis of power relation explores the interactions between different accountabilities by looking into the relations between their intended objectives and the strength of their influence on final decisions. It intends to detect the potential conflicts between different accountabilities and how they negotiate about the disputes.
Last, the analysis of practical impacts evaluates mainly the motivating effect of each type of accountability, identifying teachers’ motivational status and change with regards to influences of different accountabilities according to SDT (Ryan & Deci, 2000). SDT distinguishes different types of motivation based on different reasons that drives action and argues that the motivational types matter in eliciting and sustaining people's efforts (Deci & Ryan, 1998). It distinguishes motivational status through a spectrum of internalization where there are several transitional stages from least intrinsically driven to mostly intrinsically driven, including motivation, external regulation, introjection, identification, integration, and intrinsic motivation (Ryan & Deci, 2000). In addition to direct self-report motivational status, teachers’ positive or negative comments also give clues about their attitude toward different accountabilities and the extent of their internalization.
In practice, teacher appraisal measures can be highly sensitive to and dependent on context, and the nominally identical practices may turn out to be completely different in accountability goals. For instance, “class observation,” a practice commonly seen in both public and private schools, can be either administrative, professional, or market-oriented depending on “who observes,” “for what purpose,” and “in what way.” When dominated by administrative accountability, class observation is usually conducted by school administrators with quantitative rubrics to score teachers’ performance as evidence to inform personnel management decisions, leaving little space for professional dialogue with teachers about development. On the contrary, class observation led by professional accountability, although may engage administrators, is mainly conducted by colleagues to encourage exchange and mutual learning in forms of research project, lesson study, or professional dialogue, during which teachers make their own roles and seek recognition from the professional learning community. In other circumstances like “school open day” in private schools, “class observation” can also be oriented by market accountability since the original intention of opening class to parents is to attract students in the competitive market by making a good impression on parents and allowing them to judge teachers for their own good. Thus, data analysis needs to go beyond the surface practices to detect the meaning of specific merit pay policy by teasing out the primary beneficiary, power relations, and practical impact behind similar merit pay regulations.
Public school: Autonomous teachers under professional-led accountability system
School A as a public school, its merit pay scheme was largely subject to government policies with limited autonomy for school-based adaptions. About 70% of public-school teachers’ renumeration was made up of scheduled “national pay” and “subsidy,” which was solely dependent on teachers’ “professional title” and “working experience” (Bureau of Education Shenzhen, 2015; Pang et al., 2019). The total of post-salary and level salary was also named as “basic merit pay” since the 2009 merit pay reform, which was covered by fiscal appropriation allocated by district and municipal government.
Among the determinants of the “basic merit pay,” the professional title appraisal was the most decisive one. It ranked teachers into five leveled titles starting from “clerk,” “assistant,” “associate,” “advanced” to “senior.” Since the integration reform of appraisal and employment in 2015, the professional title appraisal lower than “advanced” level (included) has been authorized for public schools (Bureau of Human Resource and Social Security Shenzhen, 2015). Triennially, public schools shall convene a “Professional Title Appraisal Committee” consisting of holders of higher titles as the authoritative gatekeeper to evaluate candidates and determine the results via ballot. The appraisal of titles under “advanced” level (excluded) was mainly based on application materials, while candidates for “advanced” level need an extra oral defense.
The quantitative criteria were set up by the provincial government in
In forms of a career ladder, the professional title appraisal was basically in accordance with the professional accountability for its committee-based organization and the profession-oriented design of rubrics. When asked about comments on the professional title appraisal, teachers were basically content with it for the abundant opportunities for professional growth and the effectiveness of sorting out excellent teachers.
As shown in the excerpts, Betty appreciated the professional title appraisal for facilitating teachers’ professional growth, and Lara recognized it for the autonomous space retained for teachers. It could be inferred from the positive comments that teachers in School A had internalized the professional codes to a great extent and mainly relied on intrinsic motivations to take the opportunities for professional growth or stick to their own beliefs as empowered by the autonomous space.
The rest 30% of public-school teachers’ renumeration was named as “incentive merit pay,” and its distribution was determined by school-based performance appraisal. As shown in Table 2, the performance appraisal measured teachers’ teaching workload, administrative post, attendance, and related awards according to School A's “Distribution Plan of the Incentive Merit Pay.” The measurement converted teachers’ work into abstract commeasurable units counted as multiple Xs, which usually equaled to 100 yuan and adjusted with the total of “incentive merit pay” (Table 3).
The distribution plan of incentive merit pay in School A.
In the conversion of teachers’ workload, the performance appraisal encouraged test preparation by attaching extra coefficient to academic subjects, especially the graduating grade, and including morning reading and tutoring class as standard lesson hour. While it sounded reasonable that teachers of academic subjects have extra workload from correcting students’ assignment and test-oriented training, teachers of non-academic subjects still felt mistreated and tended to withdraw efforts at work secretly to reach psychological balance.
As indicated by Betty, School A's performance appraisal failed to reconcile the dispute among teachers and inevitably crippled teachers’ willingness to commit themselves. For teachers who found the stipulation unfair, they tended to view it as a compromise for retaining in School A and would withdraw part of their efforts as tacit resistance until they reached a psychological balance.
The conflict embedded in the performance appraisal indicated that the dominating stance was to maximize the gain in test scores as demanded by parents who represented the market accountability and to facilitate surveillance of teachers’ input efforts for the good of school administrators. While teachers’ full attendance brought them a monthly bonus of 800 yuan, two times of personal leaves or absence from school activities without proper excuse would result in punitive deduction in full. The surveillance of administrative accountability exerted “external regulation” upon teachers and drove their motivation toward the extrinsic direction.
As shared by Lisa, teachers were distracted by the merit pay and the performance appraisal from teaching to the debate about the measurement. Whatever the result, teachers had been induced to pay more attention to extrinsic stimulus rather than maintaining the intrinsic rewards in teaching itself, causing an interruption to the autonomous working environment laid by the professional title appraisal.
In sum, School A's accountability system was featured by professional accountability as the primary force to normalize teachers with professional codes, while the administrative accountability functioned as the secondary force to monitor teachers’ observable input efforts. Teachers’ motivational status in relation to professional accountability was largely intrinsically driven in an autonomous atmosphere, while the administrative accountability had caused particular externalization of teachers’ motivational status due to the controversies on merit pay. Market accountability was least influential since School A did not rely on tuition income.
Non-profit private school: Compliant teachers under administrative-balanced accountability system
As a non-profit private school, School B was founded by an overseas Chinese entrepreneur as a philanthropy. The tuition fee was kept as low as 6,500 yuan per semester when public school students enjoyed 20,000 yuan per capita funding and for-profit private schools charged more than 20,000 yuan. Therefore, teachers’ renumeration in School B was also limited, ranging from 5,000 to 10,000 yuan monthly in 2019, when public school teachers’ renumeration started from 11,420 yuan and the average in for-profit private schools was above 10,000 yuan.
Principal B had been working in the Institute of Educational Sciences in Shanghai for years before he came to School B in Shenzhen, hence borrowed many common strategies from public schools. He was also personally against merit pay, emphasizing that teaching was incomparable to industrial manufacture, and a considerable portion of teachers’ efforts was unquantifiable (Principal B, 3). Under Principal B's leadership, School B's merit pay scheme was mainly featured by administrative accountability balancing the tension between market accountability and professional accountability.
As borrowed from public school, the measurement of teachers’ workload in School B was also structured with a uniform base pay guaranteed by a minimum requirement of lessons per week and extra pay for extra lesson hours exceeding the minimum requirement. However, teachers were not motivated as expected because of the meager income. To help promote teachers’ income within the given structure and resources, Principal B lowered the requirement of minimum lesson hours from 10 to 5 per week. Thus, a larger proportion of work was counted as “extra,” and teachers’ income got promoted without changing the original salary structure or aggravating teachers’ burden. While the direct beneficiaries seemed to be teachers, the nature of extra pay for extra workload was administrative accountability in essence since such adaption involved neither professional requirements nor inclusion of teachers in the decision-making process, and the primary concern was, in fact, to retain teachers from job-hopping with little implications for professional growth.
Excluded from the professional title system as a private school, School B lacked systemic professional accountability to encourage in-service professional development. To make up for the loss, Principal B borrowed a few strategies from public schools including subsidies scheduled by teachers’ working experience and professional title, school-based teaching contests, lesson studies, class observations, personal yearly development planning, training of research methods, etc. With hopes to better motivate teachers, pecuniary incentives were attached to the measurement of teachers’ performance related to professional activities.
In terms of gains in test scores, Principal B's effort had paid off with the passing rate of College Entrance Examinations raised from 27.4% to 99.51% in the past eight years. Yet, in terms of teachers’ motivational status, Principal B might only be getting half the result with twice the effort.
As shared by Jerry, teachers in School B tended to be indifferent to the meager bonus and perceived the professional activities as external administrative tasks, indicating a limited extent of internalization and basic level of motivational status. Although Jerry was still intrinsically motivated by his enthusiasm for education, this drive was independent from the merit pay scheme, while the merit pay scheme was actually diminishing it rather than maintaining it. Such findings echoed with the crowding-out effect that people accustomed to intrinsic motives may experience decline in motivation when the incentives were changed from intrinsic to extrinsic. In other words, the effect of merit pay actually counteracted the contributions of Principal B's professional leadership to a certain extent given the limited financial resources, resulting in the alienation of professional activities toward administrative tasks as perceived by teachers.
In the meanwhile, despite Principal B's personal disapproval of merit pay, School B still relied on pecuniary incentives to maintain administrative control and to meet market demands.
As shown in Table 4, School B converted teachers’ performance in “teaching routines,” “professional activities,” and “teaching quality” into several ranks attached with corresponding bonuses. Among these categories, “teaching routines” represented the stance of administrative accountability by regularly checking teachers’ lesson plans and feedbacks on students’ assignments, monitoring teachers’ class management by administrators patrolling around the teaching buildings and recording the class discipline, and demanding for full attendance and absolute compliance with temporary work arrangements such as invigilation and proposition of test papers. “Professional activities” were originally designed to complement the professional accountability but ended up as auxiliary to administrative accountability due to the crowding-out effect in a limited financial condition, shifting the motivational force from intrinsic pursuit of pedagogic values to compliance with administrative regulations. “Teaching quality” was measured based on students’ test scores in every mid-term and final examination, responding to parents’ fervent demand of progress in academic performance. As a private school with heavy dependence on tuition income, School B had to respond to market accountability as much as it could.
The merit pay scheme in School B.
In addition to the bonus included in the ranking system, School B actually retained another lump-sum bonus to reward teachers whose students outperformed the others in the College Entrance Examinations. With an average of 4,000 yuan, subject teachers whose students scored higher than the municipal average got an extra coefficient of 1.2 or 1.3, while those whose students scored lower than municipal average suffered from a deduction of about 0.2 or 0.3 coefficient (Principal B, 6–7). Numbers of students getting into first-tier universities brought teachers extra coefficient mediated by their backgrounds. One admission to first-tier universities from key classes brought an extra coefficient of 1, while one admission to first-tier universities from average classes brought an extra coefficient of 1.6 (Principal B, 6–7). Thus, the highest amount of the lump-sum bonus for CEE could reach as high as 20,000 yuan according to past cases.
In sum, School B's merit pay scheme was centered on administrative accountability, which proactively integrated professional norms to maintain teachers’ professional ethics, and compromised to market accountability to hold teachers accountable to students’ test scores. With the different accountabilities balanced within the administrative structure, teachers followed Principal B's leadership in teaching and research with basic-level intrinsic motivations. Meanwhile, the crowding-out effect of pecuniary incentives also indicated that it was the “nature of teaching” that rejected the ideology of merit pay.
For-profit private school: Struggling teachers under market-dominated accountability system
As part of a larger education group, School C has abundant financial resources. Teachers’ monthly renumeration in School C ranged from 10,000 to 15,000 yuan, which was much higher than that in non-profit private schools like School B. Principal C appreciated the rationale and motivating function of merit pay, so he designed a typical merit pay system named “post accountability and goal management system” (PAGMS). It was featured with the hierarchical objective structure and the rated bonus based on the extent of fulfillment. At the beginning of each academic year, teachers were supposed to set up objectives in every aspect of evaluated performance, which were subordinative to the objectives of their superior administrators. About 60% fulfillment of objectives equaled 80% bonus, and the upper limit was 120% bonus, which required 100% fulfillment. The average bonus could reach as high as 120,000 yuan for an academic year.
The indicators of PAGMS were mainly outcome-based and at the service of market accountability. For instance, objectives of teaching outcome were defined by four indicators: (1) students’ mean scores in standardized tests, (2) the ratio of high achievers versus underachievers, (3) students’ awards in various competitions, and (4) the final outcome of the College Entrance Examinations. Moreover, the PAGMS imposed the indicator of “student stability” on teachers’ work, holding teachers accountable for the rate of students retaining at School C. It explicitly revealed the dominance of market accountability, which treated teachers more like marketing personnel rather than professionals.
Unlike School C's goal management system, School D responded to market accountability by including parents in school management and teacher appraisal. Parents of School D were constantly leading the flag-raising ceremony, parents’ academy, festival celebrations, mixed-age family routines, and the basketball club management. Moreover, they can observe teachers’ teaching directly in classrooms, check teachers’ work records on a digital platform, and debrief teachers about their teaching performance in one-to-one consultative meetings.
While teachers in School D were closely monitored by parents, their work was labor intensive. School D de-constructed teachers’ work into a few independent posts (like family tutor, mental health teacher, head teacher, group head, subject head, and faculty head), with packaged salary ranging from 2,000 to 6,000 yuan for each, making it insufficient to make a living by only taking one post but considerably attractive to take two or more posts concurrently. Although it was also common in public schools like School A that teachers may take additional administrative posts, the income gap brought by administrative workload ranged only from 200 to 1,800 yuan in School A with a minimum income as high as 11,420 yuan, while the gap ranged from 2,000 to 6,000 yuan in School D with a minimum income as low as 6,000 yuan. As a result, teachers in School D had to take two or more posts concurrently to reach a satisfactory income of around 10,000 yuan.
Besides the monthly salary piled up by intensive work, School D also adopted an annual bonus with delayed payment to retain teachers. Teachers had to retain at School D until next semester to get the annual bonus for the last academic year, otherwise, they would lose this bonus if they quit the job voluntarily. The amount of annual bonus was ranked into three tiers: 10,000, 15,000, and 20,000 yuan. Yet, its determination was controlled by administrators in a black box, which excluded teachers and lacked explicit standards for the ranking. Teachers had to vote by feet if they were not going to deal with it, and so did they. Many teachers had already left for another school or another occupation as reported by interviewees, and School D had to keep recruiting more new teachers to maintain its faculty. Such instability of teaching team might be attributed to the anti-democratic administrative accountability, as well as the irreconcilable conflicts between professional ethics and market accountability.
As reported by Selina, parents’ demand for individualized service was in essence self-interested preferential treatment rather than pedagogic concern to teach students according to their aptitude. Vulnerable to parents’ meddling demands, teachers were inevitably frustrated by the irreconcilable conflict between market and professional accountability.
To sum up, the accountability structure of for-profit private schools was featured with collusion between market accountability and administrative accountability. Specified by the case of School D, market accountability might constantly interrupt teachers’ work since parents’ demands were mainly self-interested. Lacking supportive force, teachers in for-profit schools were vulnerable to external requirements from market and administrative accountabilities and were struggling among the irreconcilable conflicts between market accountability and their own pedagogic concerns. Under such accountability system, teachers’ motivational status was least internalized with inclination to leave for a better school or occupation. The variations of merit pay forms between School C and School D also indicated that it was the market-oriented accountability structure that matters rather than the technical structure of merit pay.
Conclusion and discussion
In this study, we fully recognized the variations of merit pay schemes and attempted to capture the nuances among public, non-profit private, and for-profit private schools by analyzing the accountability structure. This analytical lens made comparison between different school contexts possible and meaningful and helped to expose the power relation behind the trivial measurements. We also chose teachers’ motivational status instead of students’ academic progress to depict the impacts of merit pay schemes on the process and conditions of teachers’ work, not merely the results. Thus, teachers’ experience of potential conflicts became visible and their voice received attention that they deserved. Yet, the limitation of this study was also apparent that it failed to include more direct viewpoints and experiences from parents and school administrators or observation of interactional situations to enable a thicker description. Despite that, the findings still generated profound implications about the ongoing debate between the “nature of teaching” hypothesis and the “political obstruction” hypothesis.
To the contrary of the “political obstruction” hypothesis, our findings suggested that the more dependent on the market, the more intense the conflict between external accountabilities and teachers’ internalized professional concern. The struggles of teachers in School D indicated that they were aware of the interference brought by the market accountability but were still half-way aware of the essence of market-oriented “teaching subject” (Ball & Olmedo, 2013). A similar conflict was also embedded in the discriminative measurement of workload in School A, but the discouraging effect was largely counteracted thanks to the dominant professional accountability. The crowding-out effect of pecuniary incentives in School B also supported the “nature of teaching” hypothesis that merit pay would probably damage teachers’ intrinsic motivation due to their inherent incompatibility, despite its contribution to extrinsic motivation. Thus, our findings suggested timely pause of merit pay reform in public schools and hopefully a new reform based on the nature of teaching.
Footnotes
Contributorship
Hejia Shi contributed to the data collection and the first daft of this article. Dan Wang contributed to the revision of theoretic framework and was closely engaged in data analysis and presentation.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethical statement
A written consent form was signed by interview participants before data collection and a copy of interview transcript was proofread by interview participants afterward. The study was approved by the Human Research Ethics Committee of the University of Hong Kong (reference number: EA1811027).
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This article is based on the corresponding author's doctoral study financed by Post-Graduate Scholarship of the University of Hong Kong (HKU).
