Abstract
Low-income students and students of color are faced with pervasively lower levels of opportunity to learn compared with their peers, creating unequal opportunities for educational success. Textbooks, which serve as the backbone of the curriculum in most mathematics classrooms, present a potentially powerful tool to help mitigate unequal opportunity to learn across students. Using the Surveys of Enacted Curriculum framework, we investigate the content of commonly used eighth-grade math textbooks in California and the extent to which they align with the Common Core State Standards. We also explore the relationship between the variation in content coverage and alignment and student characteristics. We find poor alignment between the textbooks in our sample and the Common Core State Standards and low overall levels of cognitive demand, but only limited evidence of systematic differences in alignment or cognitive demand coverage associated with student characteristics at the school or district level.
Keywords
In 2004, the
A growing body of research indicates that textbooks may be particularly powerful tools in the effort to equalize OTL. The vast majority of teachers use textbooks in most of their lessons, making them the backbone of the curriculum (Polikoff, 2018) and ubiquitous in a typical student’s schooling experience (Blazar et al., 2020). Research suggests that textbook choice may be important for student achievement, with several studies indicating that the choice of textbook can affect standardized test scores in mathematics (Agodini et al., 2010; Bhatt et al., 2013; Bhatt & Koedel, 2012; Koedel et al., 2017), though a recent large study found no such effects (see Blazar et al., 2020). In the context of the highly decentralized textbook market in the United States, where states, districts, schools, and even individual teachers may be involved in choosing from among dozens of possible materials (Polikoff, 2021), textbook choice may thus present policy makers with a potentially powerful tool to help equalize learning opportunities and improve education outcomes for disadvantaged students.
As part of a broader effort to improve teaching and prepare students for success in college and careers, California joined most other states in adopting the Common Core State Standards (CCSS) in 2010. Four years later, in 2014, the state also adopted a list of mathematics textbooks they approved as being aligned with the standards. While at certain times in California’s history these adopted textbook lists have been mandatory, they are now advisory—districts are free to adopt on or off the state-approved list. As we describe below, most California districts indeed adopt textbooks approved by the state as being aligned with the CCSS. While adoption by the state of California presumably signals some degree of alignment and quality, we know that textbooks often vary in their content (Polikoff, 2015). Thus, we wondered whether the problems highlighted in
In this article, we examine two dimensions considered to promote student OTL as they apply to textbooks: content alignment and cognitive demand. Using the Surveys of Enacted Curriculum (SEC) framework (Porter, 2002), we evaluate many popular eighth-grade math textbooks according to their alignment to the CCSS and the level of student expectation (cognitive demand) involved in the presentation of content. We then analyze the extent to which variation in alignment and cognitive demand are systematically related to the characteristics of students in schools and districts using those books. In short, we address the following research questions:
This analysis is one of the first to apply a rigorous content analysis method to a large number of textbooks and is the first to do so in the context of middle school math. The methods therefore have applicability in nations where there are multiple approved curriculum materials used in schools. It also answers important questions about students’ equitable exposure to aligned and cognitively demanding content through curriculum materials.
Review of Literature
The Origins of Opportunity to Learn
The notion of OTL has roots in international comparative studies, including the First International Mathematics Study (FIMS). Serving as a validity check, OTL questionnaires asked teachers whether they had covered each topic on the test to ensure that countries did not appear to be failing students when in fact certain topics simply had not yet been covered (Husen, 1967). While the questionnaires became more nuanced over the ensuing decades, bolstering the legitimacy of these international comparisons remained the purpose of OTL. Therefore, variation in OTL was nearly always analyzed between countries rather than within a single country (Floden, 2002). Contemporary international comparative studies like the Trends in International Mathematics and Science Study (TIMSS) and the Program for International Student Assessment (PISA) still use OTL in this way.
OTL in Research
Following the release of
The approach with the most sustained impact was pioneered by a team of researchers studying OTL in the context of high school mathematics (Gamoran et al., 1997). This research sought to explain between-class variation in student learning using survey measures of teachers’ instruction. They explored measures based on topic coverage only (e.g., multiplying fractions, solving two-step problems), cognitive demand coverage only (e.g., memorize facts, understand concepts), or the intersection of topic and cognitive demand. They found that, when defined as the intersection of topic and cognitive demand, OTL explained nearly all the between-class variation in achievement gains (in other words, that differences in OTL were driving differences in student learning between classes). This study led to the development of a suite of tools for measuring OTL (see Porter, 2002), which have since been used in numerous studies of teachers’ instruction (e.g., Polikoff, 2012a, 2012b), the content of various policy instruments like curriculum materials and content standards (e.g., Polikoff et al., 2015; Porter et al., 2011), and the associations of OTL with student learning (e.g., Polikoff & Porter, 2014).
OTL for Measuring Equity
OTL has been a particularly useful concept in research analyzing differential educational opportunities (e.g., Floden, 2002; Guiton & Oakes, 1995). Researchers reasoned that if OTL was predictive of student learning, then differences in OTL could be used to understand and contextualize oft-studied student performance gaps. OTL measures have found a foothold in studies focused on explaining between-group differences in performance. For instance, studies showed that low achievement among English learners (Callahan, 2005) and historically disadvantaged racial/ethnic groups (Schiller et al., 2010) could be explained by low OTL in these students’ classes, rather than by factors like race, gender, or English proficiency level. Traditionally underserved students’ learning experiences have often been characterized as offering weak OTL (e.g., see Cooper & Liou, 2018; Heafner & Fitchett, 2015; Santibañez & Fagioli, 2016; Tate, 2001). These types of OTL studies have provided evidence of educational inequities, such as those cited in the
Current Conceptions of OTL
Much OTL literature has conceived of curriculum in three tiers (e.g., see Martin et al., 1999; McDonnell, 1995; Mullis et al., 2000; Mullis et al., 2005; Törnroos, 2005). The “intended curriculum” refers to a system-wide official curriculum, typically a set of academic standards. The “enacted curriculum” refers to how teachers bring that content to life in their classroom. The “attained curriculum” refers to the understanding students actually gain during a lesson. Each tier influences, but does not dictate, the tier below it. Tarr et al. (2006) add the “written curriculum,” consisting of textbook content, as a useful fourth tier. The written curriculum is informed by the intended curriculum and in turn informs teachers’ enacted curriculum. While all tiers of curriculum influence the knowledge students attain, improving written curriculum may offer policy makers a particularly high-leverage reform tactic (e.g., Koedel & Polikoff, 2017), since teachers use and students interact with their textbooks directly (Chingos & Whitehurst, 2012). Therefore, we focus our inquiry on textbook content as a way to document and improve students’ OTL. It is also worthy to study variation in the enacted curriculum, but this is much more complex and difficult to carry out at any reasonable scale.
Textbook Variation and Implications for OTL
Despite the importance of written curriculum to OTL, not all textbooks used in schools appear to be equally effective in promoting student learning. In particular, a number of experimental and nonexperimental studies have shown that textbooks differ meaningfully in their effects on student achievement (e.g., Agodini et al., 2010; Bhatt et al., 2013; Koedel et al., 2017), although a recently published study of textbook effectiveness across six states failed to find a significant effect (Blazar et al., 2020). These possible differences in textbook effectiveness suggest textbook choice as a potentially important intervention aimed at improving written curriculum.
Furthermore, research suggests that textbooks may be the most central component of written curriculum, especially in mathematics. Blazar et al. (2020) surveyed almost 1,200 mathematics teachers across six states and found that over 90% used an official textbook in more than half their lessons, with three fourths using the official textbook in 75% of lessons or more. While very few math teachers report exclusive use of the textbook in their instruction (Blazar et al., 2020; Opfer et al., 2017), many seem to consider officially adopted textbooks to be fundamental to their curriculum (Polikoff, 2018). In studying textbook content, then, we study a major determinant of both teachers’ enacted curriculum and students’ attained curriculum. Moreover, while textbooks are not the only component of students’ OTL, the link between textbooks and student achievement, the central role of textbooks as a learning tool, and the high-leverage policy reform intervention that they present underscore the importance of analyzing their connection to OTL.
Systematic Variation in OTL
The
The
Recent research continues to emphasize persistently lower levels of OTL for disadvantaged students compared with their more advantaged counterparts (e.g., Abedi & Herman, 2010; Albano & Rodriguez, 2013; Barnard-Brak et al., 2018; Lafontaine et al., 2015; Ottmar et al., 2014; Santibañez & Fagioli, 2016; Wang, 2010). Lower levels of OTL are reported for English language learner (ELL; Abedi & Herman, 2010), special education (SPED; Kurz et al., 2014), female (Heafner & Fitchett, 2015), racial/ethnic minority (Heafner & Fitchett, 2015; Morton & Riegle-Crumb, 2020; Polikoff & Struthers, 2013), and low-income students (Heafner & Fitchett, 2015; Polikoff & Struthers, 2013), as well as students in urban schools (Polikoff & Struthers, 2013), relative to students not in these groups.
Research demonstrating variation in OTL based on student characteristics has relied on a variety of tools and methods to measure OTL. Heafner and Fitchett (2015), for example, rely on student survey data from the National Assessment of Educational Progress (NAEP), which includes items on instructional experiences and exposure to curriculum. Abedi and Herman (2010), draw from teacher and eighth-grade student surveys on Algebra I first semester topic coverage and an assessment of achievement in initial Algebra I content for ELL and non-ELL students. Kurz et al. (2014) study OTL among students with and without disabilities using data collected from self-reported daily logs of standards coverage, time allocated to each standard along five cognitive process expectations, instructional practices, and grouping formats (individual, small group, or whole class) from a small sample of teachers. Polikoff and Struthers (2013) use data the most similar to ours—teacher surveys of topic and cognitive demand based on the SEC. That these studies use such disparate methods but reach the same general conclusion bolsters the argument that there remain systematic disparities in OTL across student groups.
We further investigate systematic variation in OTL by focusing on differences in both content and rigor of textbooks across differences in school and district demographics. In doing so, we make three primary contributions to this field of literature. First, while many other studies of OTL rely on survey or observation data more limited in size, we take advantage of a statewide panel of schools. Our statewide data give us assurance that our findings are representative of the state (which is itself highly diverse). Second, the recent timing of our data allows for analysis of systematic variation in OTL specific to the post-CCSS implementation context, making the findings relevant at the present day, when the large majority of states still use the CCSS or a close facsimile (as of 2016, 36 states along with Washington, D.C. were still implementing the CCSS, for either all or a significant part of their curriculum, with another 11 states announcing major rewrites or replacements for the CCSS, and 4 states remaining that never adopted the CCSS; Ujifusa, 2016). Last, we analyze statewide demographic data in relation to detailed content analysis of textbooks, allowing us to measure OTL in terms of both content and rigor at a fine-grained level not possible in most other OTL studies.
The Current Study
To investigate the relationship between textbooks and OTL, we first describe variation in content coverage of standards and cognitive demand in twenty widely used eighth-grade mathematics (pre-Algebra and Algebra I) textbooks that were used in California schools in 2015–2016, when schools were transitioning to textbooks aligned with the adopted CCSS. We then analyze the extent to which student OTL, as characterized by our measures of textbook content and cognitive demand, is equitably distributed according to school- and district-level demographics.
Data
Textbook Selection
One obstacle to carrying out large-scale OTL studies of written curriculum is that most American states do not record information on school and district textbook usage. They may have a list of state-approved textbooks, but no way of telling which books are actually being used in which districts (Chingos & Whitehurst, 2012). As a result of the
Another obstacle to studying OTL through textbooks is that content analyzing these materials is time consuming and resource intensive. While we were unable to content analyze every textbook used by any California eighth grader, we selected and analyzed 20 textbooks that provided us the greatest coverage of California’s eighth-grade population: in 2015–2016, 46% of California eighth graders attended one of the 974 schools across 347 districts with at least one of our selected textbooks adopted. 2 Table 1 shows a full list of textbooks included in our sample, as well as the number of districts overall, uniform textbook adoption districts, 3 schools and students represented by each textbook.
Textbook Descriptions
School Characteristics
To obtain school demographic characteristics, we draw from the National Center for Education Statistics’ Common Core of Data (CCD), the Office of Civil Rights’ Civil Rights Data Collection (CRDC), and the California Department of Education for school year 2015–2016. We obtain the following characteristics at the school level from the CCD: total school enrollment, as well as enrollment by grade, gender, and race/ethnicity; number of students eligible for free or reduced-price lunch (FRL); urbanicity (city, suburban, town, or rural), alternative school status, and charter status. From the CRDC we obtain school counts of students classified as SPED or ELL. From the California Department of Education, we obtain mean scale scores on the California Assessment of Student Performance and Progress (CAASPP) standardized math test for students.
Methods
Research Question 1
To address our first research question focusing on the variation in alignment to content standards and cognitive demand in textbooks, we apply the SEC framework for mathematics to quantitatively code the content in the standards and textbooks. The SEC provides a taxonomy of mathematics content at the intersection of specific topics and levels of cognitive demand. The instrument has been in use for over two decades, undergoing developments along the way to reflect changes in content standards (see Porter, 2002, for a history; for other examples, see Polikoff, 2012a, 2012b; Porter et al., 2007), including those changes brought on by the adoption of the Common Core (see Porter et al. [2011] for an analysis of the Common Core using the SEC and Polikoff, Gasparian, et al. [2020] for recent work to update the SEC framework).
The SEC mathematics framework used in this study is the most recent version described by Polikoff, Gasparian, et al. (2020). It includes 228 topics (e.g., surface area, quadratic functions), that are grouped under 16 broader categories (e.g., measurement, basic Algebra). The SEC content language is intended to be inclusive of any content that might be covered in typical K–12 mathematics instruction. Orthogonal to the topics are six levels of cognitive demand, which are based on a modified version of Bloom’s taxonomy. Figure 1 shows an image from the SEC teacher survey that illustrates the topic-by-cognitive demand organization of the instrument (we refer to the intersection of a topic and a cognitive demand level as a “cell”). The SEC framework can be applied to analyze content standards, assessments, curriculum materials, and teachers’ instructional content. An outline of the SEC framework in mathematics, including a list of topics and cognitive demand levels, is provided in Supplemental Appendix A, available in the online version of this article.

Surveys of Enacted Curriculum (SEC) teacher survey.
We apply the SEC framework to the textbooks in our sample following the methods described in detail, with a worked example, in Polikoff (2015). The first step is “chunking” the textbooks, by which we mean breaking each textbook down into its finest-grained level of detail. The “chunks” in textbooks typically consist of sections of text (typically set off by headings) and problems for students to complete (including example problems, each problem treated individually). These are all equally weighted. We hired an experienced mathematics teacher who had familiarity with the SEC to go through the 20 books and “chunk” them, preparing the coding sheets for reviewers to use. In total the number of chunks in the 20 books ranged from 2,857 (
Once the books have been chunked, the task is to analyze each chunk in terms of its content and cognitive demand coverage. Content analysts examine each chunk and code it as covering up to six topic-by-cognitive demand combinations. 4 The weight for that chunk is then evenly divided among the SEC cells that were selected. Finally, the codes for all of the chunks in a book are aggregated and the results are turned into proportions. If multiple raters are used, the codes for the multiple raters are then averaged to arrive at the final content analysis, which takes the form of a matrix of proportions, one for each cell, indicating the percentage of the textbook’s content on that particular topic-by-cognitive demand combination. The procedure for analyzing content standards is identical. The “chunks” in content standards are objectives, which are again coded as covering up to six SEC cells apiece.
Since analyzing full textbook content using the SEC framework is time intensive, the analyses presented here rely primarily on a single coder—the same experienced mathematics teacher who created the coding sheet. Prior examinations of the quality of textbook analyses indicate that these are highly reliable, with generalizability coefficients well above .90 for even two raters (Polikoff et al., 2015). This implies that the difference between the ratings from one coder and the ratings from another would be negligible. Here, we also check the reliability of our coding by randomly sampling one chapter within each of four randomly selected textbooks to be analyzed by a second coder—a doctoral student and former elementary teacher with advanced quantitative training. No training was provided, just a brief orientation to the process and content language. We calculated an alignment index for each chapter, comparing the main coder’s ratings with the second coder’s ratings. In all cases, these alignment indices were .94 or above, indicating a high degree of agreement, and supporting the consistency with which the SEC framework is applied in this study.
We similarly apply the SEC to eighth-grade mathematics standards, following the same general procedure (for standards the “chunks” are simply objectives). We focus on the grade-level standards because those are the standards against which students are assessed, and also because our research questions require an apples-to-apples comparison given the uneven distribution of students across Algebra/pre-Algebra classes both within and between districts.
After applying the SEC to textbooks and content standards, we create indicators for both content alignment and for cognitive demand coverage. We calculate two alignment indices. The main alignment index (Porter, 2002) is
Here,
The difference here is that exact proportional agreement is no longer required. This index also ranges from 0 to 1 and represents the proportion of document
In addition to these two alignment indices, we also calculate the simple proportion of each textbook’s content at each of the six levels of cognitive demand. This analysis is based on previous work in English language arts (Polikoff & Struthers, 2013), which found that students in more disadvantaged schools were more likely to be exposed to lower cognitive demand skills during the No Child Left Behind Act era.
Research Question 2
To analyze our second research question, focusing on the relationship between school demographics and variation in standards alignment and cognitive demand, we rely on ordinary least squares regression at the school level. 5 Using each of our indices as an outcome, we apply the following model:
Here,
Results
Research Question 1
Alignment Differences Across Books
We begin analysis for our first research question by focusing on the extent to which textbooks align with the CCSS using either of our alignment indices (see Table 2). The results for the main alignment index indicate that only about one fifth to one third of textbook content aligns with the CCSS in terms of both topic and cognitive demand, with an average alignment of 0.25 (
Textbook Alignment With California’s Common Core State Standards (CCSS)–Alignment Index 1 Ranking
These alignment values are somewhat lower than those found for third-grade mathematics textbooks (0.28 to 0.40 for Alignment Index 1, 0.64 to 0.80 for Alignment Index 2) in the one prior published study of mathematics textbook alignment to standards using the SEC (Polikoff, 2015). Prior research on alignment indices cautions against direct comparisons of alignment indices outside the confines of a single study, however, because features of the alignment process such as the number of cells in the coding framework can affect the expected magnitude of alignment indices (Polikoff & Fulmer, 2013).
The alignment indices also reveal patterns by adoption date and subject. We find that, while overall alignment with the CCSS remains low, textbooks that were adopted after CCSS implementation are in the top half of the rankings for Alignment Index 1 (eight of the top nine most aligned books), while textbooks that were adopted prior to CCSS implementation rank primarily among the bottom half. In other words, the books claiming CCSS alignment were indeed more aligned to the CCSS than prior textbooks. Perhaps not surprisingly given that the CCSS moved a good deal of Algebra content to high school, we also find that post-CCSS pre-Algebra books rank higher than any other textbooks on the first alignment index.
Interestingly, as shown in Table 2, comparing books on the alternate alignment index does change the rankings. For instance, post-CCSS books are no longer the most aligned when the alignment index no longer requires exact proportional agreement. Based on the second alignment index, the five books measured as most closely aligning with the CCSS are pre-CCSS pre-Algebra books. In the next section, we explore areas of misalignment and explain these differences between the two indices.
Exploring Areas of Misalignment
To shed some descriptive light on the misalignment between textbooks and the CCSS, we examine differences in content coverage at the coarse-grained topic level. For these analyses, we first calculate the percentage of CCSS content under each of the 16 coarse-grained topics (only 11 of the 16 are represented at all in the eighth-grade CCSS; see online Supplemental Appendix C). Then, we calculate the percentage of textbook content on CCSS-covered cells under those same 16 coarse-grained topics. Finally, we take the difference. So, for instance, we find that 3.5% of CCSS eighth-grade mathematics content is on whole number operations, and Big Ideas Math Course 3 allocates an additional 2.1% more of its content to the CCSS cells covered under whole number operations (so 5.6% of its total content).
Based on this analysis, we find that all textbooks overemphasize SEC cells within the coarse-grained topic area of whole number operations by an overall average of about six percentage points (the CCSS allocate about 3.5% of their content to this topic; textbooks allocate about 9.4% of their content to it). Pre-CCSS pre-Algebra books fare especially poorly here, allocating about 12% of their content to this topic versus just 6.1% for post-CCSS pre-Algebra books. In contrast, all textbooks underemphasize CCSS content within most other coarse-grained topics. Geometric concepts, in particular, is dramatically underemphasized in textbooks relative to the standards. While about 30% of CCSS content is on geometric concepts, textbooks average just 5.4% content on geometric concept cells covered by the standards.
We also analyze the content in the books by aggregating the textbook content on cells not covered at all in the CCSS under each coarse-grained area (see online Supplemental Appendix D). Based on this analysis, we reach several conclusions. First, textbooks do not cover much content in coarse-grained areas that are not covered at all in the standards (these are consumer applications, probability, analysis, trigonometry, special topics). Only about 1% of textbook coverage on average is on these areas. In contrast, about 48% of textbook content on average is found within coarse-grained topics that are covered in the standards (the remaining 51% of textbook content is Alignment Index 2, i.e., it is on cells covered in the standards). Areas that are particularly emphasized in textbooks include basic algebra (about 13% of textbook content on average is on basic Algebra cells that are not covered at all in the CCSS); fraction, decimal, and percentage of operations (about 9%); and whole number operations (about 7%).
We find interesting patterns when comparing pre- with post-CCSS pre-Algebra books. Pre-CCSS pre-Algebra books emphasize non-CCSS content under whole number operations (9.0%); fraction, decimal, and percentage of operations (17.0%); and number sense/properties/relationships (9.3%) to a far greater extent than do post-CCSS pre-Algebra books (5.1%, 5.7%, 3.9%, respectively). In contrast, post-CCSS pre-Algebra books actually fare worse than pre-CCSS pre-Algebra books in their emphasis of non-CCSS basic Algebra content (12.8% to 8.6%, respectively).
Cognitive Demand in the Textbooks
In addition to examining differences between textbooks and the CCSS by coarse-grained topics, we also analyze cognitive demand. To do this, we sum any fine-grained topic covered at each of the six levels of cognitive demand (regardless of whether the cell is covered by the CCSS). This allows us to determine the total proportion of each textbook, as well as the CCSS, that is devoted to coverage at each level of cognitive demand. In fact, we find striking differences in the cognitive demand levels emphasized in the textbooks in our sample compared with the CCSS.
Table 3, ordered by Alignment Index 1, displays the proportion of each textbook devoted to coverage of each level of cognitive demand, as well as the proportion of coverage devoted to each of these levels in the CCSS in the bottom row. We find that, on average, the textbooks in our sample and the CCSS devote roughly the same proportion of coverage, about 7%, to topics using memorize/recall techniques (Level B), which comprise the most basic level of cognitive demand but are likely fundamental to the development of more complex cognitive skills. Similarly, we find almost matching levels of coverage when it comes to cognitive approaches requiring students to make generalizations (Level F): both our sample of textbooks and the CCSS devote about 4% of coverage. Beyond these two categories of cognitive demand, however, we find little agreement between our textbook sample and the CCSS in the level of rigor required by content coverage.
Textbook Total Coverage of Common Core State Standards (CCSS) by Coarse-Grained Cognitive Demand Category Alignment Index 1 Ranking
In general, we find that textbooks devote a much higher proportion of coverage to content using techniques relying on performing procedures (Level C) at the expense of techniques that require students to demonstrate/communicate understanding (Level D) and provide justifications/evaluations (Level E). On average, textbooks in our sample cover over three quarters of content through techniques involving performing procedures, whereas the CCSS devotes just over a third of coverage to this cognitive demand level. The highest levels of emphasis on procedures can be found in pre-CCSS books for pre-Algebra and Algebra, with an average of about 81% and 80% for each category of textbooks, respectively. In short, our findings on cognitive demand indicate that these commonly used eighth-grade math textbooks are universally underdelivering on the cognitive demand expectations called for by the CCSS.
Overall, our alignment indices reveal major discrepancies that remain between the CCSS and textbooks in our sample, and we find that these differences remain prominent along both dimensions of coarse-grained topic and coarse-grained cognitive demand. While the main alignment index does show that post-CCSS pre-Algebra books are the most aligned with the standards, when we use other alignment indices the results are not as positive. Furthermore, all of the books, regardless of whether they were written pre- or post-CCSS, overemphasize procedural skills relative to what is called for in the standards.
Research Question 2
We next examine the extent to which the variation in textbook alignment with the CCSS, as well as cognitive demand coverage of the adopted textbooks, are associated with student characteristics. We begin by describing our school-level findings (see Table 4). We briefly discuss our district-level findings—those results are available in the online Supplemental Appendix B.
Fitted Models Estimating the Relationship Between Alignment Indices/Cognitive Demand Levels and School-Level Variables
School Level
Looking first at the regressions predicting alignment indices, we find limited evidence of differences in OTL as measured by alignment. The only consistent finding between the two models is for school size, with larger schools adopting slightly less aligned textbooks on both indices (
Relationships of alignment with student demographics are variable, but all evidence indicates any statistically significant associations are small in magnitude. The only student demographic characteristic that is statistically significant at the .05 level is for the proportion of students receiving FRL on the second alignment index. This magnitude of this coefficient indicates that a 10 percentage point increase in the proportion of FRL students is associated with a 0.2 percentage point decrease, or about 6% the standard deviation of textbook alignment indices, on Alignment Index 2.
The remaining columns of Table 4 are for the relationship of school and student characteristics with cognitive demand coverage. We include both the six individual cognitive demand levels and, in the final column, an aggregate of Cognitive Demand Levels B (memorize) and C (perform procedures). Again, the broad pattern is one of limited or weak associations, though there are individual coefficients that are statistically significant at the .05 level in each model. The most robust finding is that schools with more Asian students adopt higher cognitive demand textbooks (significantly lower on cognitive demand Level C and the B/C combo, significantly higher on Level D and F). There is some weaker evidence that schools with more students receiving free/reduced-price lunch adopt books emphasizing more memorization and schools with more students with disabilities adopt books emphasizing more procedures. In contrast, schools serving more Black students adopt textbooks emphasizing more application (Level G). In general, the magnitude of these relationships is again relatively modest. The largest coefficient is for students with disabilities on Cognitive Demand Level C—a 10 percentage point increase in the percentage of students with disabilities is associated with about 1% more procedural content.
District Level
Because many, but not all, textbook adoptions happen at the district level, we also ran all analyses in these “uniform-adopting” districts by aggregating school-level results to the district level. As these are a subset of districts represented in the school-level analysis, we would not expect identical results. Indeed, the results shown in the online Supplemental Appendix B indicate even fewer clear or consistent relationships of either alignment or cognitive demand with district characteristics. None of the main findings that were statistically significant at the .05 level in the school-level model remains significant in the district-level model.
Limitations
Our work aims to provide a thorough descriptive analysis of textbook OTL. In doing so, we make no causal claims about any relationships identified between school and district characteristics and our OTL measures.
It is also important to note three limitations in our data. First, SARC data, which is self-reported by schools and/or districts, may contain errors. When we are able to identify likely errors, we attempt to remedy these errors when possible as previously detailed. However, not all issues of missing or incorrectly entered data can be remedied. Moreover, in some cases, even when a school has valid SARC data, demographic data for the school may be missing. Second, while our data capture a single year snapshot of trends when examining the relationship between school and district characteristics and our OTL measures, we are not able to examine possible changes in trends across years. Third, due to the time-intensive analysis required to code textbook and CCSS alignment, our sample is limited to 20 textbooks analyzed by one primary coder. We attempt to mitigate the impact of these limitations by focusing our analysis on more commonly used textbooks in California and, for a limited portion of the analysis, confirming high levels of interrater reliability with a second coder.
As a consequence of our data limitations, some schools and districts are dropped from our sample. We aim to be transparent about our resulting sample by reporting the number of eighth-grade serving schools in the SARC data, the number of schools remaining after we limit our sample to schools that we determine to use at least one of our 20 textbooks, and the number of these schools with nonmissing demographic data used in regression calculations. However, we do acknowledge that these limitations result in a sample of schools that are not necessarily representative of all California schools.
Discussion
Our results reveal that problems of poor alignment of textbooks to standards reported elsewhere (e.g., Polikoff, 2015) were also present in eighth-grade mathematics, even after the statewide adoption of “standards-aligned” textbooks. While post-CCSS textbooks do appear to be more aligned than pre-CCSS textbooks, even the most highly ranked textbooks are not very well-aligned with the CCSS. The highest scoring textbook in our sample, for example, only achieves about a third of cell-based alignment with the CCSS using our primary alignment index. Even using the alternative version of our alignment index, which more generously scores textbook alignment with the CCSS since there is no penalty for overemphasis, we still find that, depending on the textbook, 40% to 64% of textbook content is on topic/cognitive demand combinations not covered by the CCSS.
Examining the data by coarse-grained topic and cognitive demand level, we find that textbooks dramatically underemphasize the geometry content in the standards and overemphasize operations. This could imply that the textbooks are devoting considerable space to prior-grades content, which would align with findings about the large amount of redundancy of content across grades in mathematics instruction (Polikoff, 2012c). We also find that the eighth-grade math textbooks in our sample overemphasize performing procedures and de-emphasize techniques relying on higher level cognitive functions requiring students to demonstrate/communicate understanding and provide justifications/evaluations. This aligns with work on elementary mathematics textbooks (Polikoff, 2015). Post-CCSS textbooks do fare somewhat better on this measure. Given the centrality of curriculum materials in mathematics teachers’ instruction, these results suggest that teachers may need to supplement core textbooks with more cognitively demanding material if students are to experience instruction on these more ambitious skills.
Given the challenges of determining textbook and CCSS alignment, it is perhaps also not surprising that we find only limited evidence of systematic differences in textbook alignment associated with school or district characteristics. When we do see significant relationships between school or district characteristics and our OTL measures, the relationships tend to be fairly small in magnitude and primarily occur with our measures of cognitive demand rather than alignment indices. Moreover, most of the small differences at the school level tend to disappear when we focus our analysis on the district level, although doing so of course omits districts without uniform adoptions from our sample entirely. In short, though the books themselves do differ from one another in their alignment to standards and their coverage of topics and cognitive demand levels, we find only modest evidence that these differences are systematically associated with the kinds of schools and districts that choose them.
Where we do find any statistically significant associations of school or student characteristics with our measures of textbook content, they are often—but not always—in the direction we might expect based on theory and prior literature. For instance, schools with more students receiving FRL adopt slightly less aligned textbooks and schools serving more students with disabilities or students receiving FRL adopt textbooks that emphasize lower levels of cognitive demand more strongly. Though far from overwhelming, these findings comport with recent research finding ongoing inequities in student OTL (e.g., New Teacher Project, 2018).
Our conclusions are particularly important in the context of the
Second, although we do not generally see large systematic differences in textbook OTL by school/district characteristics, it is important to also note that there are many other ways in which historically underserved student groups receive less access to OTL in and out of school. For instance, recent research finds that teachers of these student groups often lower the challenge of standards-aligned lessons in implementation, in essence denying students the OTL the materials seek to provide (New Teacher Project, 2018). And of course there are undoubtedly inequities in out-of-school learning opportunities such as access to enrichment and tutoring activities. Thus, while our findings suggest that textbook content may not be a primary driver of inequities in access to learning opportunities, clearly textbooks are not contributing much to the solution of these inequities.
Supplemental Material
sj-xlsx-1-ero-10.1177_23328584211065712 – Supplemental material for The Equitable Distribution of Opportunity to Learn in Mathematics Textbooks
Supplemental material, sj-xlsx-1-ero-10.1177_23328584211065712 for The Equitable Distribution of Opportunity to Learn in Mathematics Textbooks by Morgan S. Polikoff, Sarah J. Rabovsky, Daniel Silver and Rosalynn Lazar-Wolfe in AERA Open
Footnotes
Acknowledgements
We gratefully acknowledge financial support for this research from the National Science Foundation Grant No. 1445654, as well as the William T. Grant Foundation. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF or the William T. Grant Foundation.
Notes
Authors
MORGAN S. POLIKOFF is an associate professor of education at the University of Southern California Rossier School of Education. He researches standards and curriculum policy.
SARAH J. RABOVSKY is a research analyst at the Tennessee Department of Education. She researches alternative schools in U.S. education.
DANIEL SILVER is a PhD candidate at the University of Southern California Rossier School of Education. He researches teachers’ selection and use of supplementary curriculum materials.
ROSALYNN LAZAR-WOLFE is a teacher at the Ojai Valley School in Ojai, California.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
