Abstract
Student evaluations of teaching (SETs) are important at most colleges and universities. One purpose of this study was to determine the extent to which motivational climate was associated with SETs. Another purpose was to determine whether course ease was associated with SETs. Participants included 2,949 undergraduate students from 30 courses at a large public university. Using hierarchical linear modeling, we examined the extent to which students’ motivation-related course perceptions (empowerment/autonomy, usefulness, success expectancies, situational interest, and caring) related to SETs at the student and class levels. SETs were highly associated with motivational climate. Furthermore, easier courses were rated lower by students when controlling for motivational climate and the demographical composition of the class. These findings highlight the association between the motivational climate and SETs and suggest that one way to improve SETs may be for instructors to focus on improving the motivational climate rather than making the course easier.
Keywords
Student evaluations of teaching (SETs) are important at many colleges and universities because they are often used as an indicator of teaching quality to make important personnel decisions, such as annual reviews, merit raises, and hiring and promotion decisions (Linse, 2017; Miller & Seldin, 2014; Stroebe, 2020; Wachtel, 1998). Another purpose of SETs is to provide instructors with feedback about students’ perceptions of their teaching, presumably so they can improve their teaching strategies if needed. Students base their ratings on a variety of factors, including their instructors’ teaching strategies (Carpenter et al., 2016) and the motivational climate that the instructor establishes (Griffin, 2016). For example, students’ interest in class activities is correlated with their SETs (Jones, 2019). Therefore, despite the fact that SETs are not a good indicator of student learning and can be biased (Carpenter et al., 2020; Peterson et al., 2019; Uttl et al., 2017), they can provide an indication of the motivational climate that is created, in part, by the instructor. By motivational climate, we are referring to the aspects of the psychological environment that affect students’ motivation and engagement within a course. Researchers have only begun to understand which aspects of the motivational climate are most related to SETs and the relative importance of these aspects.
One purpose of this study was to determine the extent to which different motivation-related course perceptions were associated with SETs. Although some studies have examined these relationships, they have typically only included two or three motivation-related perceptions (e.g., Filak & Sheldon, 2003; Griffin, 2016), with a few studies including a wider breadth of motivation-related perceptions to assess the motivational climate (e.g., Jones, 2010, 2019; Wilkins et al., 2021). Understanding the extent to which different motivation-related perceptions are associated with students’ SETs could be useful to instructors because strategies that have been shown to affect students’ motivation-related course perceptions could also be used to increase SETs. For example, if students’ perceptions of the usefulness of a course are related to SETs, instructors could implement usefulness strategies to increase both their SETs and improve the motivational climate in the course. And, attending to students’ motivation-related perceptions could have positive downstream effects on students’ academic motivation and behaviors (Oppenheimer & Hargis, 2020; Serra & McNeely, 2020). For example, a student in a general education course who rates an instructor highly and enjoys the course may be more likely to select courses in that discipline in the future or to seek out more information about that discipline outside of school.
A second purpose of this study was to determine whether the ease of a course is related to SETs when controlling for students’ motivation-related class perceptions. Some researchers have found that when instructors provide higher grades, they are more likely to receive higher SETs (Greenwald & Gillmore, 1997), which allows students to shape faculty behavior by encouraging faculty to provide higher grades to receive higher SETs (see Stroebe, 2020, for a discussion). However, the extent to which this finding is causal remains uncertain and alternate explanations are also possible (e.g., Marsh & Roche, 1997; McKeachie, 1997). For example, it may be the case that quality instruction leads to increased learning and achievement, which leads to higher ratings. If “easy” courses are not related to SETs, instructors may be more likely to design rigorous courses without fear of receiving lower SETs, and people making personnel decisions based on SETs may be less likely to attribute high SETs to easy courses rather than other factors, such as teacher effectiveness or motivational climate.
Predictors of Student Evaluations of Teaching
Researchers have correlated SETs with a variety of variables, but likely none so hotly debated as course grades. Students’ grades in a course and their SETs tend to be slightly to moderately positively correlated (Brockx et al., 2011; Spooren et al., 2013) and researchers have explained this relationship in a variety of ways. One possibility is that SETs are a valid measure of teaching quality and students rate instructors higher when they earn higher grades because they have learned more due to the instructor’s effective teaching (referred to as the validity hypothesis by Marsh, 1984). Another possibility, referred to as the “grading leniency hypothesis” by Marsh (1984), states that when teachers give higher-than-deserved grades, students reward them by giving them higher-than-deserved SETs. Studies have supported both of these possibilities, with some researchers providing evidence that seems to support the validity hypothesis (Arnold, 2009; Centra, 2003; Remedios & Lieberman, 2008) and others providing evidence for the grading leniency hypothesis (Greenwald & Gillmore, 1997; Krautmann & Sander, 1999; McPherson, 2006). After reviewing studies supporting these hypotheses, Brockx et al. (2011) stated, “It is possible to conclude that there is no consensus regarding the interpretation of the relationship between course grades and SET” (p. 292).
Other variables, such as teacher, student, and course characteristics, have also been examined for their relationship with SET scores (for a review, see Brockx et al., 2011). Some research about teacher characteristics highlights the fact that SETs are based more on factors related to the teacher than to the course (Beran & Violato, 2005; Marsh & Roche, 1997). For example, Marsh (1982) documented that there was virtually no correlation between the SETs of different instructors teaching the same course; yet, the correlation in SETs for the same instructor teaching two different classes was fairly high (r = .61 and .72). SETs are also correlated with some important teaching qualities. For example, SETs have been shown to be related to instructors’ organization, preparation, enthusiasm, and presentation style (Carpenter et al., 2016; Motz et al., 2017; Serra & Magreehan, 2016; Toftness et al., 2018; Williams & Ceci, 1997), all of which are factors that can affect students’ motivation and engagement within courses (Frenzel et al., 2009; Keller et al., 2014; Zhang, 2014). Taken together, these studies indicate that teacher characteristics and teaching strategies are related to SETs.
While these studies have focused on students’ ratings of teacher characteristics and qualities, other studies have focused on students’ motivation-related course perceptions. These studies have examined how students’ motivation-related perceptions of the class environment are related to their SET scores. The term “motivation-related” course perceptions describes student perceptions associated with motivational constructs, such as autonomy (R. M. Ryan & Deci, 2020), utility value (Eccles & Wigfield, 2020), expectancy for success (Bandura, 1997; Eccles & Wigfield, 2020), situational interest (Renninger & Hidi, 2017), and caring/relatedness (R. M. Ryan & Deci, 2020). Students’ motivation-related course perceptions are important because they are associated with students’ engagement and performance in classes (Christenson et al., 2012; Middleton et al., 2017).
Although researchers have studied motivation-related course perceptions for decades (see Wentzel & Miele, 2016), the extent to which these perceptions are related to SETs is less understood because the research in these two fields has not overlapped significantly, with a few exceptions. For example, SETs have been shown to be related to undergraduate students’ perceptions of autonomy support (Demir et al., 2019; Filak & Sheldon, 2003; Griffin, 2016), intrinsic motivation (Filak & Sheldon, 2003; Griffin, 2016), and instructor caring (a.k.a., rapport, relatedness; Demir et al., 2019; Filak & Sheldon, 2003; Perkins et al., 1995). Other researchers have investigated a broader variety of motivation-related perceptions within a study and documented relationships between SETs and autonomy/empowerment, usefulness/utility value, success expectancies, situational interest/intrinsic motivation, and instructor caring (Jones, 2010, 2019; Jones & Skaggs, 2016; Wilkins et al., 2021). In sum, these studies provide evidence that at least five different course perceptions—autonomy/empowerment, usefulness/utility value, success expectancies, intrinsic motivation/interest, and instructor caring—are significantly associated with SETs. In the next section, we provide more explanation of these course perceptions and discuss their importance.
The Present Study
Although many different class perceptions and motivation constructs exist (see Wentzel & Miele, 2016), we chose to focus on the five course perceptions that comprise the MUSIC Model of Motivation (Jones, 2009, 2018, 2020): perceptions of empowerment/autonomy, usefulness/utility value, success expectancies, situational interest/intrinsic motivation, and caring (the beginning sounds of these five perceptions form the acronym MUSIC). In the MUSIC model, empowerment/autonomy refers to students’ perceptions of the amount of control and choice that students have in a course. Usefulness/utility value refers to students’ perceptions that the course content is useful to their lives, either currently or in the future. Success expectancies are students’ beliefs that they can succeed in the course if they put forth effort. Situational interest is the extent to which students perceive the course to be interesting and enjoyable. Finally, caring refers to students’ perceptions that others in the learning environment (i.e., the instructor and other students) care about their learning and well-being.
We included the five MUSIC perceptions in our study for several reasons. First, many studies have shown that the five MUSIC perceptions are distinct (but correlated), including studies with samples of (1) undergraduate students in the United States (Jones et al., 2013; Jones et al., 2014; Jones et al., 2016; Jones & Skaggs, 2016; Wilkins et al., 2021), China (Jones et al., 2017), and Colombia (Jones et al., 2017), and (2) professional school students in the United States (Jones et al., 2019; Pace et al., 2016) and New Zealand (Gladman et al., 2020). Therefore, although these five course perceptions include a range of perceptions, these constructs do not overlap significantly, which reduces redundancy when assessing students’ course perceptions. Second, the five MUSIC perceptions can be measured reliably (Jones & Skaggs, 2016; Pace et al., 2016) with instruments that assess constructs common to many current motivation theories, including self-determination theory (R. M. Ryan & Deci, 2020), situated expectancy-value theory (Eccles & Wigfield, 2020), social cognitive theory (Bandura, 1997), interest theories (Renninger & Hidi, 2017), among others (for a more comprehensive list, see Jones, 2018). Third, the five MUSIC perceptions have been shown to be correlated with SETs in prior studies (Jones, 2010, 2019; Jones & Skaggs, 2016; Wilkins et al., 2021) and other researchers have assessed some of these perceptions in their studies with similar results (e.g., Filak & Sheldon, 2003; Griffin, 2016). Fourth, the MUSIC perceptions are malleable in that instructors can adjust their teaching strategies to change students’ perceptions. Therefore, our findings could have more direct implications for instructors (e.g., instructors with low SETs should implement strategies to increase students’ perceptions of the usefulness of the content). Fifth, all five MUSIC perceptions have been shown to be important predictors of students’ motivation and engagement in courses (Wentzel & Miele, 2016).
We wanted to investigate the relationships between students’ MUSIC perceptions and SETs at the student level across a wider variety of courses than had been investigated previously. Some studies have included courses from only one academic discipline, such as education (Griffin, 2016), journalism (Filak & Sheldon, 2003, Study 2), mathematics (Wilkins et al., 2021), and psychology (Demir et al., 2019). Other studies have included a few students from many different courses, which does not allow the results to be generalized to all of the students in those courses (Filak & Sheldon, 2003, Study 1; Jones & Skaggs, 2016). Also, because we were interested in the motivational aspects of the course climate, it was important for us to consider class level perceptions in addition to student level perceptions. Examining students’ perceptions at the class level allowed us to consider the contextual effects, such as how the dynamics of a teacher and students within a class can affect the overall class climate.
Two research questions guided this study:
Regarding our first research question, we anticipated that students’ MUSIC perceptions would be positively related to SETs at the student and class levels because prior studies have documented these relationships (Filak & Sheldon, 2003; Griffin, 2016; Jones, 2010, 2019; Jones & Skaggs, 2016; Wilkins et al., 2021). Concerning our second research question, we did not have a clear hypothesis because the research findings and interpretations associated with grade leniency are mixed (Greenwald & Gillmore, 1997; Marsh & Roche, 1997; McKeachie, 1997; Stroebe, 2020; Zabaleta, 2007). In our analyses for both research questions, we controlled for three additional variables in our analyses—gender, race/ethnicity, and class size—because researchers have documented that SETs can vary based on these variables (Basow & Martin, 2012; Bavisi et al., 2010; Benton & Pallett, 2013; Ho et al., 2009; Peterson et al., 2019).
Method
Participants, Courses, and Procedure
Participants included 2,949 students from 30 undergraduate courses at a large public university in the Southeastern United States. Of the 2,949 participants, 1,574 (53.4%) were male, 1,358 (46%) were female, and 17 self-identified as another gender (0.6%). With respect to race/ethnicity, 2,046 (69.4%) of the students were White or Caucasian (not Hispanic), 507 (17.2%) were Asian or Pacific Islander, 131 (4.4%) were more than one race/ethnicity, 113 (3.8%) were Hispanic, 106 (3.6%) were Black or African American, 37 (1.3%) were another race/ethnicity not provided as a survey option, and 9 (0.3%) were Native American.
The 30 courses were part of about 400 undergraduate courses that comprised the general education program, which consisted of the portion of the undergraduate curriculum shared by all students enrolled at the university, regardless of their major. The purpose of the general education program was to provide all students with the opportunity to develop crosscutting skills and capacities and engage with a broad selection of disciplinary fields and perspectives to build a foundation for civic engagement, employability, and lifelong learning (Association of American Colleges and Universities, 2002). Courses that were included in this study met the following criteria: (1) the course instructor participated in an optional, self-selected professional development opportunity designed to help instructors improve their instruction (the purpose of the professional development was to help instructors use assessments effectively and to consider strategies to make their courses more engaging); (2) no more than one section of the course could be included in the study; (3) no instructor could teach more than one course included in the study to ensure that all of the instructors in the study were different; and (4) more than 50% of the students in the course had to have completed the study survey. The courses that met these criteria represented a wide variety of topics, such as theatre, art history, geography, economics, human development, planning and design, computer science, chemistry, and physics (see Table 1 for the complete list). All of the courses were worth three credits except for one course that was worth two credits and one course that was worth four credits.
Participating Courses Ordered by the Number of Students in Each Course
Students completed an online “Student Perceptions Survey” that they received from their instructor through a URL link provided in an email. The link was sent during the middle half of the course for 21 of the courses and during the last quarter of the course for the other nine courses. The primary purpose of the survey was to obtain feedback from students about the course that instructors could use to improve the course. Typically, the instructors used the feedback to make changes to their course the next semester they taught it. Some instructors included the survey as a graded homework assignment, other instructors gave students time in class to complete it, and others strongly encouraged students to complete it because their feedback could be used to improve the course in the future. To keep students’ individual survey responses anonymous to the instructors, a researcher collected all of the survey responses and provided the instructors with the names of students who completed the surveys (if the instructors needed the names to assign grades for the assignment).
This study was approved by the institutional review board at our university, and a student consent form was included as part of the survey. A total of 2,949 students completed the survey and gave their consent to participate in this study. The response rate for each course ranged from 52% to 95%, with a mean value of 72% across courses (see Table 1).
Instruments
Perceptions of the MUSIC Model Components
Students’ perceptions of the five MUSIC model components (i.e., empowerment/autonomy, usefulness, success, situational interest, and caring) were measured using the five scales from the MUSIC Model of Academic Motivation Inventory, college student version (MUSIC Inventory; Jones, 2012). The MUSIC Inventory scales measure the extent to which students perceive that they have control of their learning environment in the course (empowerment/autonomy scale; five items), the coursework is useful to their future (usefulness/utility value scale; five items), they can succeed at the coursework (success expectancies scale; four items), the instructional methods and coursework are interesting (situational interest scale; six items), and the instructor cares about whether the student succeeds in the coursework and cares about the student’s well-being (caring scale; six items). Students rated all items on a 6-point Likert-format scale with descriptors at each point (1 = Strongly disagree, 2 = Disagree, 3 = Somewhat disagree, 4 = Somewhat agree, 5 = Agree, 6 = Strongly agree). Here is an example item from each scale: “I have the freedom to complete the coursework my own way” (empowerment/autonomy), “In general, the coursework is useful to me” (usefulness), “I am confident that I can succeed in the coursework” (success), “The coursework is interesting to me” (situational interest), and “The instructor cares about how well I do in this course” (caring). Each student received the items in a different, random order. The complete inventory, administration instructions, and validity information are available in the User Guide (Jones, 2012). Good Cronbach’s alpha values (used as a measure of internal consistency reliability) have been reported in several studies with college students, including Jones and Skaggs (2016; .91 for empowerment, .96 for usefulness, .93 for success, .95 for interest, and .93 for caring); Chittum et al. (2019; .83 for empowerment, .87 for usefulness, .86 for success, .87 for interest, and .82 for caring); and Jones (2019; 37 of the 40 values calculated were between .70 and 1.0). In the present study, the Cronbach alpha values were good to excellent: .86 for empowerment, .94 for usefulness, .87 for success, .92 for interest, and .86 for caring. Some studies have also provided construct and predictive validity evidence for the MUSIC Inventory when it is used with undergraduate students. For example, factor analysis has been used to provide evidence for the construct validity of the five-factor structure of the MUSIC Inventory in undergraduate courses (Chittum et al., 2019; Jones & Skaggs, 2016; Tendhar et al., 2017). In addition, higher scores on the five MUSIC Inventory scales have been shown to be associated with higher course ratings, instructor ratings, effort (behavioral engagement), and cognitive engagement in undergraduate courses (Chittum et al., 2019; Jones, 2019; Jones & Skaggs, 2016; Wilkins et al., 2021).
Ease of Course
The Ease of Course scale (Jones et al., 2021) assesses the extent to which students perceive the course to be easy. The scale consists of three items that are rated on a 6-point Likert-format scale with descriptors at each point (1 = Strongly disagree, 2 = Disagree, 3 = Somewhat disagree, 4 = Somewhat agree, 5 = Agree, 6 = Strongly agree). The items are (1) “This course is very easy for me.” (2) “I don’t need to work my hardest to get a high grade in this course.” (3) “In this course, I can get the grade I want with very little effort.” This scale has demonstrated acceptable internal consistency reliability in other studies with undergraduate students (α = .73; Jones et al., 2021) and in the present study (α = .82).
Student Evaluations of Teaching
SETs were assessed related to the course and instructor. Students answered one item that measured their overall perceptions of the course and one item that measured their overall perceptions of their instructor. These items were the same as those used in other studies (Jones, 2010; Wilkins et al., 2021) and they are similar to the items on the mandatory course evaluation forms at the participating university. The items were “My overall rating of the course” and “My overall rating of the instructor for this course,” and both items were rated using the following Likert-format scale: 1 = Terrible, 2 = Poor, 3 = Satisfactory, 4 = Good, 5 = Very good, and 6 = Excellent.
Analysis
We used a two-level multilevel model (hierarchical linear modeling [HLM]; Raudenbush & Bryk, 2002) because the structure of the data was nested, with individual students nested within classes (the Level 1 units were students, and the Level 2 units were classes). HLM allowed us to account for the fact that the responses from the same classes were likely to be more correlated than those in other classes, which creates nonindependent observations that violate the assumption in a multiple linear regression model. The number of macro units in this study (30 classes) met the recommended minimum number of 30 (Hox et al., 2018). The details of our analysis are provided in the appendix, available in the online version of this article.
Results
Descriptive Statistics and Correlations
At the class level, the average number of students in each class (i.e., the cluster size) was 98.4 and ranged from 16 to 427 (see Table 2). Class variables (i.e., Level 2) included the means of the Empowerment (Mean Empowerment), Usefulness (Mean Usefulness), Success (Mean Success), Interest (Mean Interest), Caring (Mean Caring), and Ease (Mean Ease). The control variables at the second level were the proportion of female students and other genders, the proportion of Asian and other races/ethnicities, and the class size.
Univariate Descriptive Statistics for Student Level and Class Level Variables
n = 2,949. bn = 30.
Bivariate associations among variables at both the student level and class level are reported in Table 3. At the student level, the course rating and instructor rating were highly correlated (r = .675). The MUSIC variables were moderately to highly correlated positively with course ratings (r ranged from .419 to .723) and instructor ratings (r ranged from .410 to .623), whereas the ease of course variable had a small, positive correlation with course rating (r = .125) and instructor rating (r = .047).
Pearson Correlation Coefficients for Study Variables at the Student Level and Class Level
Note. The results for the student level (n = 2,949) are shown above the diagonal. The results for the class level (n = 30) are shown below the diagonal.
p < .05. **p < .01.
At the class level, the mean course rating and instructor rating were very highly correlated (r = .937). The mean MUSIC variables were highly positively correlated with course ratings (r ranged from .463 to .902) and instructor ratings (r ranged from .409 to .920). The mean ease of course variable (Mean Ease) was not significantly correlated with course rating or instructor rating.
Multilevel Analysis
In this section, we describe the results of the multilevel analysis by summarizing the results of each model tested. We start with the models related to course rating and then move on to the models related to instructor rating.
Course Rating
The results of the analyses for course rating are presented in Table 4. There are six models (Model A to Model F) fitted for the analysis. These are referred to as the taxonomy of statistical models, which is a systematic sequence of models that, as a set, address our research questions (Singer & Willlett, 2003). The ICC for Model A (0.154) is moderately high and indicates that the student responses within the same classes are more similar than those in different classes. In Model B, the independent MUSIC variables are all group mean centered, and therefore, the coefficients represent the unbiased estimates of the student level relationships without considering the ease of the class, student demographics, or class size. All five of the MUSIC variables were statistically significant predictors of course rating and all of the relationships were positive and highly statistically significant (p ≤ .01; Empowerment = 0.055, Usefulness = 0.090, Success = 0.140, Interest = 0.499, and Caring = 0.105). In Model C, the student level estimates were the same as those in Model B, which is a virtue of the group mean centering. At the class level (i.e., level 2) for Model C, only Mean Empowerment (0.231) and Mean Interest (0.579) were significant predictors of course rating. The five MUSIC variables explained 93.0% of the variance in course rating at the class level (see
Results of Fitting a Taxonomy of Multilevel Models for Course Rating
Note. p Values are based on t tests for fixed effects parameters and z tests for variance component parameters; parameters are estimated by full maximum likelihood; Base in pseudo-R2 indicates that the R2 was computed based on the value for the model in the column. ICC = intraclass correlation coefficient; AIC = Aikaike information criterion; BIC = Bayesian information criterion.
p ≤ .10. *p ≤ .05. **p ≤ .01. ***p ≤ .001.
Model D included only the ease of course variable, which was significantly related to course rating at the student level (0.108), but not the class level (0.127, p > .05). Ease of course only explained 2.7% of the variance in course rating at the class level (
To compare the student level and class level effects more directly, we computed the completely standardized coefficient (
Half and Completely Standardized Regression Coefficients for Course Rating for Model F
Note. The expression of SD (β0j) =
p ≤ .05. **p ≤ .01. ***p ≤ .001.
Instructor Rating
The results of the six analyses for instructor rating are presented in Table 6. The ICC for Model A (0.255) is rather high and indicates that the student responses within the same classes are more similar than those in different classes. In Model B, three of the five MUSIC variables were significantly related to instructor rating and all of the relationships were positive and statistically significant (p < .001; Success = .077, Interest = .322, and Caring = .383). In Model C, the student level values were the same as Model B and at the class level, only Interest (.696, p < .001) was a significant predictor of instructor rating. In Model C, the five MUSIC variables explained 88.8% of the variance in instructor rating at the class level.
Results of Fitting a Taxonomy of Multilevel Models for Instructor Rating
Note. p Values are based on t tests for fixed effects parameters and z tests for variance component parameters; parameters are estimated by full maximum likelihood; Base in Pseudo-R2 indicates that the R2 was computed based on the value for the model in the column.
p ≤ .10. *p ≤ .05. **p ≤ .01. ***p ≤ .001.
In Model D, the ease of course variable was not significantly related to instructor rating at the student level or class level. Ease of course only explained 0.6% of the variance in instructor rating at class level. When the MUSIC variables were added to the ease of course variable in Model E, several variables were related to instructor rating at the student level (Success = .110, Interest = .315, Caring = .372, ease = −0.044, all p < .001) and course level (Usefulness = −0.184, p < .01; Success = .704, p < .05; Interest = .537, p < .001; and ease = −0.362, p < .05). These relationships remained similar at the student level when the demographic variables were added in Model F (see Table 6). In addition, demographic class composition such as identifying as a non-male (Mean Female = −0.383, p < .01; Mean Other Gender = −4.027, p < .05), as Asian (Mean Asian = −0.859, p < .001), or non-White and non-Asian (Mean Other Race = −0.714, p < .05) was negatively related to instructor rating. Furthermore, instructors of larger classes received higher ratings (0.001, p < .001). Almost all of the variance at the class level in instructor rating (99.96%) was explained when all of the variables were included in the full model (Model F).
The results for the completely standardized regression coefficient β are reported in the right-hand column of Table 7. These values indicate that the most important variables in predicting instructor rating were in the following order: Interest at the class level (0.890), Success at the class level (0.505), ease at the class level (−0.484), Interest at the student level (0.374), Caring at the student level (0.280), Usefulness at the class level (−0.122), Success at the student level (0.093), and Ease at the student level (−0.058).
Half and Completely Standardized Regression Coefficients for Instructor Rating for Model F
Note. The expression of SD(β0j) =
p ≤ .05. **p ≤ .01. ***p ≤ .001.
We examined the possibility of multicollinearity among the predictor variables and determined that multicollinearity did not have a significant effect on the results. The details of our multicollinearity analysis are provided in the online appendix.
Discussion
The primary purpose of this study was to determine the extent to which students’ motivation-related class perceptions were associated with their SETs, both at the student level and class level. A secondary purpose was to determine whether the ease of the course was related to SETs when controlling for students’ motivation-related class perceptions. In this section, we discuss our findings related to the primary purpose first, followed by those related to the secondary purpose.
Relationships Between Motivation-Related Class Perceptions and SETs
Our first research question was: To what extent are students’ MUSIC perceptions associated with SETs at the student level and class level? We measured five motivation-related perceptions (eMpowerment, Usefulness, Success, Interest, and Caring; referred to as “MUSIC perceptions”) and assessed SETs using an overall measure of the course and an overall measure of the instructor. We documented that students’ MUSIC perceptions were significantly related to their SETs at both the student and class levels, although two of the relationships were negative at the class level.
At the student level, all five MUSIC perceptions were positively, significantly correlated with the SETs (i.e., course and instructor ratings), with correlations ranging from .410 to .723 (see Table 3). These findings are consistent with the MUSIC Model of Motivation theory (Jones, 2009, 2018, 2020) and research studies that have reported correlations between students’ MUSIC perceptions and SETs in undergraduate college courses at the student level (Jones, 2010, 2019; Jones & Skaggs, 2016; Wilkins et al., 2021). Even when controlling for all of the other variables in the model (Model F, Tables 4 and 6), all five MUSIC perceptions were significantly correlated with course rating and three of the MUSIC perceptions (i.e., success, interest, and caring) were significantly correlated with instructor rating. These results are important because they demonstrate that SETs are associated with course perceptions (MUSIC) that have been identified by motivation researchers as being vital to students’ engagement in classes and learning activities more generally (Wentzel & Miele, 2016). Although some researchers have investigated these relationships (e.g., Demir et al., 2019; Filak & Sheldon, 2003; Griffin, 2016; Wilkins et al., 2021), the research literature about SETs has remained fairly distinct from the literature about student motivation with little overlap between these literatures. The present study indicates that there are relationships between motivation constructs and SETs that could provide greater insight into the findings in both literatures.
Another contribution of the present study is that we assessed the class level relationships by conducting a multilevel analysis. Class level relationships take into consideration contextual effects, such as the fact that the dynamics of the teacher and students in a class can affect the motivational climate within the class. The models that included the MUSIC perception variables and the control variables (i.e., ease of class, gender, race/ethnicity, and class size; see Model F in Tables 4 and 6) accounted for almost all of the variance at the class level in course rating (99.6%) and instructor rating (99.96%). These findings indicate that the classroom motivational climates (as reflected by the class average perceptions of empowerment, usefulness, success, interest, and caring) predict students’ course and instructor ratings. It is important to note that even with only the MUSIC variables in the model (Model C), these variables explained 93.0% of the variance in course rating and 88.8% of the variance in instructor rating at the class level. Similar to the findings at the student level, these class level findings highlight the strong associations between students’ motivation-related perceptions and their SETs and add to these findings by demonstrating that the motivational climate of the class is highly related to SETs.
The Importance of Interest and Success
A few patterns emerge when examining the MUSIC perceptions that are most highly predictive of SETs (see Tables 5 and 7). Interest and success at the class level are the highest predictors of course and instructor ratings. These two class level variables are more strongly associated with SETs than any of the other class level or student level variables, which suggests that SETs are strongly associated with the motivational climate of the class.
Because this study is correlational, we cannot conclude that higher class level interest and success perceptions caused students to rate their courses and instructors higher. However, given the high correlations between these variables, it would be reasonable to design experimental studies to investigate whether increasing students’ perceptions of interest and success leads to higher SETs. In fact, researchers have successfully implemented interventions to increase one or more of students’ motivation-related perceptions in a variety of ways (Lazowski & Hulleman, 2016; Lin-Siegler et al., 2016; Reeve et al., 2004). For example, when lower-performing undergraduates were asked to write about how the course material related to their lives, they reported higher usefulness, interest, and expectancy for success in the course (Hulleman et al., 2017). Similarly, undergraduates reported higher levels of usefulness and interest when they were placed in groups of three or four students and asked to discuss: (1) how the topics or assignments in the course related to their goals, (2) which topics or assignments on the syllabus were most interesting to them, and (3) questions they had about the course or instructor (McGinley & Jones, 2014). These types of interventions may be useful in not only increasing students’ motivation-related class perceptions but also increasing SETs.
Because the interest variable in the present study measures situational interest, it can be triggered by the environment (Renninger & Hidi, 2017). In fact, Hidi and Renninger (2019) note that situational interest is typically triggered by other people or affordances of the environment and that repeated opportunities are needed to sustain interest. The implication is that within a class, teachers can play a critical role in providing experiences that can interest students during class. As a few examples, instructors can provide hands-on activities (Swarat et al., 2012), incorporate games and/or cooperative learning activities (Bergin, 1999), and stimulate emotional arousal (e.g., showing enthusiasm, pacing the lesson appropriately, varying instructional activities; Jones, 2018). Instructors can affect students’ perceptions of success through a variety of strategies, such as providing explicit expectations (Wang et al., 2018), giving specific and honest feedback (Van den Bergh et al., 2014), designing activities at an appropriate level of difficulty (Shernoff et al., 2003), and helping students understand that their success is related to a combination of their effort and the use of relevant strategies (Dweck, 2006; Weiner, 2000).
Although empowerment perceptions were not as strongly associated with course rating as interest and success perceptions, it may be a variable that is worth manipulating in experimental studies to determine its effect on SETs because it has been shown to be an important predictor of students’ engagement in empirical studies (Cheon et al., 2020; Griffin, 2016; Reeve et al., 2020). The fact that empowerment was a predictor of course rating, but not a predictor of instructor rating indicates that students view their empowerment/autonomy as an aspect of the course more than an indication of instructor quality.
The Conundrum of Usefulness and Caring
Although the simple correlations between usefulness and SETs, and caring and SETs, are positive at the class level, their relationships become negative or nonexistent at the class level when these variables are included in the full model with the other variables (usefulness is negatively related to course and instructor rating; caring is negatively related to course rating and unrelated to instructor rating). The reason for this finding was that in the full model, the variance explained in the SETs was shared with all the variables; and thus, only variables that accounted for unique variance in SETs were identified as statistically significant predictors. Because the correlations among students’ MUSIC perceptions were statistically significantly correlated, the variance that remains for usefulness and caring after controlling for the other variables is negatively related to the SETs or unrelated (for caring predicting instructor rating). As noted previously, we determined that multicollinearity among the MUSIC perceptions was not a problem statistically; and therefore, we ruled out the possibility of multicollinearity as a reason for these findings.
We found no theoretical or empirical rationale for why usefulness would be negatively related to SETs when controlled for by the other MUSIC perceptions; consequently, we can only speculate as to why this association exists. Perhaps students reward instructors with higher ratings when courses are perceived as less useful because they understand the difficulties in making seemingly less useful courses relevant to their lives. This can be especially true in the general education courses included in this study because the usefulness may not be as obvious as it is in courses within their major field of study. For example, a history course may not be perceived to be useful by a student majoring in engineering; yet, if the instructor is able to make the history course interesting or make students believe that they can succeed in the course, students may still rate the course highly (as is evidenced by the fact that interest and success were highly related with SETs in this study).
Similarly, we found no research to explain why caring perceptions at the class level were negatively related to course rating in the full model. Clearly, further research is needed to interpret the negative relationships between SETs and usefulness and caring at the class level. To do so, it could be helpful to identify which aspects of the usefulness and caring constructs overlap significantly with the other MUSIC perceptions and which aspects do not. For example, it may be possible to create subscales within each MUSIC model component and then determine which subscales are more highly correlated with the other MUSIC scales or subscales. As examples, the caring construct has been divided into two subscales (personal and academic) by some researchers (Jones & Wilkins, 2013) and four subscales (promoting interaction, promoting mutual respect, promoting performance goals, and teacher support) by others (A. M. Ryan & Patrick, 2001).
Relationships Between Course Ease and SETs
Our second research question was: To what extent is the ease of the class related to SETs when controlling for students’ MUSIC perceptions? To understand the relationships between ease of course and the other study variables, we ran three models: (1) Model D, which included only the ease of course variable and either the course rating (Table 4) or the instructor rating (Table 6); (2) Model E, which added the MUSIC variables to Model D; and (3) Model F, which added the gender, race/ethnicity, and class size variables to Model E (provided in Tables 4 and 6).
Model D showed that students’ perceptions of the ease of the course were not related to their course and instructor ratings except that they were positively related to course rating at the student level. This student level finding is consistent with some prior studies (Greenwald & Gillmore, 1997) in which students tended to rate courses higher when the course was perceived as easier. However, another picture emerges when the MUSIC and other variables are included in Models E and F. In Model F (which includes all the variables), ease was negatively correlated with both course and instructor rating at the class level and negatively correlated with instructor rating at the student level (it was uncorrelated with course rating at the student level). These findings indicate that easier courses were rated lower than harder courses once motivation-related variables were controlled for both at the student and class levels, which is a finding that seems to contradict some other studies (Greenwald & Gillmore, 1997; Krautmann & Sander, 1999; McPherson, 2006).
To understand these results, it may be useful to examine the role of success perceptions, which measures students’ beliefs that they can succeed at the coursework if they put forth effort. Success perceptions could be high if the course was easy, but they could also be high if the course was hard, such as when students have the resources needed to be successful (e.g., example problems, study guides), the expectations are clear (e.g., clear directions, rubrics are provided), or some other conditions are present that lead students to believe they can succeed even when the course is perceived to be difficult. By including all of the MUSIC variables in the model along with the ease variable, we were able to parse out the individual variance that could be attributed to each variable. For example, the variance that remains for the ease variable is the part that is unique to ease that does not include the variance that can be attributed to MUSIC perceptions, such as their success perceptions. Given that, SETs were rated lower when courses were perceived as being easy and higher when students had higher MUSIC perceptions. This finding is consistent with others who have reported that higher workloads can lead to higher SETs if students learn more (Marsh & Roche, 2000). Of course, unreasonably high workloads may lead to student frustration and lower SETs (Kulik, 2001), possibly because students perceive the excessive work to be useless or an indication that the instructor does not care about them enough to consider how an excessive workload affects their lives. Future studies could investigate whether it is possible to increase SETs by creating challenging courses and helping students believe that they can succeed by providing appropriate resources and expectations.
The Role of Sex, Race/Ethnicity, and Class Size
We did not have a specific research question related to the role of sex and race/ethnicity primarily because we had no reason to believe that the relationships in this study would vary in any systematic way by gender based on prior studies (Jones, 2010; Nowell, 2007; Zabaleta, 2007). Nonetheless, we used these variables as co-variates in our analysis to determine if they would have any effect on the study variables; and consequently, should be considered as variables to examine in more detail in future studies.
Related to course rating (see Table 4), the only two significant findings were at the course level: (1) classes with a higher percentage of females than males and other genders received higher ratings (0.430, p ≤ .01) and (2) larger classes were rated higher than smaller ones (0.0004, p ≤ .05). Related to instructor rating (see Table 6), several findings emerged, all at the course level: (1) instructors of classes with more females and “other” (non-male) genders were rated lower (−0.383, p ≤ .01 for females’ proportion; −4.027, p ≤ .05 for other genders’ proportion), (2) instructors of classes with more non-White students were rated lower (−0.859, p ≤ .001 for Asian’ proportion; −0.714 for Other Race’s proportion); and (3) instructors who taught larger classes were rated higher (0.001, p ≤ .001). Future studies could examine why SETs differ based on these variables.
Limitations and Future Research
Our findings must be interpreted within the context of the study limitations. First, the study included only general education courses and students in these courses may have different motivations and expectations of these courses. In addition, because the instructors of these courses chose to participate in a professional development opportunity to improve their instruction, these instructors may differ from other instructors in some characteristics (e.g., they may be more committed to improving their teaching). In the future, researchers could compare the results of this study with courses and instructors that were more representative of the population of university courses and instructors. Second, there were some differences in the data collection that were assumed to not have an effect on the results, such as when the surveys were administered during the course and the types of incentives given for completing the survey. Future research could be designed to examine whether these differences in data collection affected the results in any meaningful way. Third, two general “overall” items were used to measure SETs (course rating and instructor rating). Future studies could include more items to assess specific characteristics of teachers and courses. Fourth, there may be other motivation-related class perceptions that are related to SETs that were not included in our study. However, the fact that our full model explained almost all of the class level variance in SETs provides evidence that the variables included were appropriate. Fifth, given that other studies have documented biases in SETs (Carpenter et al., 2020; Peterson et al., 2019), future studies could include instructor-specific class level control covariates (e.g., age, teaching experience) to identify biases against different groups of instructors. Sixth, the fact that fewer than 5% of the students self-identified their race/ethnicity as something other than White/Caucasian or Asian/Pacific Islander impeded our ability to adequately test for the effects of ethnicity/race. Further studies are needed to corroborate our findings in samples of underrepresented students. Seventh, we did not examine the variable associations for nonlinear relationships. Future research could examine whether any of these relationships are nonlinear.
Conclusion
One of our main findings was that the motivational climate of a course—as measured by students’ perceptions of empowerment, usefulness, success, interest, and caring within the course—predicts students’ overall course rating and instructor rating. This finding suggests that it is reasonable to suspect that changes in one or more MUSIC perceptions could affect students’ course and instructor ratings. Because prior studies have shown that instructors have some control over students’ MUSIC perceptions (Hulleman et al., 2017; Lin-Siegler et al., 2016; McGinley & Jones, 2014; Reeve et al., 2004), it is possible for future studies to examine how different interventions and teaching strategies can be used to positively affect students’ MUSIC perceptions, and consequently, SETs. Experimental studies would be very useful to investigate how intentionally manipulating students’ MUSIC perceptions affects their SETs.
Another main finding was that easier courses were rated lower than harder courses when the other study variables (i.e., MUSIC perceptions, gender, race/ethnicity, and class size) were held constant. For instructors, this finding suggests that attempts to make courses easier in order to improve SETs will likely not be successful. For researchers, these results demonstrate that it is important to control for motivation-related variables (e.g., MUSIC variables) in their studies when they investigate relationships between ease of course and SETs.
Given the results of this study and related studies (e.g., Wilkins et al., 2021), it is becoming clearer that instructors who want to improve their SETs should consider ways to improve the motivational climate of their course. Until more experimental research can be conducted to provide causal evidence of the interactions among motivation-related variables and SETs, the predictive evidence in the current study provides a logical rationale for instructors to focus on increasing students’ MUSIC perceptions.
Supplemental Material
sj-docx-2-ero-10.1177_23328584211073167 – Supplemental material for Motivational Climate Predicts Student Evaluations of Teaching: Relationships Between Students’ Course Perceptions, Ease of Course, and Evaluations of Teaching
Supplemental material, sj-docx-2-ero-10.1177_23328584211073167 for Motivational Climate Predicts Student Evaluations of Teaching: Relationships Between Students’ Course Perceptions, Ease of Course, and Evaluations of Teaching by Brett D. Jones, Yasuo Miyazaki, Mengyun Li and Stephen Biscotte in AERA Open
Supplemental Material
sj-xlsx-1-ero-10.1177_23328584211073167 – Supplemental material for Motivational Climate Predicts Student Evaluations of Teaching: Relationships Between Students’ Course Perceptions, Ease of Course, and Evaluations of Teaching
Supplemental material, sj-xlsx-1-ero-10.1177_23328584211073167 for Motivational Climate Predicts Student Evaluations of Teaching: Relationships Between Students’ Course Perceptions, Ease of Course, and Evaluations of Teaching by Brett D. Jones, Yasuo Miyazaki, Mengyun Li and Stephen Biscotte in AERA Open
Footnotes
Authors’ Note
Authors
BRETT D. JONES is a professor and leader of the Educational Psychology Program in the School of Education at Virginia Tech. He teaches courses related to motivation, cognition, and teaching strategies, and studies instructional methods that support students’ motivation and learning.
YASUO MIYAZAKI is a professor in the School of Education at Virginia Tech. His research involves developing new statistical and measurement modeling methodologies and improving current methodologies for the most pressing issues that plague behavioral and social scientists.
MENGYUN LI is a doctoral candidate in the School of Education at Virginia Tech. She is interested in educational research and evaluation, including instrument design, data collection, analysis, and visualization.
STEPHEN BISCOTTE is the director of general education at Virginia Tech. His scholarship has focused on the non-STEM student experience in STEM general education courses and he gives presentations about innovative structures, university reform efforts, and program evaluation.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
