Abstract
Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs’ instructional skills, including rater standards. Yet we know little about how these measurement challenges play out in the preservice context specifically. Here, we investigate the reliability and sensitivity of two observational measures. We find measures collected during student teaching are especially prone to measurement issues; only 3% to 4% of variation in scores reflects consistent differences between PSTs, while 9% to 17% of variation can be attributed to the mentors with whom they work. When high scores stem not from strong instructional skills, but instead from external circumstances, we cannot use them to make consequential decisions about PSTs’ individual needs or readiness for independent teaching.