BraunH.I., and MislevyR.J. (2004). Intuitive test theory. CSE Technical Report #631. Los Angeles: The National Center for Research on Evaluation, Standards, Student Testing (CRESST), Center for Studies in Education, UCLA.
2.
CappsW.R., and MaxwellM.E. (1999). Where everybody knows your name: The beauty of small schools. The American School Board Journal, 186(9), 35–36.
3.
ChudowskyN., and BehuniakP. (1998). Using focus groups to examine the consequential aspect of validity. Educational Measurement: Issues and Practice, 17(4), 28–38.
4.
DeciE.L., SpiegelN.H., RyanR.M., KoestnerR., and KauffmanM. (1982). The effects of performance standards on teaching styles: The behavior of controlling teachers. Journal of Educational Psychology, 74(6), 852–859.
5.
FlinkC., BoggianoA.K., and BarrettM. (1990). Controlling teacher strategies: Undermining children's self-determination and performance. Journal of Personality and Social Psychology, 59(5), 916–924.
6.
GiussaniL (2001). The risk of education.Rosanna M. Giammanco Frongia, trans. New York: The Crossroad Publishing Company.
7.
GoodenowC (1993). Classroom belonging among early adolescent students: Relationships to motivation and achievement. Journal of Early Adolescence, 13(1), 21–43.
8.
GrmekM.I., and KrecicM.J. (2004). Impact of external examinations (Matura) on school lessons. Educational Studies, 30, 319–329.
9.
GrolnickW.S., and RyanR.M. (1987). Autonomy in children's learning: An experimental and individual difference investigation. Journal of Personality and Social Psychology, 52(5), 890–898.
10.
KleinS.P., HamiltonL.S., McCaffreyD.F., and StecherB.M. (2000). What do test scores in Texas tell us?Education Policy Analysis Archives, 8(49).
11.
KoretzD (2002). Limitations in the use of achievement tests as measures of educators' productivity. The Journal of Human Resources, 37, 752–777.
12.
KoretzD (2003). Using multiple measures to address perverse incentives and score inflation. Educational Measurement: Issues and Practice, 22(2), 18–26.
13.
ResnickL.B., and ResnickD.P. (1992). Assessing the thinking curriculum: New tools for educational reform. In GiffordB. R., and O'ConnorM. C. (Eds.), Changing assessments: Alternative views of aptitude, achievement, and instruction. (pp. 37–75). Boston, MA: Kluwer Academic Publishers.
14.
RotbergI. C. (1995). Myths about test score comparisons. Science, 270, 1446–1448.
15.
RyanR.M., and ConnellJ.P. (1989). Perceived locus of causality and internalization: Examining reasons for acting in two domains. Journal of Personality and Social Psychology, 57(5), 749–761.