AbediJ. (2002). Standardized achievement tests and English language learners: Psychometrics issues. Educational Assessment, 8, 231–257.
2.
AbediJ. (2004). The No Child Left Behind Act and English language learners: Assessment and accountability issues. Educational Researcher, 33(1), 4–14.
3.
AbediJ. (Ed.). (2007). English language proficiency assessment in the nation: Current status and future practice. Davis: UC Davis School of Education.
4.
AbediJ. (2008). Classification system for English language learners: Issues and recommendations. Educational Measurement: Issues and Practice, 27(3), 17–31.
5.
AbediJ. (2009). English Language Learners with disabilities: Classification, assessment, and accommodation issues. Journal of Applied Testing Technology, 10(2), 1–30.
6.
AbediJ. (2010). Research and recommendations for formative assessment with English language learners. In AndradeH. L.CizekG. J. (Eds.), Handbook of formative assessment (pp. 37–66). New York, NY: Routledge.
7.
AbediJ.HofstetterC. H.LordC. (2004). Assessment accommodations for English language learners: Implications for policy-based empirical research. Review of Educational Research, 74, 1–28.
8.
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
9.
BaileyA. L. (Ed.). (2007). The language demands of school: Putting academic language to the test. New Haven, CT: Yale University Press.
10.
BaileyA. L. (2010). Implications for instruction and assessment. In ShatzM.WilkinsonL. (Eds.), The education of English language learners (pp. 222–247). New York, NY: Guilford Press.
11.
BaileyA. L. (2011). Lessons from AZ’s EL identification struggle: How guidance could strengthen process. NCLB Advisor, 6(4), 5–8.
12.
BaileyA. L.ButlerF. A.StevensR.LordC. (2007). Further specifying the language demands of School. In BaileyA. L. (Ed.), The language demands of school: Putting academic language to the test (pp. 103–156). New Haven, CT: Yale University Press.
13.
BaileyA. L.HeritageM. (2008). Formative assessment for literacy, Grades K-6: Building reading and academic language skills across the curriculum. Thousand Oaks, CA: Sage.
14.
BaileyA. L.HeritageM. (2014). The role of language learning progressions in improved instruction and assessment of English language learners. TESOL Quarterly, 48, 480–506.
15.
BaileyA. L.HeritageM.ButlerF. A. (2014). Developmental considerations and curricular contexts in the assessment of young language learners. In KunnanA. J. (Ed.), The companion to language assessment (pp. 421–439). Hoboken, NJ: Wiley-Blackwell.
16.
BaileyA. L.HuangB. H. (2011). Do current English language development/proficiency standards reflect the English needed for success in school?Language Testing, 28, 343–365.
17.
BaileyA. L.KellyK. R. (2013). Home language survey practices in the initial identification of English learners in the United States. Educational Policy, 27, 770–804.
BennettR. E. (2010). Cognitively based assessment of, for, and as learning (CBAL): A preliminary theory of action for summative and formative assessment. Measurement, 8(2–3), 70–91.
20.
BlackP.WiliamD. (1998). Inside the black box: Raising standards through classroom assessment. London, England: Granada Learning Group.
21.
BlackP. J.WilsonM.YaoS.-Y. (2011). Road maps for learning: A guide to the navigation of learning progressions. Measurement, 9(2/3), 71–123.
22.
BoalsT.KenyonD. M.BlairA.CranleyM. E.WilmesC.WrightL. J. (2015). Transformation in K–12 English language proficiency assessment: Changing contexts, changing constructs. Review of Research in Education, 39, 122–164.
23.
BriggsD. C.Ruiz-PrimoM. A.FurtakE.ShepardL.YinY. (2012). Meta-analytic methodology and inferences about the efficacy of formative assessment. Educational Measurement: Issues and Practice, 31(4), 13–17.
24.
BrownJ. D. (2014). Score dependability and decision consistency. In KunnanA.J. (Ed.), The companion to language assessment (pp. 1182–1206). Hoboken, NJ: Wiley-Blackwell.
25.
CarrollP. E. (2012). Examining the validity of classifications from an English language proficiency assessment for English language learners and native English speakers in fifth grade (Master’s thesis). Available from ProQuest Dissertations and Theses database. (UMI No. 1516569)
26.
CarrollP. E. (2014, October). Where policy meets practice: Data use in a Title III program. Paper presented at the Northeastern Educational Research Association Annual Conference, Trumbull, CT.
27.
CarrollP. E.BaileyA. L. (2012, November). Examining conjunctive and compensatory classification models for an English language proficiency assessment: A descriptive study of English learners and native English speakers. Paper presented at the conference to honor Ronald Hambleton, University of Massachusetts, Amherst.
28.
CarrollP. E.BaileyA.L. (2013a, April). Combining multiple indicators in classifications of English language proficiency: A descriptive study. Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco, CA.
29.
CarrollP. E.BaileyA. L. (2013b, April). Language-as-resource in English learner assessment systems: Evaluating the fit of classification models. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
30.
Castro-OlivoS. M.PreciadoJ. A.SanfordA. K.PerryV. (2011). The academic and social-emotional needs of secondary Latino English learners: Implications for screening, identification, and instructional planning. Exceptionality, 19, 160–174.
31.
Council of Chief State School Officers. (2012, June). Framework for English language proficiency/development standards corresponding to the Common Core State Standards and the Next Generation Science Standards. Washington, DC: Author.
32.
ChesterM. D. (2003). Multiple measures and high-stakes decisions: A framework for combining measures. Educational Measurement: Issues in Practice, 22(2), 32–41.
33.
ClauserB. E.ClymanS. G.MargolisM. J.RossL. P. (1996). Are fully compensatory models appropriate for setting standards on performance assessments of clinical skills?Academic Medicine, 71(1 Suppl.), S90–S92.
34.
ColemanR.GoldenbergC. (2010). What does research say about effective practices for English learners? Part IV: Models for schools and districts. Kappa Delta Pi Record, 46, 156–163.
35.
Common Core State Standards Initiative. (2010a). Common core state standards for English language arts & literacy in history/social studies, science, and technical subjects. Retrieved from http://www.corestandards.org/assets/CCSSI_ELA%20Standards.pdf
CookH. G.BoalsT.WilmesC.SantosM. (2008). Issues in the development of annual measurable achievement objectives for WIDA consortium states (WCER Working Paper No. 2008-2). Madison: University of Wisconsin-Madison, Wisconsin Center for Education Research. Retrieved from http://www.wcer.wisc.edu/publications/workingPapers/Working_Paper_No_2008_02.pdf
38.
CookH. G.LinquantiR.ChinenM.JungH. (2012). National evaluation of Title III implementation supplemental report: Exploring approaches to setting English language proficiency performance criteria and monitoring English learner progress. Washington, DC: U.S. Department of Education, Office of Planning, Evaluation, and Policy Development, Policy and Program Studies Service. Retrieved from http://www.ed.gov/about/offices/list/opepd/ppss/index.html
39.
CookH. G.MacDonaldR. (2014). Reference performance level descriptors: Summary of a national working session on policies, practices, and tools for identifying potential English learners. Washington, DC: Council of Chief State School Officers.
40.
CraneE. W.BarratV. X.HuangM. (2011). The relationship between English proficiency and content knowledge for English language learner students in Grades 10 and 11 in Utah (Issues & Answers report, REL 2011-No. 110). Washington, DC: Regional Educational Laboratory West.
41.
DavidsonF.KimJ. T.LeeH.LiJ.LópezA. A. (2007). English language testing: Evidence from the evolution of test specifications. In BaileyA. L. (Ed.), The language demands of school: Putting academic English to the test (pp. 157–170). New Haven, CT: Yale University Press.
42.
DuránR. P. (2008). Assessing English-language learners’ achievement. Review of Research in Education, 32, 292–327.
43.
DuránR. P. (2011). Ensuring valid educational assessments for ELL students: Scores, score interpretation, and assessment uses. In BasterraM. R.TrumbullE.Solano-FloresG. (Eds.), Cultural validity in assessment: Addressing linguistic and cultural diversity (pp. 115–142). New York, NY: Routledge.
44.
DutroS. (2006, January). Providing language instruction. Aiming high/Aspirando a lo Mejor resource. Santa Rosa, CA: Sonoma County Office of Education.
45.
ElmoreR. F. (2004). Conclusion: The problem of stakes in performance-based accountability systems. In FuhrmanS.ElmoreR. F. (Eds.), Redesigning accountability systems for education (Vol. 38, pp. 274–296). New York, NY: Teachers College Press.
EricksonF. (2007). Some thoughts on “proximal” formative assessment of student learning. In MossP. (Ed.), Evidence in decision making: Yearbook of the National Society for the Study of Education (Vol. 106, pp. 186–216). Malden, MA: Blackwell.
48.
EstradaP. (2010, May). Expanding or eclipsing horizons for EL students: A cross-case analysis. Paper presented at the American Educational Research Association, Denver, Colorado.
EstradaP.WangH. (2013, May). Reclassifying and not reclassifying English learners to fluent English proficient, year 1 findings: Factors impeding and facilitating reclassification and access to the core. Paper presented at the American Educational Research Association, San Francisco, California.
51.
Faulkner-BondM.WaringS.ForteE.CrenshawR. L.TindleK.BelkapB. (2012). Language instruction educational programs (LIEPs): A review of the foundational literature. Washington, DC: U.S. Department of Education; Office of Planning, Evaluation and Policy Development; Policy and Program Studies Service. Retrieved from http://www2.ed.gov/rschstat/eval/title-iii/language-instruction-ed-programs-report.pdf
52.
FlorezI. R. (2012). Examining the validity of the Arizona English Language Learners Assessment cut scores. Language Policy, 11(1), 33–45.
FrantzR. S.BaileyA. L.StarrL.PereaL. (in press). Measuring academic language proficiency in school-age English language proficiency assessments under New College and Career Readiness Standards in the U.S. Language Assessment Quarterly.
55.
GándaraP. (2000). In the aftermath of the storm: English learners in the post-227 era. Bilingual Research Journal, 24(1&2), 1–13.
56.
GándaraP.RumbergerR.Maxwell-JollyJ.CallahanR. (2003, October7). English learners in California schools: Unequal resources, unequal outcomes. Education Policy Analysis Archives, 11(36). Retrieved from http://epaa.asu.edu/epaa/v11n36/
57.
GándaraP.MerinoB. (1993), Measuring the outcome of LEP programs: Test scores, exit rates, and other mythological data. Educational Evaluation and Policy Analysis, 15, 320–338.
58.
GarciaE. E.LawtonK.DinizdeFigueiredoE. H. (2010). Assessment of young English language learners in Arizona: Questioning the validity of the state measure of English proficiency. Los Angeles: Civil Rights Project, University of California. Retrieved from http://civilrightsproject.ucla.edu/research
59.
GarcíaG.McKoonG.AugustD. (2006). Language and literacy assessment of language-minority students. In AugustD.ShanahanT. (Eds.), Developing literacy in second-language learners (pp. 597–624). Mahwah, NJ: Lawrence Erlbaum.
60.
García BedollaL. G.RodriguezR. (2011). Classifying California’s English learners: Is the CELDT too blunt an instrument? (Policy reports and research briefs). Berkeley: Center for Latino Policy Research, Institute for the Study of Societal Issues, UC Berkeley.
HakutaK. (2011). Educating language minority students and affirming their equal rights: Research and practical perspectives. Educational Researcher, 40, 163–174.
65.
HaladynaT. M.DowningS. M. (2004). Construct-irrelevant variance in high-stakes testing. Educational Measurement: Issues and Practice, 23(1), 17–27.
HambletonR. K.PitoniakM. J. (2006). Setting performance standards. In BrennanR. (Ed.), Educational measurement (4th ed., pp. 433–470). Portsmouth, NH: Praeger.
68.
HeinS. F.SkaggsG. E. (2009). A qualitative investigation of panelists’ experiences of standard setting using two variations of the Bookmark method. Applied Measurement in Education, 22, 207–228.
69.
HeritageM. (2010). Formative assessment and next-generation assessment systems: Are we losing an opportunity?Washington, DC: Council of Chief State School Officers.
70.
HeritageM.HeritageJ. (2011, April). Teacher questioning: The epicenter of instruction and assessment. Paper presented at the annual meeting of the American Educational Research Association, New Orleans, LA.
71.
HeritageM.WalquiA.LinquantiR. T. (2013, April). Formative assessment as contingent communication: Perspectives on assessment as and for language learning in the content areas. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
72.
HeubertJ. P. (2004). High-stakes testing in a changing environment: Disparate impact, opportunity to learn, and current legal protections. In FuhrmanS. H.ElmoreR. F. (Eds.), Redesigning accountability systems for education (Vol. 38, pp. 220–242). New York, NY: Teachers College Press.
73.
HillL.WestonM.HayesJ. (2014). Reclassification of English-learner students in California. San Francisco: Public Policy Institute of California.
74.
HoffE. (2013). Interpreting the early language trajectories of children from low-SES and language minority homes: Implications for closing achievement gaps. Developmental Psychology, 49, 4–14.
75.
HopkinsM.ThompsonK. D.LinquantiR.HakutaK.AugustD. (2013). Fully accounting for English learner performance: A key issue in ESEA reauthorization. Educational Researcher, 42, 101–108.
76.
KaneM. (2013). Validating the interpretations and uses of test scores. Journal of Educational Measurement, 50, 1–73.
77.
KaratonisA.SireciS. G. (2006). The Bookmark standard-setting method: A literature review. Educational Measurement: Issues and Practice, 25(1), 4–12.
78.
KenyonD. M.MacGregorD.LiD.CookH. G. (2011). Issues in vertical scaling of a K-12 English language proficiency test. Language Testing, 28, 383–400.
79.
KiefferM. J.LesauxN. K.RiveraM.FrancisD. J. (2009). Accommodations for English language learners taking large-scale assessments: A meta-analysis of effectiveness and validity. Review of Educational Research, 79, 1168–1201.
80.
KiefferM. J.RiveraM.FrancisD. J. (2012). Practical guidelines for the education of English language learners: Book 4. Research-based recommendations for the use of accommodations in large-scale assessments: 2012 Update. Portsmouth, NH: Center on Instruction.
81.
KimJ. (2011). Relationships among and between ELL status, demographic characteristics, enrollment history, and school persistence (CRESST Report No. 810). Los Angeles: University of California, National Center for Research on Evaluation, Standards, and Student Testing.
82.
KimJ.HermanJ. L. (2010). When to exit ELL students: monitoring success and failure in mainstream classrooms after ELLs’ reclassification (CRESST Report No. 779). Los Angeles: University of California, National Center for Research on Evaluation, Standards, and Student Testing.
83.
KindlerA. (2002). Survey of the states’ limited English proficient students and available educational programs and services: 2000-2001 Summary report. Washington, DC: National Clearinghouse for English Language Acquisition.
84.
KoprivaR.AlbersC. A. (2013). Considerations for achievement testing of students with individual needs. In GeisingerK. F. (Ed.), Handbook of testing and assessment in psychology (pp. 370–390). Washington, DC: American Psychological Association.
85.
KoprivaR.GabelD.CameronC. (2011). Designing dynamic and interactive assessments for English learners that directly measure targeted science constructs. Evanston, IL: Society for Research on Educational Effectiveness.
86.
LaneS.LeventhalB. (2015). Psychometric challenges in assessing English language learners and students with disabilites. Review of Research in Education, 39, 165–214.
87.
LinquantiR.BaileyA. L. (2014). Reprising the Home Language Survey: Summary of a national working session on policies, practices, and tools for identifying potential English learners. Washington, DC: Council of Chief State School Officers.
88.
LinquantiR.CookH. G. (2013). Toward a “common definition of English learner”: Guidance for states and state assessment consortia in defining and addressing policy and technical issues and options. Washington, DC: Council of Chief State School Officers.
89.
LinquantiR.HakutaK. (2012). How next-generation standards and assessments can foster success for California’s English learners (PACE Policy Brief No. 12-1). Stanford, CA: School of Education, Stanford University. Retrieved from http://www.edpolicyinca.org
90.
LlosaL. (2008). Building and supporting a validity argument for a standards-based classroom assessment of English proficiency based on teacher judgments. Educational Measurement: Issues and Practice, 27(3), 32–42.
91.
MahoneyK. S.HaladynaT.MacSwanJ. (2009). The need for multiple measures in reclassification decisions: A validity study of the Stanford English Language Proficiency Test. In WileyT. G.LeeJ. S.RumbergerR. W. (Eds.), The education of language minority immigrants in the United States (pp. 240–262). Tonawanda, NY: Multilingual Matters.
92.
MartinielloM. (2009). Linguistic complexity, schematic representations, and differential item functioning for English language learners in math tests. Educational Assessment, 14, 160–179.
93.
MayerJ. (2007). Policy needs: What federal and state governments need from language research. In BaileyA. L. (Ed.), The language demands of school: Putting academic language to the test. New Haven, CT: Yale University Press.
94.
McMillanJ. H. (2010). The practical implications of educational aims and contexts for formative assessment. In AndradeH. L.CizekG. J. (Eds.), Handbook of formative assessment (pp. 41–58). New York, NY: Routledge.
95.
MenkenK. (2008). English learners left behind: Standardized testing as language policy (Vol. 65). Cleveden, England: Multilingual Matters.
96.
MillerJ. F.HeilmannJ.NockertsA.IglesiasA.FabianoL.FrancisD. J. (2006). Oral language and reading in bilingual children. Learning Disabilities Research & Practice, 21, 30–43.
97.
MitzelH. C.LewisD. M.PatzR. J.GreenD. R. (2001). The bookmark procedure: Psychological perspectives. In CizekG. J. (Ed.), Setting performance standards: Concepts, methods, and perspectives (pp. 249–281). Mahwah, NJ: Lawrence Erlbaum.
98.
MosierC. I. (1943). On the reliability of a weighted composite. Psychometrika, 8, 161–168.
National Research Council. (2011). Allocating federal funds for state programs for English language learners. Washington, DC: National Academies Press.
101.
NGSS Lead States. (2013). Next Generation Science Standards. For states, by states. Washington, DC: National Academies Press.
102.
No Child Left Behind Act of 2001. Conference report to accompany H.R., 1, Rep. No. 107-334, House of Representatives, 107th Congress, 1st Session, December 13. Pub. L. No. 107-110, 115 Stat. 1425.
103.
OlsenL. (2010). Reparable harm: Fulfilling the unkept promise of educational opportunity for long term English Learners. Long Beach: Californians Together.
104.
PalermoF.MikulskiA. M.FabesR. A.HanishL. D.MartinC. L.StargelL. E. (2013). English exposure in the home and classroom: Predictions to Spanish-speaking preschoolers’ English vocabulary skills. Applied Psycholinguistics, 35, 1163–1187.
105.
ParrishT. B.MerickelA.PerezM.LinquantiR.SociasM.SpainA.. . . DelanceyD. (2006). Effects of the implementation of Proposition 227 on the education of English learners, K-12: Findings from a five-year evaluation. Palo Alto, CA: American Institutes for Research and WestEd.
106.
PerieM.ForteE. (2011). Developing a validity argument for assessments of students in the margins. In RussellM. (Ed.), Assessing students in the margins: Challenges, strategies, and techniques (pp. 335–378). Charlotte, NC: Information Age.
107.
PorterS. G.VegaJ. (2007). Overview of existing English language proficiency tests. In AbediJ. (Ed.), English language proficiency assessment in the nation: Current status and future practice (pp. 93–104). Davis: UC Davis School of Education.
108.
RaganA.LesauxN. (2006). Federal, state, and district level English language learner program entry and exit requirements: Effects on the education of language minority learners. Education Policy Analysis Archives, 14, 1–29. Retrieved from http://epaa.asu.edu/ojs/article/view/91
109.
RobinsonJ. P. (2011). Evaluating criteria for English learner reclassification: A causal-effects approach using a binding-score regression discontinuity design with instrumental variables. Educational Evaluation and Policy Analysis, 33, 267–292.
110.
SireciS. G.Faulkner-BondM. (2015). Promoting validity in the assessment of English learners. Review of Research in Education, 39, 215–252.
111.
SireciS. G.LiS.ScarpatiS. (2003). The effects of test accommodation on test performance: A review of the literature (Center for Educational Assessment Research Report No. 485). Retrieved from http://www.cehd.umn.edu/nceo/onlinepubs/testaccommlitreview.pdf
112.
SlamaR. B. (2012). A longitudinal analysis of academic English proficiency outcomes for adolescent English language learners in the United States. Journal of Educational Psychology, 104, 265–285.
113.
SlamaR. B. (2014). Investigating whether and when English learners are reclassified into mainstream classrooms in the United States A discrete-time survival analysis. American Educational Research Journal, 51, 220–252.
114.
SnowC. E.UccelliP. (2009). The challenge of academic language. In OlsonD. R.TorranceN. (Eds.), The Cambridge handbook of literacy (pp. 112–133). New York, NY: Cambridge University Press.
115.
Solano-FloresG. (2011). Assessing the cultural validity of assessment practices: An introduction. In BasterraM. R.TrumbullE.Solano-FloresE.& G. (Eds.), Cultural validity in assessment: Addressing linguistic and cultural diversity (pp. 3–21). New York, NY: Routledge.
116.
Stokes-GuinanK.GoldenbergC. (2010). Use with caution: What CELDT results can and cannot tell us. CATESOL Journal, 22, 189–202.
117.
SztajnP.ConfreyJ.WilsonP. H.EdgingtonC. (2012). Learning trajectory based instruction: Toward a theory of teaching. Educational Researcher, 41, 147–156.
118.
ThurlowM. L.KoprivaR. J. (2015). Advancing accessibility and accommodations in content assessments for students with disabilities and English learners. Review of Research in Education, 39, 331–369.
119.
TorranceH.PryorJ. (1998). Investigating formative assessment. Philadelphia: Open University Press.
120.
ValdésG.FigueroaR. A. (1995). Bilingualism and testing: A special case of bias. Norwood, NJ: Ablex.
121.
van LierL.WalquiA. (2012). Language and the common core standards. In HakutaK.SantosM. (Eds.), Understanding language: Commissioned papers on language and literacy issues in the Common Core State Standards and Next Generation Science Standards (pp. 44–51). Palo Alto, CA: Stanford University
122.
WangJ.NiemiD.WangH. (2007). Impact of different performance assessment cut scores on student promotion (CSE Report No. 719). Retrieved from http://files.eric.ed.gov/fulltext/ED498443.pdf
123.
WIDA Consortium. (2004). English language proficiency standards for English language learners in pre-kindergarten through Grade 12. Madison: Board of Regents of the University of Wisconsin System.
124.
WIDA Consortium. (2007). English language proficiency standards for English language learners in pre-kindergarten through Grade 12. Madison: Board of Regents of the University of Wisconsin System.
125.
WIDA Consortium. (2011). MODEL-K. Madison, WI: Board of Regents of the University of Wisconsin System.
126.
WIDA Consortium. (2012). Amplification of the English Language Development Standards. Madison: Board of Regents of the University of Wisconsin System. Retrieved from http://www.wida.us/standards/eld.aspx
127.
WinkeP. (2011). Evaluating the validity of a high-stakes ESL test: Why teachers’ perceptions matter. TESOL Quarterly, 45, 628–660.
WolfM. K.KaoJ.GriffinN.HermanJ. L.BachmanP. L.ChangS.FarnsworthT. (2008). Issues in assessing English language learners: English language proficiency measures and accommodation uses—Practice review (CSE Tech. Rep. No. 732). Los Angeles: University of California, National Center for Research on Evaluation, Standards, and Student Testing.
130.
YoungJ. W.ChoY.LingG.ClineF.SteinbergJ.StoneE. (2008). Validity and fairness of state standards-based assessments for English language learners. Educational Assessment, 13, 170–192.
131.
YoungJ. W.SteinbergJ.ClineF.StoneE.MartinielloM.LingG.ChoY. (2010). Examining the validity of standards-based assessments for initially fluent students and former English language learners. Educational Assessment, 15, 87–106.
132.
ZhangB. (2010). Assessing the accuracy and consistency of language proficiency classification under competing measurement models. Language Testing, 27, 119–140.