Alderson, J. C. (1979). The effect on the cloze test of changes in deletion frequency. Journal of Research in Reading, 2(2), 108-119. 	doi: 10.1111/j.1467-9817.1979.tb00198.x

Alderson, J. C. (1980). Native and nonnative speaker performance on cloze tests. Language Learning, 30(1), 59-76. doi: 10.1111/j.1467-1770.1980.tb00151.x

Alderson, J. C. (1983). The cloze procedure and proficiency in English as a foreign language. In J. W. Jr. Oller (Ed.), Issues in language testing research (pp. 205-217). Rowley, MA: Newbury House.  

Alderson, J. C. (2000). Assessing reading. Cambridge: Cambridge University Press.

Alderson, J. C. (2005). Diagnosing foreign language proficiency: The interface between learning and assessment. London: Continuum.

Anderson, J. (1976). Psycholinguistic experiments in foreign language testing. St Lucia, Queensland: University of Queensland Press.

Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43(4), 561-573. doi: 10.1007/BF02293814

Andrich, D. (2011). Testlets and threshold disordering. Rasch Measurement Transactions, 251(1), 1318-1399.

Babaii, E., & Ansary, H. (2001). The C-test: A valid operationalization of reduced redundancy principle? System, 29(2), 209-219. doi: 10.1016/S0346-251X(01)00012-4

Bachman, L. F. (1981). The trait structure of cloze test scores. Paper presented at the 1981 TESOL Midwest Regional Conference and Illinois TESOL/BE Convention, Champaign-Urbana.

Bachman, L. F. (1985). Performance on cloze tests with fixed-ratio and rational deletion. TESOL Quarterly, 19(3), 535-56. doi: 10.2307/3586277

Bachman, L. F. (1990). Fundamental considerations in language testing. Oxford: Oxford University Press.

Baghaei, P., & Doebler, P. (2019). Introduction to the Rasch Poisson counts model: An R tutorial. Psychological Reports, 122(5), 1967-1994. doi: 10.1177/0033294118797577

Baghaei, P., & Effatpanah, F. (2022). Elements of psychometrics (2nd Ed.). Mashhad, Iran: Sokhan Gostar Publishing. 

Baghaei, P., & Grotjahn, R. (2014). The validity of C-Tests as measures of academic and everyday language proficiency: A multidimensional item response modeling study. In R. Grotjahn (Ed.), Der C-Test: Aktuelle Tendenzen/The C-Test: Current trends (pp. 163-171.). Frankfurt/M.: Lang.

Baker, B. A. (2011). Use of the cloze-elide task in high-stakes English proficiency testing. Spaan Fellow Working Papers in Second or Foreign Language Assessment, 9, 1-16.

Bolt, D. M., Cohen, A. S., & Wollack, J. A. (2002). Item parameter estimation under conditions of test speededness: Application of a mixture Rasch model with ordinal constraints. Journal of Educational Measurement, 39(4), 331-348. doi: 10.1111/j.1745-3984.2002.tb01146.x

Bond, T. G., & Fox, C. M. (2015). Applying the Rasch model: Fundamental measurement in the human sciences (3rd Ed.). New York: Routledge.

Bormuth, J. R. (1967). Comparable cloze and multiple-choice comprehension tests scores. Journal of Reading, 10(5), 291-299. 

Bowen, J. D. (1978). The identification of irrelevant lexical distraction: an editing task. TESL Reporter, 12(1), 14-16. 

Carroll, J. B. (1961). Fundamental considerations in testing for English language proficiency of foreign students. In Testing the English proficiency of foreign students. Washington, DC: Center for Applied Linguistics.

Chapelle, C. A., & Abraham, R. G. (1990). Cloze method: What difference does it make? Language Testing, 7(2), 121-146. doi: 10.1177/026553229000700201

Davies, A. (1975). Two tests of speeded reading. In R. L. Jones, & B. Spolsky (Eds.), Testing language proficiency. Washington, DC: Center for Applied Linguistics. 

Davies, A. (1989). Testing reading speed through text retrieval. In C. N. Candlin, & T. F. McNamara (Eds.), Language learning and community. Sydney, NSW: NCELTR. 

Davis, M., & Gardner, D. (2010). A frequency dictionary of contemporary American English: Word Sketches, Collocates & Thematic Lists. New York: Routledge.

Desjardins, C. D., & Bulut, O. (2018). Handbook of educational measurement and psychometrics using R. Boca Raton, FL: Chapman & Hall/CRC Press.

Doebler, A., & Holling, H. (2016). A processing speed test based on rule-based item generation: An analysis with the Rasch Poisson Counts Model. Learning and Individual Differences, 52, 121-128. doi: 10.1016/j.lindif.2015.01.013

Eckes, T. (2010). Rasch models for C-tests: Closing the gap on modern psychometric theory. In A. Berndt, & K. Kleppin (Eds.), Sprachlehrforschung: Theorie und empirie – Festschrift für Rüdiger Grotjahn (pp. 39-49). Frankfurt, Germany: Lang.

Eckes, T. (2011). Item banking for C-tests: A polytomous Rasch modeling approach. Psychological Test and Assessment Modeling, 53(4), 414-439. 

Eckes, T., & Grotjahn, R. (2006a). A closer look at the construct validity of C-tests.  Language Testing, 23(3), 290-325. doi: 10.1191/0265532206lt330oa

Eckes, T., & Grotjahn, R. (2006b). C-Tests als Anker fur TestDaF: Rasch-Analysen mit dem kontinuierlichen Ratingskalen-Modell [C-tests as an anchor in TestDaF: Rasch-analyses with the continous rating scale model]. In R. Grotjahn (Ed.), Der C-Test: Theorie, empirie, anwendungen (pp. 167–193). Frankfurt am Main, Germany: Peter Lang.

Effatpanah, F. (2019). Cognitive diagnostic assessment of Iranian EFL university students’ L2 writing ability: Selecting the best model (Unpublished master’s thesis). Islamic Azad University, Mashhad, Iran. 

Effatpanah, F., & Baghaei, P. (2021). Cognitive components of writing in a second language: an analysis with the Linear Logistic Test Model. Psychological Test and Assessment Modeling, 63(1), 13-44. 

Elder, C., & von Randow, J. (2008). Exploring the utility of a web-based English language screening tool. Language Assessment Quarterly, 5(3), 173-194. doi: 10.1080/15434300802229334

Farhady, H. (1979). The disjunctive fallacy between discrete-point and integrative tests. TESOL Quarterly, 13(3), 347-357. doi: 10.2307/3585882

Farhady, H. (1996). Varieties of cloze procedure in EFL education. Roshd Foreign Language Teaching Journal, 12, 217-229.

Forthmann, B., Gühne, D., & Doebler, P. (2019). Revisiting dispersion in count data item  response theory models: The Conway–Maxwell–Poisson counts model. British Journal of Mathematical and Statistical Psychology, 73(1), 32-50. doi: 10.1111/bmsp.12184

Forthmann, B., Grotjahn, R., Doebler, P., & Baghaei, P. (2020). A comparison of different item response theory models for scaling speeded C-tests. Journal of Psychoeducational Assessment, 38(6), 692-705. doi: 10.1177/0734282919889262

Friedman, M. M. (1964). The use of the cloze procedure for improving the reading comprehension of foreign students at the University of Florida (Unpublished doctoral dissertation). Miami: University of Florida.

Gaies, S. J. (1987). Validation of the Noise Test. In R. Grotjahn, C. Klein-Braley, & D. K. Stevenson, (Eds.), Taking their measure: The validity and validation of language tests (pp. 41-74). Bochum: Brockmeyer. 

Gaies, S. J., Gradman, H. J., & Spolsky, B. (1977). Towards the measurement of functional proficiency: Contextualization of the Noise Test. TESOL Quarterly, 11(1), 51-57. doi: 10.2307/3585591

Gradman, H. L., & Spolsky, B. (1975). Reduced redundancy testing: A progress report. In   R. L. Jones, & B. Spolsky (Eds.), Testing language proficiency (pp. 59-70). Arlington, VA: Center for Applied Linguistics.

Grotjahn, R. (2010). Der C-Test: Beitrage aus der aktullen forschung The C-Test: Contributions from current research. Frankfurt/ M: Lang.

Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Boston, MA: Kluwer- Nijhoff.

Harsch, C., & Hartig, J. (2010). Empirische und inhaltliche Analyse lokaler Abhängigkeiten im C-Test [Empirical and content analysis of local dependencies in C-tests]. In R. Grotjahn (Ed.), Der C-Test: Beiträge aus der aktuellen Forschung [The C-test: Contributions from current research] (pp. 193–204). Frankfurt, Germany: Lang.

Heckman, R. W., Tiffin, J., & Snow, R. E. (1967). Effects of controlling item exposure in achievement testing. Educational and Psychological Measurement, 27(1), 113-125. doi: 10.1177/001316446702700111

Hudson, T. (2007). Teaching second language reading. New York: Oxford University Press.

Janssen, R., & De Boeck, P. (1999). Confirmatory analyses of componential test structure using multidimensional item response theory. Multivariate Behavioral Research, 34(2), 245-268. doi: 10.1207/S15327906Mb340205

Johansson, S. (1973). Partial dictation as a test of foreign language proficiency. Swedish English Contrastive Studies 3. Lund: University of Lund, Department of English.

Johansson, S. (1974). Controlled distortion as a language testing tool. In J. Qvistgaard, H. Schwarz, & H. Spang-Hanssen, (Eds.), Applied linguistics, problems, and solutions: AILA proceedings Copenhagen. III (pp. 397-411). Heidelberg: Julius Groos Verlag.

Jonz, J. (1976). Improving on the basic egg: The multiple choice cloze. Language Learning, 26(2), 255-65. doi: 10.1111/j.1467-1770.1976.tb00276.x

Klein-Braley, C. (1981). Empirical investigations of cloze tests (Unpublished doctoral dissertation). Germany: University of Duisburg.

Klein-Braley, C. (1997). C-tests in the context of reduced redundancy testing: An appraisal. Language Testing, 14(1), 47-84. doi: 10.1177/026553229701400104

Klein-Braley, C., & Raatz, U. (1984). A survey of research on the C-test. Language Testing, 1(2), 134-146. doi: 10.1177/026553228400100202

Lee, S. H. (2008). Beyond reading and proficiency assessment: The rational cloze procedure as stimulus for integrated reading, writing, and vocabulary instruction and teacher–student interaction in ESL. System, 36(4), 642-660. doi: 10.1016/j.system.2008.04.002

Lee, L., & Gunderson, E. (2011). Select Reading. London: Oxford University Press.

Li, E. F. (2013). The impact of unobserved extreme categories on item and person estimates: A simulation study. In Q. Zhang, & H. Yang (Eds.), Pacific Rim Objective Measurement Symposium (PROMS) 2012 Conference Proceeding (pp. 117-128). Springer.

Linacre, J. M. (1999). Investigating rating scale category utility. Journal of Outcome Measurement, 3, 103-122. 

Linacre, M. (2002). What do infit and outfit, mean-square and standardized mean? Rasch Measurement Transactions, 16(2), 1-99. 

Linacre, J. M. (2009). A user’s guide to WINSTEPS. Chicago, IL: Winsteps.

Manning, W. H. (1987). Development of cloze-elide tests of English as a second language. Princeton, NJ: Educational Testing Service.

Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174. doi: 10.1007/BF02296272

Müller, H. (1987). A Rasch model for continuous ratings. Psychometrika, 52(2), 165-181. doi: 10.1007/BF02294232

Norris, J. M. (2018). Developing and investigating C-tests in eight languages: Measuring proficiency for research purposes. In J. M. Norris (Ed.), Developing C-tests for estimating proficiency in foreign language research (pp. 7-33). Germany: Lang.

Oller, J. W. Jr. (1971). Dictation as a device for testing foreign language proficiency. English Language Teaching Journal, 25(3), 254-259. doi: 10.1093/elt/XXV.3.254

Oller, J. W. Jr. (1973). Cloze tests of language proficiency and what they measure. Language Learning, 23(1), 105-18. doi: 10.1111/j.1467-1770.1973.tb00100.x

Oller, J. W. Jr. (1976). Evidence for a general language proficiency factor: An expectancy grammar. Die Neueren Spracen, 75(2), 165-174.

Oller, J. W. Jr. (1979). Language tests at school: A pragmatic approach. London: Longman.

Oller, J. W. Jr., & Conrad, C. (1971). The cloze technique and ESL proficiency. Language Learning, 21(2), 183-95. doi: 10.1111/j.1467-1770.1971.tb00057.x

O'Reilly, R. P., & Streeter, R. E. (1977). Report on the development and validation of a system for measuring literal comprehension in a multiple-choice cloze format: Preliminary factor analytic results. Journal of Literacy Research, 9, 45-69. doi: 10.1080/10862967709547206

Ozete, O. (1977). The cloze procedure: A modification. Foreign Language Annals, 10(5), 565-568. doi: 10.1111/j.1944-9720.1977.tb03033.x

Porter D. (1976). Modified cloze procedure: A more valid reading comprehension test. English Language Teaching Journal, 30(2), 151-155. doi: 10.1093/elt/XXX.2.151

Raatz, U. (1985). Tests of reduced redundancy- the C-Test, a practical example. In C. Klein-Braley & U. Raatz (Eds.), Fremdsprachen und Hochschule 13/14: Thematischer Teil: C-Tests in der Praxis (pp. 14-19). Bochum: AKS. 

Raatz, U., & Klein-Braley, C. (1981). The C-test: A modification of the cloze procedure. In T. Culhane, C. Klein-Braley, & D. K. Stevenson (Eds.), Practice and problems in language testing (pp. 113-145). Colchester, UK: University of Essex.

Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: Danish Institute for Educational Research.

Rosenbaum, P. R. (1988). Item bundles. Psychometrika, 53(3), 349-359. doi: 10.1007/BF02294217

Ruddell, R. B. (1964). A study of the cloze comprehension technique in relation to Structurally controlled reading material. Improvement of Reading Through Classroom Practice, 9, 298-303. 

Sireci, S. G., Thissen, D., & Wainer, H. (1991). On the reliability of testlet-based tests. Journal of Educational Measurement, 28(3), 237-247. doi: 10.1111/j.1745-3984.1991.tb00356.x

Smith, R. M., & Plackner, C. (2009). The family approach to assessing fit in Rasch measurement. Journal of Applied Measurement, 10(4), 424-437. 

Spolsky, B. (1968). What does it mean to know a language, or how do you get someone to perform his competence? Presented at the Second Conference on Problems in Foreign Language Testing (pp. 1-24), University of Southern California, U.S.A, November 7-9.

Spolsky, B. (1969). Reduced redundancy as a language testing tool (pp. 1-18). Presented at the Second International Congress of Applied Linguistics, Cambridge, England, September, 8-12.

Spolsky, B. (1971). Reduced redundancy as a language testing tool. In G. E. Perren & J. L. M. Trim (Eds.), Applications of linguistics (pp. 383-390).Cambridge: Cambridge University Press. 

Spolsky, B., Sigurd, B., Sato, M., Walker, E., & Arterburn, C. (1968). Preliminary studies in the development of techniques for testing overall second language proficiency. Language Learning, 18(Suppl 3), 79-101. doi: 10.1111/j.1467-1770.1968.tb00224.x

Stansfield, C., & Hansen, J. (1983). Field dependence-independence as a variable in second language cloze test performance. TESOL Quarterly, 17(1), 29-38. doi: 10.2307/3586422

Taylor, W. L. (1953). “Cloze procedure”: A new tool for measuring readability. Journalism  Quarterly, 30(4), 415-433. doi: 10.1177/107769905303000401

Thissen, D., Steinberg, L., & Mooney, J. A. (1989). Trace lines for testlets: A use of multiple-categorical-response models. Journal of Educational Measurement, 26(3), 247-260.

Wainer, H., & Wang, X. (2000). Using a new statistical model for testlets to score TOEFL. Journal of Educational Measurement, 37(3), 203-220. doi: 10.1111/j.1745-3984.2000.tb01083.x

Wang, W. C., & Wilson, M. (2005a). Exploring local item dependence using a random-effects facet model. Applied Psychological Measurement, 29(4), 296-318. doi: 10.1177/0146621605276281

Wang, W. C., & Wilson, M. (2005b). The Rasch testlet model. Applied Psychological Measurement, 29(2), 126-149. doi: 10.1177/0146621604271053

Wright, B. D. (1994). Local dependency, correlations, and principal components. Rasch Measurement Transactions, 10(3), 509-511. 

Wright, B. D., & Linacre, J. M. (1994). Reasonable mean-square fit values. Rasch Measurement Transactions, 8, 370-399. 

Yen, W. M., & Fitzpatrick, A. R. (2006). Item response theory. In R. L. Brennan (Ed.), Educational measurement (4th ed., pp. 111-153). Westport, CT: Praeger.

Zare, S., & Boori, A. A. (2018). Psychometric evaluation of the speeded cloze-elide test as a general test of proficiency in English as a foreign language. International Journal of Language Testing, 8(2), 33-43.