Use of Test Blueprint in Improving Teachers’ Test Construction Skills for Quality Assessment
Main Article Content
Abstract
This study examined the use of test blueprint in improving teachers’ test construction skills for quality assessment. Test blueprint is an important and effective tool for quality test construction. Test blueprints provide a systematic approach to test development, ensuring that assessments are well-constructed, reliable, and valid measures of students’ learning outcomes. By using a test blueprint, teachers can ensure that their assessments are closely aligned with learning objectives, provide appropriate test item selection, and provide a balanced coverage of content. The use of test blueprints can help teachers create assessments that provide accurate measures of students’ learning outcomes and useful feedback to students. By providing a systematic approach to assessment development, teachers can improve the reliability, validity, and fairness of their assessments, which ultimately results in more accurate measurements of students’ learning experiences, leading to quality assessment of students’ learning outcomes. The study concluded that Test blueprints are crucial for enhancing teachers’ test construction skills, ensuring tests are well-constructed, align with learning objectives, and provide accurate measures and feedback, thereby improving the reliability and validity of assessments. The study suggested among others that teachers should have a clear understanding of the test blueprint, which outlines the content, objectives, and skills that a test should assess; teachers should use test blueprint as a guide for developing test items that accurately assess the intended knowledge and skills of learners; teachers should continuously review their test blueprints to ensure that they are still relevant and valid.
Article Details
References
Ajayi, I. A. (2019). Evaluation of test construction skills among Nigerian teachers: A comparative Study. African Journal of Educational Research, 11(2), 45-56.
Akinsola, M.K., Tella, A. & Tella, A. (2017). Assessment of test construction skills among teachers in Nigeria: Implications for quality education. Journal of Education and Practice, 8(4), 11-18.
Allen, J. J., & Yen, W. M. (2021). Validity and reliability of a new computer-adaptive test for assessing English language proficiency. Language Testing, 38(1), 48-69.
American Psychological Association. (2020). Publication manual of the American psychological association (7th ed.). American Psychological Association.
Amin, T., Shabbir, M., & Amin, N. (2021). Writing better multiple choice questions: A guide for novice test writers. Medical Education Online, 26(1), 50-58.
Amrein-Beardsley, A., & Collins, C. (2021). Beyond test blueprints: Using frameworks to enhance teacher-created assessments. Educational Assessment, Evaluation and Accountability, 33(1), 1-22.
Bakare, J. A. (2018). Enhancing test construction skills: A case study of Nigerian teachers. International Journal of Educational Psychology and Counseling, 2(1), 22-33.
Baruwa, A. O. (2020). Investigating the test construction skills of Nigerian teachers: A mixed-methods approach. Journal of Educational Measurement, 15(3), 78-89.
Baumeister, R. F., &Vohs, K. D. (2022). Test blueprints and the measurement of self-control. Journal of Personality Assessment, 104(1), 25-36.
Bello, S. O. (2017). Assessing test construction skills among Nigerian educators: A comparative analysis. Nigerian Journal of Educational Assessment, 5(2), 33-44.
Black, P., & Wiliam, D. (2018). Classroom assessment and pedagogy. Assessment in Education: Principles, Policy & Practice, 25(6), 551-575.
Briscoe, J. P., & Claus, L. (2018). Handbook of employee selection. London: Penguin Random House.
Cavanagh, M., & Chenoweth, L. (2021). Constructing effective multiple-choice questions to measure learning outcomes in online environments. Journal of Applied Research in Higher Education, 13(1), 94-107.
Chen, C. H., & Chen, H. L. (2022). The effects of online quizzes on learning: A meta-analysis. Computers & Education, 174, 104-129.
Cohen, R. J., & Swerdlik, M. E. (2018). Psychological testing and assessment: An introduction to tests and measurement (9th ed.). McGraw-Hill Education.
Deryugina, T., Shurchkov, O., & Babajanian, B. (2021). Algorithmic bias in hiring: An audit study. Journal of Political Economy, 129(4), 1241-1284.
Downing, S. M. (2006). Twelve steps for effective test development. In S. M. Downing & T. M. Haladyna (Eds.), Handbook of test development (pp. 3-25). Lawrence Erlbaum Associates.
Downing, S. M. (2019). Validity and reliability of assessment in medical education. AMEE Guide No. 37. Medical Teacher, 41(3), 271-279.
Educational Testing Service. (2018). Test blueprinting: An overview. Retrieved from https://www.ets.org/
Eze, C. E. (2018). Enhancing test construction skills among Nigerian teachers: A professional development approach. Journal of African Educational Research, 7(1), 56-68.
Fives, H., & DiDonato-Barnes, N. (2013). Classroom test construction: The power of a table of specifications. Practical Assessment, Research, and Evaluation, 18(3), 12-17.
Ford, J. K., & Kozlowski, S. W. (2021). Developing and using test blueprints: A primer for practitioners. Organizational Research Methods, 24(1), 1-26.
Furr, R. M. (2011). Scale construction and psychometrics for social and personality psychology. SAGE Publications.
Gbadegesin, S. (2019). Test construction skills among Nigerian teachers: Challenges and solutions. African Educational Research Journal, 6(3), 112-125.
Geisinger, K. F. (2013). APA handbook of testing and assessment in psychology: Test theory and testing and assessment in industrial and organizational psychology. American Psychological Association.
Gonzalez, M. (2022). Test blueprint development for a computer-based test of English for academic purposes. Language Assessment Quarterly, 19(1), 49-71.
Guskey, T. R., & Bailey, J. M. (2010). Developing grading and reporting systems for student learning. Corwin Press.
Gutiérrez, D., García, J. I., & Llor, M. (2020). Virtual reality psychological assessment: Advantages, limitations, and future challenges. Frontiers in Psychology, 11, 20-26.
Haladyna, T. M. (2019). Developing and validating multiple-choice test items (4th ed.). United Kingdom: British Publishing Company.
Haladyna, T. M., & Downing, S. M. (2004). Construct-irrelevant variance in high-stakes testing. Educational Measurement: Issues and Practice, 23(1), 17-27.
Haladyna, T. M., & Downing, S. M. (2004). Construct-irrelevant variance in high-stakes testing. Educational Measurement: Issues and Practice, 23(1), 17-27.
Haladyna, T. M., Downing, S. M., & Rodriguez, M. C. (2002). A review of multiple-choice item-writing guidelines for classroom assessment. Applied Measurement in Education, 15(3), 309-334).
Hambleton, R. K., & Zenisky, A. L. (2011). Criterion-referenced testing. In M. L. Kamil, P. D. Pearson, E. B. Moje, & P. P. Afflerbach (Eds.), Handbook of reading research, (pp. 509-539). Routledge.
Herman, J. L., & Golan, S. (1991). Effects of standardized testing on teachers and learning another look (CSE Technical Report 334). National Center for Research on Evaluation, Standards and Student Testing (CRESST) UCLA Graduate School of Education. https://files.eric.ed.gov/fulltext/ED341738.pdf
Kane, M. T. (2006). Validation. In R. L. Brennan (Ed.), educational measurement (4th ed., pp. 17-64). American Council on Education/Praeger.
Kim, Y. J., & Park, H. S. (2024). Reliability estimation of rubric-based performance assessments: A comparative study of classical and modern test theory approaches. Educational Assessment, 29(1), 67-84.
Krouska, A., Troussas, C., & Virvou, M. (2018). Computerized adaptive assessment using accumulative learning activities based on revised bloom's taxonomy. In Joint Conference on knowledge-based software engineering (pp. 252-258). Springer Charm
Lane, S., Raymond, M. R., & Haladyna, T. M. (2016). Handbook of test development.United Kingdom: Hodder & Stoughton.
Lee, Y. H., & Huang, W. D. (2018). Developing a test blueprint for a computerized adaptive testing system for assessing English proficiency. Language Testing, 35(3), 329-348.
Lin, T. J., & Linn, M. C. (2022). Test blueprint design for innovative science assessments. Journal of Research in Science Teaching, 59(2), 218-238.
Linn, R. L., Baker, E. L., & Dunbar, S. B. (Eds.). (2017). Complex, interdisciplinary assessments: Theory and practice. New York: Penguin Random House.
Luecht, R. M., & Nungester, R. J. (2018). Developing and validating test items. In C. A. Clauser, E. F. Downing, & T. M. Haladyna (Eds.), Handbook of test development (2nd ed.). United Kingdom: Pluto Press
McMillan, J. H. (2017). Classroom assessment: Principles and practice for effective standards-based instruction. Pearson
Miciak, J., Horn, I. S., & Schweinle, W. (2020). The role of diagnostic assessment in evaluating teaching quality. Review of Research in Education, 44(1), 240-265.
National Council on Measurement in Education. (2017). Guidelines for developing assessment blueprints. Retrieved from https://www.ncme.org/
Nguyen, T. H., & Nguyen, H. T. (2023). Investigating the reliability and validity of a mathematics achievement test using Rasch analysis. Journal of Educational Measurement, 39(4), 412-430.
Nitko, A. J. (2016). Educational assessment of students (7th ed.). Boston, MA: Pearson.
Nitko, A. J., & Brookhart, S. M. (2011). Educational assessment of students (6th ed.). Boston: Pearson Education.
Osterlind, S. J. (2017). Constructing test items: Multiple-choice, constructed-response, performance and other formats (4th ed.). United Kingdom: Verso Books.
Pellegrino, J.W. (2022). Using test blueprints to improve the quality of assessments in educational settings. Educational Assessment, Evaluation and Accountability, 34(1), 67-83.
Popham, W. J. (2006). Assessment for educational leaders. Pearson.
Popham, W. J. (2018). Classroom assessment: What teachers need to know (8th ed.). Pearson Publishers Ltd.
Quansah, F., Amoako, I., & Ankomah, F. (2019). Teachers’ test construction skills in Senior High Schools in Ghana: Document analysis. International Journal of Assessment Tools in Education, 6(1), 1-8.
Rodriguez, M. C. (2019). Test development and validation: A primer for teachers and test developers. United Kingdom: Atlantic Books.
Salkind, N. J. (2012). Tests and measurement for people who (think they) hate tests & measurement (2nd ed.). Sage Publications.
Scherbaum, C. A., & Goldstein, H. W. (2018). An introduction to competency-based selection. New York: Zed Books.
Schmeiser, C. B., & Welch, C. J. (2019). Principles of test development. In R. K. Hambleton, P. F. Merenda, & C. D. Spielberger (Eds.), Adapting educational and psychological tests for cross-cultural assessment (pp. 29-42). London: Quercus.
Shepard, L. A. (2000). The role of assessment in a learning culture. Educational Researcher, 29(7), 4-14.
Sireci, S. G., & Zenisky, A. L. (2006). Factors affecting the validity of cross-lingual assessments. Educational Measurement: Issues and Practice, 25(4), 14-21.
Smith, J. K. (2022). The impact of test blueprint use on student performance and learning measurement. Journal of Educational Assessment, 39(1), 43-56.
Stiggins, R. J. (2020). Assessment for learning revisited: An historical and contemporary analysis. Assessment in Education: Principles, Policy & Practice, 27(3), 247-266.
Wiggins, G. (1998). Educative assessment: Designing assessments to inform and improve student performance. Jossey-Bass.
Wiliam, D. (2018). Formative assessment: Ten years on. British Educational Research Journal, 44(3), 375-378.
Yang, H., & Chen, W. (2023). Investigating the reliability and validity of a writing assessment rubric for English language learners: A mixed-methods study. Language Assessment Quarterly, 30(3), 291-310.