7
0

From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns

Bernardo Leite
Henrique Lopes Cardoso
Pedro Pinto
Abel Ferreira
Luís Abreu
Isabel Rangel
Sandra Monteiro
Main:32 Pages
15 Figures
Bibliography:8 Pages
11 Tables
Appendix:3 Pages
Abstract

While MCQs are valuable for learning and evaluation, manually creating them with varying difficulty levels and targeted reading skills remains a time-consuming and costly task. Recent advances in generative AI provide an opportunity to automate MCQ generation efficiently. However, assessing the actual quality and reliability of generated MCQs has received limited attention -- particularly regarding cases where generation fails. This aspect becomes particularly important when the generated MCQs are meant to be applied in real-world settings. Additionally, most MCQ generation studies focus on English, leaving other languages underexplored. This paper investigates the capabilities of current generative models in producing MCQs for reading comprehension in Portuguese, a morphologically rich language. Our study focuses on generating MCQs that align with curriculum-relevant narrative elements and span different difficulty levels. We evaluate these MCQs through expert review and by analyzing the psychometric properties extracted from student responses to assess their suitability for elementary school students. Our results show that current models can generate MCQs of comparable quality to human-authored ones. However, we identify issues related to semantic clarity and answerability. Also, challenges remain in generating distractors that engage students and meet established criteria for high-quality MCQ option design.

View on arXiv
@article{leite2025_2506.15598,
  title={ From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns },
  author={ Bernardo Leite and Henrique Lopes Cardoso and Pedro Pinto and Abel Ferreira and Luís Abreu and Isabel Rangel and Sandra Monteiro },
  journal={arXiv preprint arXiv:2506.15598},
  year={ 2025 }
}
Comments on this paper