Advances in Large Language Models revolutionized medical education by enabling scalable and efficient learning solutions. This paper presents a pipeline employing Retrieval-Augmented Generation (RAG) system to prepare comments generation for Poland's State Specialization Examination (PES) based on verified resources. The system integrates these generated comments and source documents with a spaced repetition learning algorithm to enhance knowledge retention while minimizing cognitive overload. By employing a refined retrieval system, query rephraser, and an advanced reranker, our modified RAG solution promotes accuracy more than efficiency. Rigorous evaluation by medical annotators demonstrates improvements in key metrics such as document relevance, credibility, and logical coherence of generated content, proven by a series of experiments presented in the paper. This study highlights the potential of RAG systems to provide scalable, high-quality, and individualized educational resources, addressing non-English speaking users.
View on arXiv@article{kaczmarek2025_2503.01859, title={ Optimizing Retrieval-Augmented Generation of Medical Content for Spaced Repetition Learning }, author={ Jeremi I. Kaczmarek and Jakub Pokrywka and Krzysztof Biedalak and Grzegorz Kurzyp and Łukasz Grzybowski }, journal={arXiv preprint arXiv:2503.01859}, year={ 2025 } }