From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses

Recent advances in large language models (LLMs) have significantly improved natural language generation, including creative tasks like poetry composition. However, most progress remains concentrated in high-resource languages. This raises an important question: Can LLMs be adapted for structured poetic generation in a low-resource, morphologically rich language such as Sanskrit? In this work, we introduce a dataset designed for translating English prose into structured Sanskrit verse, with strict adherence to classical metrical patterns, particularly the Anushtubh meter. We evaluate a range of generative models, both open-source and proprietary, under multiple settings. Specifically, we explore constrained decoding strategies and instruction-based fine-tuning tailored to metrical and semantic fidelity. Our decoding approach achieves over 99% accuracy in producing syntactically valid poetic forms, substantially outperforming general-purpose models in meter conformity. Meanwhile, instruction-tuned variants show improved alignment with source meaning and poetic style, as supported by human assessments, albeit with marginal trade-offs in metrical precision.
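The abstract does not include code, but as an illustration of the kind of metrical constraint a meter-conforming decoder must satisfy, the sketch below checks the pathya (regular) form of the Anushtubh (shloka) meter over syllable weights. It is a minimal sketch, not the authors' implementation: the function name, the 'L'/'G' weight-string input format, and the upstream syllable scanner it presupposes are all assumptions for illustration only.

```python
# Minimal sketch (assumed, not from the paper): validating pathya Anushtubh
# constraints on a verse already scanned into laghu/guru syllable weights.
# An upstream scanner mapping each syllable to 'L' (laghu/light) or
# 'G' (guru/heavy) is assumed and not shown here.

from typing import List


def is_pathya_anushtubh(padas: List[str]) -> bool:
    """Check simplified pathya shloka rules on four 8-syllable padas.

    Each element of `padas` is a string of 'L'/'G' weight marks.
    Rules encoded (a common simplified statement of the meter):
      * every pada has exactly 8 syllables;
      * syllable 5 is laghu and syllable 6 is guru in every pada;
      * syllable 7 is guru in odd padas (1, 3) and laghu in even padas (2, 4).
    """
    if len(padas) != 4:
        return False
    for i, pada in enumerate(padas, start=1):
        if len(pada) != 8:
            return False
        if pada[4] != "L" or pada[5] != "G":           # 5th laghu, 6th guru
            return False
        expected_seventh = "G" if i % 2 == 1 else "L"  # 7th alternates by pada
        if pada[6] != expected_seventh:
            return False
    return True


# Example: a well-formed weight pattern passes; violating syllable 5 fails.
print(is_pathya_anushtubh(["GGGGLGGL", "GGGGLGLG", "LGLGLGGL", "GLGGLGLL"]))  # True
print(is_pathya_anushtubh(["GGGGGGGL", "GGGGLGLG", "LGLGLGGL", "GLGGLGLL"]))  # False
```

In a constrained-decoding setup, a check of this kind could be applied to the partially generated verse at each step to prune token candidates that would make the metrical pattern unsatisfiable; the paper's actual decoding strategy may differ in its details.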
@article{jagadeeshan2025_2506.00815,
  title   = {From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses},
  author  = {Manoj Balaji Jagadeeshan and Samarth Bhatia and Pretam Ray and Harshul Raj Surana and Akhil Rajeev P and Priya Mishra and Annarao Kulkarni and Ganesh Ramakrishnan and Prathosh AP and Pawan Goyal},
  journal = {arXiv preprint arXiv:2506.00815},
  year    = {2025}
}