REALM: A Dataset of Real-World LLM Use Cases
- OffRL

Large Language Models (LLMs), such as the GPT series, have driven significant industrial applications, leading to economic and societal transformations. However, a comprehensive understanding of their real-world applications remains limited. To address this, we introduce REALM, a dataset of over 94,000 LLM use cases collected from Reddit and news articles. REALM captures two key dimensions: the diverse applications of LLMs and the demographics of their users. It categorizes LLM applications and explores how users' occupations relate to the types of applications they use. By integrating real-world data, REALM offers insights into LLM adoption across different domains, providing a foundation for future research on their evolving societal roles.
View on arXiv@article{cheng2025_2503.18792, title={ REALM: A Dataset of Real-World LLM Use Cases }, author={ Jingwen Cheng and Kshitish Ghate and Wenyue Hua and William Yang Wang and Hong Shen and Fei Fang }, journal={arXiv preprint arXiv:2503.18792}, year={ 2025 } }