System Log Parsing with Large Language Models: A Review

Log data provides crucial insights for tasks like monitoring, root cause analysis, and anomaly detection. Due to the vast volume of logs, automated log parsing is essential to transform semi-structured log messages into structured representations. Recent advances in large language models (LLMs) have introduced the new research field of LLM-based log parsing. Despite promising results, there is no structured overview of the approaches in this relatively new research field with the earliest advances published in late 2023. This work systematically reviews 29 LLM-based log parsing methods. We benchmark seven of them on public datasets and critically assess their comparability and the reproducibility of their reported results. Our findings summarize the advances of this new research field, with insights on how to report results, which data sets, metrics and which terminology to use, and which inconsistencies to avoid, with code and results made publicly available for transparency.
View on arXiv@article{beck2025_2504.04877, title={ System Log Parsing with Large Language Models: A Review }, author={ Viktor Beck and Max Landauer and Markus Wurzenberger and Florian Skopik and Andreas Rauber }, journal={arXiv preprint arXiv:2504.04877}, year={ 2025 } }