Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

2 March 2025

Abstract

Time series analysis provides essential insights for real-world system dynamics and informs downstream decision-making, yet most existing methods often overlook the rich contextual signals present in auxiliary modalities. To bridge this gap, we introduce TimeXL, a multi-modal prediction framework that integrates a prototype-based time series encoder with three collaborating Large Language Models (LLMs) to deliver more accurate predictions and interpretable explanations. First, a multi-modal prototype-based encoder processes both time series and textual inputs to generate preliminary forecasts alongside case-based rationales. These outputs then feed into a prediction LLM, which refines the forecasts by reasoning over the encoder's predictions and explanations. Next, a reflection LLM compares the predicted values against the ground truth, identifying textual inconsistencies or noise. Guided by this feedback, a refinement LLM iteratively enhances text quality and triggers encoder retraining. This closed-loop workflow -- prediction, critique (reflect), and refinement -- continuously boosts the framework's performance and interpretability. Empirical evaluations on four real-world datasets demonstrate that TimeXL achieves up to 8.9\% improvement in AUC and produces human-centric, multi-modal explanations, highlighting the power of LLM-driven reasoning for time series prediction.

View on arXiv

@article{jiang2025_2503.01013,
  title={ Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop },
  author={ Yushan Jiang and Wenchao Yu and Geon Lee and Dongjin Song and Kijung Shin and Wei Cheng and Yanchi Liu and Haifeng Chen },
  journal={arXiv preprint arXiv:2503.01013},
  year={ 2025 }
}

Comments on this paper