Multimodal Conditioned Diffusive Time Series Forecasting

28 April 2025

Abstract

Diffusion models have achieved remarkable success in processing images and text, and have been extended to specialized domains such as time series forecasting (TSF). Existing diffusion-based approaches for TSF primarily focus on modeling single-modality numerical sequences, overlooking the rich multimodal information in time series data. To effectively leverage such information for prediction, we propose a multimodal conditioned diffusion model for TSF, namely MCD-TSF, which jointly utilizes timestamps and texts as extra guidance for time series modeling, especially for forecasting. Specifically, timestamps are combined with time series to establish temporal and semantic correlations among different data points when aggregating information along the temporal dimension. Texts serve as supplementary descriptions of the time series' history, and are adaptively aligned with data points as well as dynamically controlled in a classifier-free manner. Extensive experiments on real-world benchmark datasets across eight domains demonstrate that the proposed MCD-TSF model achieves state-of-the-art performance.
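The abstract does not spell out how the text condition is "controlled in a classifier-free manner". As a rough illustration only, the sketch below shows the standard classifier-free guidance blend used in conditional diffusion models, where a noise prediction made with the condition is mixed with one made without it. The function name, argument names, and the use of plain Python lists are assumptions for illustration and are not taken from MCD-TSF.

```python
from typing import List


def classifier_free_guidance(
    eps_cond: List[float],
    eps_uncond: List[float],
    guidance_scale: float,
) -> List[float]:
    """Blend conditional and unconditional noise predictions.

    Hypothetical helper (not from the paper). It implements the common
    classifier-free guidance rule:
        eps = eps_uncond + guidance_scale * (eps_cond - eps_uncond)
    guidance_scale = 0 ignores the condition (e.g. the text),
    guidance_scale = 1 uses the conditional prediction as-is,
    guidance_scale > 1 pushes the prediction further toward the condition.
    """
    if len(eps_cond) != len(eps_uncond):
        raise ValueError("predictions must have the same length")
    return [u + guidance_scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Toy noise predictions for a two-step forecast horizon (made-up numbers).
with_text = [1.0, 2.0]
without_text = [0.0, 1.0]
blended = classifier_free_guidance(with_text, without_text, guidance_scale=2.0)
```

In this formulation a single scalar controls how strongly the text influences each denoising step, which is one plausible reading of "dynamically controlled"; the exact mechanism in MCD-TSF may differ.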


@article{su2025_2504.19669,
  title={Multimodal Conditioned Diffusive Time Series Forecasting},
  author={Chen Su and Yuanhe Tian and Yan Song},
  journal={arXiv preprint arXiv:2504.19669},
  year={2025}
}
