Multi-Scale Finetuning for Encoder-based Time Series Foundation Models
- AI4TSAI4CE

Time series foundation models (TSFMs) demonstrate impressive zero-shot performance for time series forecasting. However, an important yet underexplored challenge is how to effectively finetune TSFMs on specific downstream tasks. While naive finetuning can yield performance gains, we argue that it falls short of fully leveraging TSFMs' capabilities, often resulting in overfitting and suboptimal performance. Given the diverse temporal patterns across sampling scales and the inherent multi-scale forecasting capabilities of TSFMs, we adopt a causal perspective to analyze finetuning process, through which we highlight the critical importance of explicitly modeling multiple scales and reveal the shortcomings of naive approaches. Focusing on \textit{encoder-based} TSFMs, we propose \textbf{M}ulti\textbf{\textsc{s}}cale \textbf{\textsc{f}}ine\textbf{\textsc{t}}uning (\textbf{MSFT}), a simple yet general framework that explicitly integrates multi-scale modeling into the finetuning process. Experimental results on three different backbones (\moirai, \moment\ and \units) demonstrate that TSFMs finetuned with MSFT not only outperform naive and typical parameter efficient finetuning methods but also surpass state-of-the-art deep learning methods.
View on arXiv@article{qiao2025_2506.14087, title={ Multi-Scale Finetuning for Encoder-based Time Series Foundation Models }, author={ Zhongzheng Qiao and Chenghao Liu and Yiming Zhang and Ming Jin and Quang Pham and Qingsong Wen and P.N. Suganthan and Xudong Jiang and Savitha Ramasamy }, journal={arXiv preprint arXiv:2506.14087}, year={ 2025 } }