Fine-tuning Timeseries Predictors Using Reinforcement Learning

20 March 2026

Hugo Cazaux

Ralph Rudd

Hlynur Stefánsson

Sverrir Ólafsson

Eyjólfur Ingi Ásgeirsson

AI4TS

ArXiv (abs)PDF HTML Github

Main:14 Pages

1 Figures

Bibliography:4 Pages

10 Tables

Abstract

This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using supervised learning, and compare the performance before and after the fine-tuning. We find an increase in performance after fine-tuning, and transfer learning properties to the models, indicating the benefits of fine-tuning. We also highlight the tuning process and empirical results for future implementation by practitioners.

View on arXiv

Comments on this paper