Fine-tuning Timeseries Predictors Using Reinforcement Learning
Hugo Cazaux
Ralph Rudd
Hlynur Stefánsson
Sverrir Ólafsson
Eyjólfur Ingi Ásgeirsson
- AI4TS
Main:14 Pages
1 Figures
Bibliography:4 Pages
10 Tables
Abstract
This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using supervised learning, and compare the performance before and after the fine-tuning. We find an increase in performance after fine-tuning, and transfer learning properties to the models, indicating the benefits of fine-tuning. We also highlight the tuning process and empirical results for future implementation by practitioners.
View on arXivComments on this paper
