Combining Off and On-Policy Training in Model-Based Reinforcement
Learning

v1v2 (latest)

Combining Off and On-Policy Training in Model-Based Reinforcement Learning

24 February 2021

Alexandre Borges

Arlindo L. Oliveira

ArXiv (abs)PDF HTML

Papers citing "Combining Off and On-Policy Training in Model-Based Reinforcement Learning"

6 / 6 papers shown

Title
Monte-Carlo Tree Search as Regularized Policy Optimization Jean-Bastien Grill Florent Altché Yunhao Tang Thomas Hubert Michal Valko Ioannis Antonoglou Rémi Munos 91 75 0 24 Jul 2020
Towards Combining On-Off-Policy Methods for Real-World Applications Kai-Chun Hu Chen-Huan Pi Ting Han Wei I-Chen Wu Stone Cheng Yi-Wei Dai Wei-Yuan Ye OffRL 21 2 0 24 Apr 2019
A0C: Alpha Zero in Continuous Action Space Thomas M. Moerland Joost Broekens Aske Plaat Catholijn M. Jonker 83 48 0 24 May 2018
The Predictron: End-To-End Learning and Planning David Silver H. V. Hasselt Matteo Hessel Tom Schaul A. Guez ... Gabriel Dulac-Arnold David P. Reichert Neil C. Rabinowitz André Barreto T. Degris 74 291 0 28 Dec 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 320 13,248 0 09 Sep 2015
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 127 12,231 0 19 Dec 2013