Hill Climbing on Value Estimates for Search-control in Dyna

Hill Climbing on Value Estimates for Search-control in Dyna

18 June 2019

Amir-massoud Farahmand

Papers citing "Hill Climbing on Value Estimates for Search-control in Dyna"

12 / 12 papers shown

Title
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL C. Voelcker Marcel Hussing Eric Eaton Amir-massoud Farahmand Igor Gilitschenski 72 4 0 11 Oct 2024
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains Yangchen Pan M. Zaheer Adam White Andrew Patterson Martha White 51 46 0 12 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces G. Z. Holland Erik Talvitie Michael Bowling AI4CE 37 43 0 05 Jun 2018
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation Dane S. Corneil W. Gerstner Johanni Brea OffRL 51 62 0 12 Feb 2018
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning Baolin Peng Xiujun Li Jianfeng Gao Jingjing Liu Kam-Fai Wong Shang-Yu Su OffRL 57 156 0 18 Jan 2018
OpenAI Gym Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang Wojciech Zaremba OffRL ODL 181 5,056 0 05 Jun 2016
Continuous Deep Q-Learning with Model-based Acceleration S. Gu Timothy Lillicrap Ilya Sutskever Sergey Levine 64 1,010 0 02 Mar 2016
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 198 3,781 0 18 Nov 2015
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 225 13,174 0 09 Sep 2015
Non-asymptotic convergence analysis for the Unadjusted Langevin Algorithm Alain Durmus Eric Moulines 61 410 0 17 Jul 2015
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping R. Sutton Csaba Szepesvári A. Geramifard Michael Bowling OffRL 65 203 0 13 Jun 2012
Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing-game strategy decision systems Matilde Santos Jos´e Antonio Victoria L´opez Guillermo Botella 70 54 0 20 Jan 2011