Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.07791
Cited By
Hill Climbing on Value Estimates for Search-control in Dyna
18 June 2019
Yangchen Pan
Hengshuai Yao
Amir-massoud Farahmand
Martha White
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hill Climbing on Value Estimates for Search-control in Dyna"
12 / 12 papers shown
Title
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
72
4
0
11 Oct 2024
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains
Yangchen Pan
M. Zaheer
Adam White
Andrew Patterson
Martha White
51
46
0
12 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
G. Z. Holland
Erik Talvitie
Michael Bowling
AI4CE
37
43
0
05 Jun 2018
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Dane S. Corneil
W. Gerstner
Johanni Brea
OffRL
51
62
0
12 Feb 2018
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Kam-Fai Wong
Shang-Yu Su
OffRL
57
156
0
18 Jan 2018
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
181
5,056
0
05 Jun 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
64
1,010
0
02 Mar 2016
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
198
3,781
0
18 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
225
13,174
0
09 Sep 2015
Non-asymptotic convergence analysis for the Unadjusted Langevin Algorithm
Alain Durmus
Eric Moulines
61
410
0
17 Jul 2015
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
65
203
0
13 Jun 2012
Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing-game strategy decision systems
Matilde Santos
Jos´e Antonio
Victoria L´opez
Guillermo Botella
70
54
0
20 Jan 2011
1