ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.07791
  4. Cited By
Hill Climbing on Value Estimates for Search-control in Dyna

Hill Climbing on Value Estimates for Search-control in Dyna

18 June 2019
Yangchen Pan
Hengshuai Yao
Amir-massoud Farahmand
Martha White
ArXivPDFHTML

Papers citing "Hill Climbing on Value Estimates for Search-control in Dyna"

12 / 12 papers shown
Title
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
72
4
0
11 Oct 2024
Organizing Experience: A Deeper Look at Replay Mechanisms for
  Sample-based Planning in Continuous State Domains
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains
Yangchen Pan
M. Zaheer
Adam White
Andrew Patterson
Martha White
51
46
0
12 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional
  State Spaces
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
G. Z. Holland
Erik Talvitie
Michael Bowling
AI4CE
37
43
0
05 Jun 2018
Efficient Model-Based Deep Reinforcement Learning with Variational State
  Tabulation
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Dane S. Corneil
W. Gerstner
Johanni Brea
OffRL
51
62
0
12 Feb 2018
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy
  Learning
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Kam-Fai Wong
Shang-Yu Su
OffRL
57
156
0
18 Jan 2018
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
181
5,056
0
05 Jun 2016
Continuous Deep Q-Learning with Model-based Acceleration
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
64
1,010
0
02 Mar 2016
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
198
3,781
0
18 Nov 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
225
13,174
0
09 Sep 2015
Non-asymptotic convergence analysis for the Unadjusted Langevin
  Algorithm
Non-asymptotic convergence analysis for the Unadjusted Langevin Algorithm
Alain Durmus
Eric Moulines
61
410
0
17 Jul 2015
Dyna-Style Planning with Linear Function Approximation and Prioritized
  Sweeping
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
65
203
0
13 Jun 2012
Dyna-H: a heuristic planning reinforcement learning algorithm applied to
  role-playing-game strategy decision systems
Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing-game strategy decision systems
Matilde Santos
Jos´e Antonio
Victoria L´opez
Guillermo Botella
70
54
0
20 Jan 2011
1