Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.12194
Cited By
v1
v2 (latest)
Combining Off and On-Policy Training in Model-Based Reinforcement Learning
24 February 2021
Alexandre Borges
Arlindo L. Oliveira
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Combining Off and On-Policy Training in Model-Based Reinforcement Learning"
6 / 6 papers shown
Title
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
91
75
0
24 Jul 2020
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
21
2
0
24 Apr 2019
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
83
48
0
24 May 2018
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
74
291
0
28 Dec 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
320
13,248
0
09 Sep 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,231
0
19 Dec 2013
1