ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.12194
  4. Cited By
Combining Off and On-Policy Training in Model-Based Reinforcement
  Learning
v1v2 (latest)

Combining Off and On-Policy Training in Model-Based Reinforcement Learning

24 February 2021
Alexandre Borges
Arlindo L. Oliveira
ArXiv (abs)PDFHTML

Papers citing "Combining Off and On-Policy Training in Model-Based Reinforcement Learning"

6 / 6 papers shown
Title
Monte-Carlo Tree Search as Regularized Policy Optimization
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
91
75
0
24 Jul 2020
Towards Combining On-Off-Policy Methods for Real-World Applications
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
21
2
0
24 Apr 2019
A0C: Alpha Zero in Continuous Action Space
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
83
48
0
24 May 2018
The Predictron: End-To-End Learning and Planning
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
74
291
0
28 Dec 2016
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
320
13,248
0
09 Sep 2015
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,231
0
19 Dec 2013
1