Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.07676
Cited By
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
16 September 2022
Shen Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning"
15 / 15 papers shown
Title
Bilinear Classes: A Structural Framework for Provable Generalization in RL
S. Du
Sham Kakade
Jason D. Lee
Shachar Lovett
G. Mahajan
Wen Sun
Ruosong Wang
OffRL
152
191
0
19 Mar 2021
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
67
83
0
15 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
68
88
0
16 May 2020
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
111
1,349
0
03 Dec 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
86
555
0
11 Jul 2019
Exploring Model-based Planning with Policy Networks
Tingwu Wang
Jimmy Ba
71
149
0
20 Jun 2019
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
83
948
0
19 Jun 2019
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
Lin F. Yang
Mengdi Wang
OffRL
GP
55
285
0
24 May 2019
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
56
317
0
28 Feb 2018
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
75
451
0
28 Feb 2018
The Uncertainty Bellman Equation and Exploration
Brendan O'Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
63
191
0
15 Sep 2017
Ensemble Sampling
Xiuyuan Lu
Benjamin Van Roy
119
119
0
20 May 2017
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
89
304
0
22 Mar 2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
76
260
0
01 Jul 2016
Dual Control for Approximate Bayesian Reinforcement Learning
Edgar D. Klenske
Philipp Hennig
BDL
OffRL
40
66
0
13 Oct 2015
1