Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning

16 September 2022

Papers citing "Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning"

15 / 15 papers shown

Title
Bilinear Classes: A Structural Framework for Provable Generalization in RL S. Du Sham Kakade Jason D. Lee Shachar Lovett G. Mahajan Wen Sun Ruosong Wang OffRL 152 191 0 19 Mar 2021
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning Sebastian Curi Felix Berkenkamp Andreas Krause 67 83 0 15 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths I. Clavera Yao Fu Pieter Abbeel 68 88 0 16 May 2020
Dream to Control: Learning Behaviors by Latent Imagination Danijar Hafner Timothy Lillicrap Jimmy Ba Mohammad Norouzi VLM 111 1,349 0 03 Dec 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation Chi Jin Zhuoran Yang Zhaoran Wang Michael I. Jordan 86 555 0 11 Jul 2019
Exploring Model-based Planning with Policy Networks Tingwu Wang Jimmy Ba 71 149 0 20 Jun 2019
When to Trust Your Model: Model-Based Policy Optimization Michael Janner Justin Fu Marvin Zhang Sergey Levine OffRL 83 948 0 19 Jun 2019
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound Lin F. Yang Mengdi Wang OffRL GP 55 285 0 24 May 2019
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning Vladimir Feinberg Alvin Wan Ion Stoica Michael I. Jordan Joseph E. Gonzalez Sergey Levine OffRL 56 317 0 28 Feb 2018
Model-Ensemble Trust-Region Policy Optimization Thanard Kurutach I. Clavera Yan Duan Aviv Tamar Pieter Abbeel 75 451 0 28 Feb 2018
The Uncertainty Bellman Equation and Exploration Brendan O'Donoghue Ian Osband Rémi Munos Volodymyr Mnih 63 191 0 15 Sep 2017
Ensemble Sampling Xiuyuan Lu Benjamin Van Roy 119 119 0 20 May 2017
Deep Exploration via Randomized Value Functions Ian Osband Benjamin Van Roy Daniel Russo Zheng Wen 89 304 0 22 Mar 2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning? Ian Osband Benjamin Van Roy BDL 76 260 0 01 Jul 2016
Dual Control for Approximate Bayesian Reinforcement Learning Edgar D. Klenske Philipp Hennig BDL OffRL 40 66 0 13 Oct 2015