Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.00101
Cited By
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
28 February 2018
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning"
50 / 197 papers shown
Title
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
101
21
0
05 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
100
10
0
04 Jul 2021
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
95
17
0
01 Jul 2021
Learning Task Informed Abstractions
Xiang Fu
Ge Yang
Pulkit Agrawal
Tommi Jaakkola
110
69
0
29 Jun 2021
Uncertainty-Aware Model-Based Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chen Lv
56
7
0
23 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
47
50
0
06 Jun 2021
Stochastic Intervention for Causal Inference via Reinforcement Learning
Tri Dung Duong
Qian Li
Guandong Xu
CML
48
3
0
28 May 2021
Robust Value Iteration for Continuous Control Tasks
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
64
19
0
25 May 2021
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks
Menghui Zhu
Minghuan Liu
Jian Shen
Zhicheng Zhang
Sheng Chen
Weinan Zhang
Deheng Ye
Yong Yu
Qiang Fu
Wei Yang
114
22
0
13 May 2021
Value Iteration in Continuous Actions, States and Time
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
52
37
0
10 May 2021
Learning to drive from a world on rails
Di Chen
V. Koltun
Philipp Krahenbuhl
191
122
0
03 May 2021
Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting
Eric Benhamou
David Saltiel
S. Tabachnik
Sui Kai Wong
François Chareyron
OOD
91
4
0
19 Apr 2021
Planning with Expectation Models for Control
Katya Kudashkina
Yi Wan
Abhishek Naik
R. Sutton
OffRL
37
0
0
17 Apr 2021
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
Allan Zhou
Archit Sharma
Chelsea Finn
OffRL
67
3
0
24 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
47
17
0
15 Mar 2021
Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning
Rui Yang
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Feng Luo
Dijun Luo
Lanqing Li
Xiu Li
88
6
0
25 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
77
12
0
23 Feb 2021
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning
DiJia Su
Jason D. Lee
John M. Mulvey
H. Vincent Poor
OffRL
62
6
0
23 Feb 2021
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
97
26
0
19 Feb 2021
A Survey on Active Deep Learning: From Model-driven to Data-driven
Peng Liu
Lizhe Wang
Guojin He
Lei Zhao
85
14
0
25 Jan 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Kun Zhang
Bernhard Schölkopf
CML
OOD
OffRL
87
60
0
16 Dec 2020
Double Meta-Learning for Data Efficient Policy Optimization in Non-Stationary Environments
Elahe Aghapour
Nora Ayanian
OffRL
42
4
0
21 Nov 2020
Deep Reinforcement Learning for Navigation in AAA Video Games
Eloi Alonso
Maxim Peter
David Goumard
Joshua Romoff
59
37
0
09 Nov 2020
Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning
Le Chen
Yu Ao
Florian Tschopp
Andrei Cramariuc
Michel Breyer
Jen Jen Chung
Roland Siegwart
Cesar Cadena
51
3
0
04 Nov 2020
Low-Variance Policy Gradient Estimation with World Models
Michal Nauman
Floris den Hengst
OffRL
51
1
0
29 Oct 2020
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
100
14
0
27 Oct 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
79
35
0
27 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
41
0
0
24 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
135
18
0
23 Oct 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
114
28
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
118
93
0
19 Oct 2020
Trust the Model When It Is Confident: Masked Model-based Actor-Critic
Feiyang Pan
Jia He
Dandan Tu
Qing He
OffRL
61
47
0
10 Oct 2020
Episodic Memory for Learning Subjective-Timescale Models
Alexey Zakharov
Matthew Crosby
Zafeirios Fountas
41
4
0
03 Oct 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
OffRL
55
5
0
21 Sep 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL
LRM
140
606
0
16 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control
V. M. Alvarez
R. Rosca
Cristian G. Falcutescu
63
11
0
09 Sep 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
115
11
0
30 Aug 2020
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
85
71
0
28 Aug 2020
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
139
50
0
23 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
127
17
0
11 Aug 2020
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
87
1
0
29 Jul 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
102
181
0
24 Jul 2020
Evaluating the Apperception Engine
Richard Evans
Jose Hernandez-Orallo
Johannes Welbl
Pushmeet Kohli
Marek Sergot
73
5
0
09 Jul 2020
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
66
3
0
06 Jul 2020
Selective Dyna-style Planning Under Limited Model Capacity
Zaheer Abbas
Samuel Sokota
Erin J. Talvitie
Martha White
93
34
0
05 Jul 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
32
0
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
74
52
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
132
85
0
15 Jun 2020
Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach
Guannan Qu
Chenkai Yu
S. Low
Adam Wierman
66
19
0
12 Jun 2020
Previous
1
2
3
4
Next