ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.00101
  4. Cited By
Model-Based Value Estimation for Efficient Model-Free Reinforcement
  Learning

Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

28 February 2018
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning"

50 / 197 papers shown
Title
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration
  and Exploitation
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
101
21
0
05 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy
  Correction
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRLOnRL
100
10
0
04 Jul 2021
MHER: Model-based Hindsight Experience Replay
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
95
17
0
01 Jul 2021
Learning Task Informed Abstractions
Learning Task Informed Abstractions
Xiang Fu
Ge Yang
Pulkit Agrawal
Tommi Jaakkola
110
69
0
29 Jun 2021
Uncertainty-Aware Model-Based Reinforcement Learning with Application to
  Autonomous Driving
Uncertainty-Aware Model-Based Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chen Lv
56
7
0
23 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
47
50
0
06 Jun 2021
Stochastic Intervention for Causal Inference via Reinforcement Learning
Stochastic Intervention for Causal Inference via Reinforcement Learning
Tri Dung Duong
Qian Li
Guandong Xu
CML
48
3
0
28 May 2021
Robust Value Iteration for Continuous Control Tasks
Robust Value Iteration for Continuous Control Tasks
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
64
19
0
25 May 2021
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks
Menghui Zhu
Minghuan Liu
Jian Shen
Zhicheng Zhang
Sheng Chen
Weinan Zhang
Deheng Ye
Yong Yu
Qiang Fu
Wei Yang
114
22
0
13 May 2021
Value Iteration in Continuous Actions, States and Time
Value Iteration in Continuous Actions, States and Time
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
52
37
0
10 May 2021
Learning to drive from a world on rails
Learning to drive from a world on rails
Di Chen
V. Koltun
Philipp Krahenbuhl
191
122
0
03 May 2021
Adaptive learning for financial markets mixing model-based and
  model-free RL for volatility targeting
Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting
Eric Benhamou
David Saltiel
S. Tabachnik
Sui Kai Wong
François Chareyron
OOD
91
4
0
19 Apr 2021
Planning with Expectation Models for Control
Planning with Expectation Models for Control
Katya Kudashkina
Yi Wan
Abhishek Naik
R. Sutton
OffRL
37
0
0
17 Apr 2021
Discriminator Augmented Model-Based Reinforcement Learning
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
Allan Zhou
Archit Sharma
Chelsea Finn
OffRL
67
3
0
24 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
47
17
0
15 Mar 2021
Bias-reduced Multi-step Hindsight Experience Replay for Efficient
  Multi-goal Reinforcement Learning
Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning
Rui Yang
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Feng Luo
Dijun Luo
Lanqing Li
Xiu Li
88
6
0
25 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly
  by data and model
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
77
12
0
23 Feb 2021
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch
  Optimization for Deployment Constrained Reinforcement Learning
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning
DiJia Su
Jason D. Lee
John M. Mulvey
H. Vincent Poor
OffRL
62
6
0
23 Feb 2021
Model-Invariant State Abstractions for Model-Based Reinforcement
  Learning
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
97
26
0
19 Feb 2021
A Survey on Active Deep Learning: From Model-driven to Data-driven
Peng Liu
Lizhe Wang
Guojin He
Lei Zhao
85
14
0
25 Jan 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data
  Augmentation
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Kun Zhang
Bernhard Schölkopf
CMLOODOffRL
87
60
0
16 Dec 2020
Double Meta-Learning for Data Efficient Policy Optimization in
  Non-Stationary Environments
Double Meta-Learning for Data Efficient Policy Optimization in Non-Stationary Environments
Elahe Aghapour
Nora Ayanian
OffRL
42
4
0
21 Nov 2020
Deep Reinforcement Learning for Navigation in AAA Video Games
Deep Reinforcement Learning for Navigation in AAA Video Games
Eloi Alonso
Maxim Peter
David Goumard
Joshua Romoff
59
37
0
09 Nov 2020
Learning Trajectories for Visual-Inertial System Calibration via
  Model-based Heuristic Deep Reinforcement Learning
Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning
Le Chen
Yu Ao
Florian Tschopp
Andrei Cramariuc
Michel Breyer
Jen Jen Chung
Roland Siegwart
Cesar Cadena
51
3
0
04 Nov 2020
Low-Variance Policy Gradient Estimation with World Models
Low-Variance Policy Gradient Estimation with World Models
Michal Nauman
Floris den Hengst
OffRL
51
1
0
29 Oct 2020
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via
  Latent Model Ensembles
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
100
14
0
27 Oct 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
79
35
0
27 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based
  Reinforcement Learning
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
41
0
0
24 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement
  Learning
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
135
18
0
23 Oct 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
114
28
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
118
93
0
19 Oct 2020
Trust the Model When It Is Confident: Masked Model-based Actor-Critic
Trust the Model When It Is Confident: Masked Model-based Actor-Critic
Feiyang Pan
Jia He
Dandan Tu
Qing He
OffRL
61
47
0
10 Oct 2020
Episodic Memory for Learning Subjective-Timescale Models
Episodic Memory for Learning Subjective-Timescale Models
Alexey Zakharov
Matthew Crosby
Zafeirios Fountas
41
4
0
03 Oct 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
OffRL
55
5
0
21 Sep 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRLLRM
140
606
0
16 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in
  Continuous Control
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control
V. M. Alvarez
R. Rosca
Cristian G. Falcutescu
63
11
0
09 Sep 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning
  Systems
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
115
11
0
30 Aug 2020
On the model-based stochastic value gradient for continuous
  reinforcement learning
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
85
71
0
28 Aug 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
139
50
0
23 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a
  Survey
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDLOffRL
127
17
0
11 Aug 2020
Modular Transfer Learning with Transition Mismatch Compensation for
  Excessive Disturbance Rejection
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
87
1
0
29 Jul 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
102
181
0
24 Jul 2020
Evaluating the Apperception Engine
Evaluating the Apperception Engine
Richard Evans
Jose Hernandez-Orallo
Johannes Welbl
Pushmeet Kohli
Marek Sergot
73
5
0
09 Jul 2020
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
66
3
0
06 Jul 2020
Selective Dyna-style Planning Under Limited Model Capacity
Selective Dyna-style Planning Under Limited Model Capacity
Zaheer Abbas
Samuel Sokota
Erin J. Talvitie
Martha White
93
34
0
05 Jul 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
32
0
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
74
52
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
132
85
0
15 Jun 2020
Combining Model-Based and Model-Free Methods for Nonlinear Control: A
  Provably Convergent Policy Gradient Approach
Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach
Guannan Qu
Chenkai Yu
S. Low
Adam Wierman
66
19
0
12 Jun 2020
Previous
1234
Next