Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.00101
Cited By
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
28 February 2018
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning"
50 / 197 papers shown
Title
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
Gang Li
Ming Lin
Tomer Galanti
Zhengzhong Tu
Tianbao Yang
111
1
0
18 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
133
0
0
04 May 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
230
0
0
05 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
207
1
0
26 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
498
5
0
10 Mar 2025
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye
Zhenyu Wu
Jiahui Gao
Zhiyong Wu
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
103
4
0
27 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
142
1
0
17 Feb 2025
EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks
Tongtong Feng
X. Wang
Zekai Zhou
Ren Wang
Yuwei Zhan
Guangyao Li
Qing Li
Wenwu Zhu
LM&Ro
176
0
0
09 Feb 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
217
0
0
17 Jan 2025
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes
Zijian Wang
Bin Wang
Mingwen Shao
Hongbo Dou
Boxiang Tao
115
0
0
06 Jan 2025
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
184
1
0
16 Dec 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
190
7
0
23 Oct 2024
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
100
4
0
15 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
112
3
0
03 Oct 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
134
1
0
23 Aug 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
97
0
0
27 Jun 2024
Bounding-Box Inference for Error-Aware Model-Based Reinforcement Learning
Erin J. Talvitie
Zilei Shao
Huiying Li
Jinghan Hu
Jacob Boerma
Rory Zhao
Xintong Wang
OffRL
63
1
0
23 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
147
20
0
03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
110
5
0
29 May 2024
Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation
Ignat Georgiev
K. Srinivasan
Jie Xu
Eric Heiden
Animesh Garg
93
14
0
28 May 2024
Multi-State TD Target for Model-Free Reinforcement Learning
Wuhao Wang
Zhiyong Chen
Lepeng Zhang
TTA
39
0
0
26 May 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
106
9
0
22 Apr 2024
Hindsight PRIORs for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
95
6
0
12 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
92
0
0
31 Mar 2024
Deep Reinforcement Learning in Autonomous Car Path Planning and Control: A Survey
Yiyang Chen
Chao Ji
Yunrui Cai
Tong Yan
Bo Su
81
11
0
30 Mar 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
87
1
0
18 Mar 2024
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
Shengjie Wang
Shaohuai Liu
Weirui Ye
Jiacheng You
Yang Gao
OffRL
99
15
0
01 Mar 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
135
20
0
05 Feb 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
Zizhao Wang
Caroline Wang
Xuesu Xiao
Yuke Zhu
Peter Stone
OffRL
61
5
0
23 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
112
12
0
06 Jan 2024
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Xingzhou Lou
Junge Zhang
Timothy J. Norman
Kaiqi Huang
Yali Du
70
1
0
25 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
82
10
0
15 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
85
4
0
30 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Zhaolin Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
104
4
0
20 Nov 2023
Model-assisted Reinforcement Learning of a Quadrotor
Arshad Javeed
77
0
0
12 Nov 2023
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
65
2
0
30 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement Learning
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
76
4
0
03 Oct 2023
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
72
1
0
28 Sep 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
96
2
0
24 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Hai Zhang
Hang Yu
Junqiao Zhao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
87
10
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
92
20
0
22 Sep 2023
A Review on Robot Manipulation Methods in Human-Robot Interactions
Haoxu Zhang
P. Kebria
Shady M. K. Mohamed
Samson Yu
Saeid Nahavandi
65
0
0
09 Sep 2023
Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations
Nirbhay Modhe
Qiaozi Gao
Ashwin Kalyan
Dhruv Batra
Govind Thattai
Gaurav Sukhatme
OffRL
75
2
0
07 Aug 2023
λ
λ
λ
-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
124
0
0
30 Jun 2023
MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards
Yuming Huang
Bin Ren
Ziming Xu
Lianghong Wu
OffRL
70
0
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
103
30
0
27 Jun 2023
Actor-Critic Model Predictive Control
Angel Romero
Yunlong Song
Davide Scaramuzza
103
41
0
16 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
72
13
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
78
1
0
15 Jun 2023
Dynamically Conservative Self-Driving Planner for Long-Tail Cases
Weitao Zhou
Zhong Cao
Nanshan Deng
Xiaoyu Liu
Kun Jiang
Diange Yang
57
20
0
12 May 2023
1
2
3
4
Next