Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

28 February 2018
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
    OffRL
arXiv:1803.00101

Papers citing "Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning"

50 / 197 papers shown
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
Gang Li
Ming Lin
Tomer Galanti
Zhengzhong Tu
Tianbao Yang
111
1
0
18 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
133
0
0
04 May 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
230
0
0
05 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
207
1
0
26 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
498
5
0
10 Mar 2025
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye
Zhenyu Wu
Jiahui Gao
Zhiyong Wu
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
103
4
0
27 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
142
1
0
17 Feb 2025
EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks
Tongtong Feng
X. Wang
Zekai Zhou
Ren Wang
Yuwei Zhan
Guangyao Li
Qing Li
Wenwu Zhu
LM&Ro
176
0
0
09 Feb 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
217
0
0
17 Jan 2025
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes
Zijian Wang
Bin Wang
Mingwen Shao
Hongbo Dou
Boxiang Tao
115
0
0
06 Jan 2025
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
184
1
0
16 Dec 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
190
7
0
23 Oct 2024
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
100
4
0
15 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
112
3
0
03 Oct 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
134
1
0
23 Aug 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
97
0
0
27 Jun 2024
Bounding-Box Inference for Error-Aware Model-Based Reinforcement Learning
Erin J. Talvitie
Zilei Shao
Huiying Li
Jinghan Hu
Jacob Boerma
Rory Zhao
Xintong Wang
OffRL
63
1
0
23 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
147
20
0
03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
110
5
0
29 May 2024
Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation
Ignat Georgiev
K. Srinivasan
Jie Xu
Eric Heiden
Animesh Garg
93
14
0
28 May 2024
Multi-State TD Target for Model-Free Reinforcement Learning
Wuhao Wang
Zhiyong Chen
Lepeng Zhang
TTA
39
0
0
26 May 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
106
9
0
22 Apr 2024
Hindsight PRIORs for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
95
6
0
12 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
92
0
0
31 Mar 2024
Deep Reinforcement Learning in Autonomous Car Path Planning and Control: A Survey
Yiyang Chen
Chao Ji
Yunrui Cai
Tong Yan
Bo Su
81
11
0
30 Mar 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
87
1
0
18 Mar 2024
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
Shengjie Wang
Shaohuai Liu
Weirui Ye
Jiacheng You
Yang Gao
OffRL
99
15
0
01 Mar 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
135
20
0
05 Feb 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
Zizhao Wang
Caroline Wang
Xuesu Xiao
Yuke Zhu
Peter Stone
OffRL
61
5
0
23 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
112
12
0
06 Jan 2024
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Xingzhou Lou
Junge Zhang
Timothy J. Norman
Kaiqi Huang
Yali Du
70
1
0
25 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
82
10
0
15 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
85
4
0
30 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Zhaolin Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
104
4
0
20 Nov 2023
Model-assisted Reinforcement Learning of a Quadrotor
Arshad Javeed
77
0
0
12 Nov 2023
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
65
2
0
30 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement Learning
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
76
4
0
03 Oct 2023
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
72
1
0
28 Sep 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
96
2
0
24 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Hai Zhang
Hang Yu
Junqiao Zhao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
87
10
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
92
20
0
22 Sep 2023
A Review on Robot Manipulation Methods in Human-Robot Interactions
Haoxu Zhang
P. Kebria
Shady M. K. Mohamed
Samson Yu
Saeid Nahavandi
65
0
0
09 Sep 2023
Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations
Nirbhay Modhe
Qiaozi Gao
Ashwin Kalyan
Dhruv Batra
Govind Thattai
Gaurav Sukhatme
OffRL
75
2
0
07 Aug 2023
λ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
124
0
0
30 Jun 2023
MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards
Yuming Huang
Bin Ren
Ziming Xu
Lianghong Wu
OffRL
70
0
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
103
30
0
27 Jun 2023
Actor-Critic Model Predictive Control
Angel Romero
Yunlong Song
Davide Scaramuzza
103
41
0
16 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
72
13
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
78
1
0
15 Jun 2023
Dynamically Conservative Self-Driving Planner for Long-Tail Cases
Weitao Zhou
Zhong Cao
Nanshan Deng
Xiaoyu Liu
Kun Jiang
Diange Yang
57
20
0
12 May 2023