ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08253
  4. Cited By
When to Trust Your Model: Model-Based Policy Optimization

When to Trust Your Model: Model-Based Policy Optimization

19 June 2019
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "When to Trust Your Model: Model-Based Policy Optimization"

50 / 235 papers shown
Title
Drive Fast, Learn Faster: On-Board RL for High Performance Autonomous Racing
Drive Fast, Learn Faster: On-Board RL for High Performance Autonomous Racing
Benedict Hildisch
Edoardo Ghignone
Nicolas Baumann
Cheng Hu
Andrea Carron
Michele Magno
36
0
0
12 May 2025
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
34
0
0
12 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
71
0
0
04 May 2025
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
61
0
0
01 May 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
Xuguang Lan
40
0
0
05 Apr 2025
A Robust Model-Based Approach for Continuous-Time Policy Evaluation with Unknown Lévy Process Dynamics
A Robust Model-Based Approach for Continuous-Time Policy Evaluation with Unknown Lévy Process Dynamics
Qihao Ye
Xiaochuan Tian
Yuhua Zhu
36
1
0
02 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
93
1
0
26 Mar 2025
Sample-Efficient Reinforcement Learning of Koopman eNMPC
Sample-Efficient Reinforcement Learning of Koopman eNMPC
Daniel Mayfrank
M. Velioglu
Alexander Mitsos
Manuel Dahmen
OffRL
49
0
0
24 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
58
0
0
17 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
53
0
0
10 Mar 2025
Precise Insulin Delivery for Artificial Pancreas: A Reinforcement Learning Optimized Adaptive Fuzzy Control Approach
Omar Mameche
Abdelhadi Abedou
Taqwa Mezaache
Mohamed Tadjine
59
0
0
09 Mar 2025
Multi-Fidelity Policy Gradient Algorithms
Multi-Fidelity Policy Gradient Algorithms
Xinjie Liu
Cyrus Neary
Kushagra Gupta
Christian Ellis
Ufuk Topcu
David Fridovich-Keil
OffRL
208
0
0
07 Mar 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
72
1
0
17 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
219
0
0
10 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
80
2
0
04 Feb 2025
Dual Alignment Maximin Optimization for Offline Model-based RL
Dual Alignment Maximin Optimization for Offline Model-based RL
Chi Zhou
Wang Luo
Haoran Li
Congying Han
Tiande Guo
Zicheng Zhang
OffRL
75
0
0
02 Feb 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
76
1
0
17 Jan 2025
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes
Zijian Wang
Bin Wang
Mingwen Shao
Hongbo Dou
Boxiang Tao
36
0
0
06 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
70
0
0
03 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
102
1
0
22 Dec 2024
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
116
2
0
23 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
41
1
0
11 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
37
0
0
11 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
Amogh Joshi
Adarsh Kosta
Kaushik Roy
OffRL
47
2
0
16 Sep 2024
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control
Zihao Sheng
Zilin Huang
Sikai Chen
39
9
0
30 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
48
1
0
23 Aug 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
81
1
0
22 Aug 2024
Detecting Unsafe Behavior in Neural Network Imitation Policies for
  Caregiving Robotics
Detecting Unsafe Behavior in Neural Network Imitation Policies for Caregiving Robotics
Andrii Tytarenko
OffRL
52
0
0
29 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
21
0
0
18 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and
  Imperfect Simulators
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
46
1
0
30 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
49
1
0
15 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
41
0
0
03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with
  Uncertainty-Aware Rollout Adaption
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
49
5
0
29 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for
  Controllable Language Generation
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
42
1
0
27 May 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
39
2
0
25 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
30
8
0
24 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
49
26
0
24 May 2024
State-Constrained Offline Reinforcement Learning
State-Constrained Offline Reinforcement Learning
Charles A. Hepburn
Yue Jin
Giovanni Montana
OffRL
37
0
0
23 May 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
George Konidaris
31
3
0
03 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for
  Robot Manipulation
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
40
0
0
02 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
30
0
0
31 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
42
3
0
09 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter
  Lesson of Reinforcement Learning
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Milo's
Tomasz Trzciñski
M. Ostaszewski
Marek Cygan
OffRL
35
17
0
01 Mar 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency
  Leveraging Expert Observations
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
42
0
0
29 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
35
3
0
19 Feb 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
26
0
0
18 Jan 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
  Learning
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
10
0
06 Jan 2024
12345
Next