ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08253
  4. Cited By
When to Trust Your Model: Model-Based Policy Optimization

When to Trust Your Model: Model-Based Policy Optimization

19 June 2019
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "When to Trust Your Model: Model-Based Policy Optimization"

50 / 245 papers shown
Title
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
  Learning
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
10
0
06 Jan 2024
Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization
Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization
Daniel Jarne Ornia
Giannis Delimpaltadakis
Jens Kober
Javier Alonso-Mora
28
2
0
30 Nov 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
40
1
0
30 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient
  DRL
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
40
0
0
21 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
28
6
0
09 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline
  Reinforcement Learning
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
32
7
0
09 Oct 2023
Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Zecheng Wang
Che Wang
Zixuan Dong
Keith Ross
OffRL
41
5
0
01 Oct 2023
Recent Advances in Path Integral Control for Trajectory Optimization: An
  Overview in Theoretical and Algorithmic Perspectives
Recent Advances in Path Integral Control for Trajectory Optimization: An Overview in Theoretical and Algorithmic Perspectives
Muhammad Kazim
JunGee Hong
Min-Gyeom Kim
Kwang-Ki K. Kim
44
16
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
Practical Probabilistic Model-based Deep Reinforcement Learning by
  Integrating Dropout Uncertainty and Trajectory Sampling
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
Wenjun Huang
Yunduan Cui
Huiyun Li
Xin Wu
MU
27
0
0
20 Sep 2023
Signal Temporal Logic Neural Predictive Control
Signal Temporal Logic Neural Predictive Control
Yue Meng
Chuchu Fan
18
15
0
10 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing
  Posterior Predictability
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
46
13
0
31 Aug 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
33
28
0
28 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
42
5
0
20 Jul 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error
  Feedback
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
Hang Wang
Sen Lin
Junshan Zhang
23
19
0
20 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
38
11
0
01 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
32
6
0
22 May 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Truncating Trajectories in Monte Carlo Reinforcement Learning
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
29
2
0
07 May 2023
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive
  Physics under Challenging Scenes
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes
Haotian Xue
Antonio Torralba
J. Tenenbaum
Daniel L. K. Yamins
Yunzhu Li
H. Tung
PINN
VGen
AI4CE
66
8
0
22 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
27
2
0
21 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
32
31
0
20 Apr 2023
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient
  Multi-Agent Reinforcement Learning
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning
Aravind Venugopal
Stephanie Milani
Fei Fang
Balaraman Ravindran
OffRL
21
0
0
12 Apr 2023
Neural Constraint Satisfaction: Hierarchical Abstraction for
  Combinatorial Generalization in Object Rearrangement
Neural Constraint Satisfaction: Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement
Michael Chang
Alyssa Dayan
Franziska Meier
Thomas Griffiths
Sergey Levine
Amy Zhang
OCL
OffRL
37
9
0
20 Mar 2023
Hybrid Systems Neural Control with Region-of-Attraction Planner
Hybrid Systems Neural Control with Region-of-Attraction Planner
Yue Meng
Chuchu Fan
29
1
0
18 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
36
1
0
09 Mar 2023
Environment Transformer and Policy Optimization for Model-Based Offline
  Reinforcement Learning
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning
Pengqin Wang
Meixin Zhu
Shaojie Shen
OffRL
33
1
0
07 Mar 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
51
10
0
03 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
21
10
0
02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and
  Algorithms
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
38
13
0
01 Mar 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
C. Cunha
Wen Liu
T. French
Ajmal Mian
26
1
0
26 Feb 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
90
0
24 Feb 2023
Neural Laplace Control for Continuous-time Delayed Systems
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
34
10
0
24 Feb 2023
Model-Based Uncertainty in Value Functions
Model-Based Uncertainty in Value Functions
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
36
14
0
24 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
41
9
0
14 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
52
0
0
04 Feb 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with
  Lipschitz Regularized Value Function
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
48
14
0
02 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
41
5
0
30 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based
  Reinforcement Learning
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
24
7
0
28 Jan 2023
Hint assisted reinforcement learning: an application in radio astronomy
Hint assisted reinforcement learning: an application in radio astronomy
S. Yatawatta
30
1
0
10 Jan 2023
A Survey on Transformers in Reinforcement Learning
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
56
0
08 Jan 2023
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce
  Order Fraud Evaluation
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
27
1
0
05 Dec 2022
Policy Learning for Active Target Tracking over Continuous SE(3)
  Trajectories
Policy Learning for Active Target Tracking over Continuous SE(3) Trajectories
Pengzhi Yang
Shumon Koga
Arash Asgharivaskasi
Nikolay Atanasov
31
3
0
03 Dec 2022
Safe Reinforcement Learning with Probabilistic Control Barrier Functions
  for Ramp Merging
Safe Reinforcement Learning with Probabilistic Control Barrier Functions for Ramp Merging
Soumith Udatha
Yiwei Lyu
John M. Dolan
17
1
0
01 Dec 2022
Domain Generalization for Robust Model-Based Offline Reinforcement
  Learning
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
29
0
0
27 Nov 2022
$β$-Multivariational Autoencoder for Entangled Representation
  Learning in Video Frames
βββ-Multivariational Autoencoder for Entangled Representation Learning in Video Frames
F. Nouri
R. Bergevin
21
0
0
22 Nov 2022
Previous
12345
Next