The Value Equivalence Principle for Model-Based Reinforcement Learning
6 November 2020
Christopher Grimm
André Barreto
Satinder Singh
David Silver
OffRL
Papers citing
"The Value Equivalence Principle for Model-Based Reinforcement Learning"
24 / 24 papers shown
Title
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
78
0
0
03 Jan 2025
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
46
2
0
11 Oct 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
40
0
0
27 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
54
1
0
15 Jun 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
45
3
0
20 May 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
31
21
0
17 Jan 2024
Pixel State Value Network for Combined Prediction and Planning in Interactive Environments
Sascha Rosbach
Stefan M. Leupold
S. Großjohann
Stefan Roth
29
0
0
11 Oct 2023
λ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
61
0
0
30 Jun 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
34
6
0
22 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
38
2
0
09 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
39
8
0
05 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núñez-Molina
Pablo Mesejo
Juan Fernández-Olivares
39
3
0
20 Apr 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
46
13
0
01 Mar 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
50
14
0
02 Feb 2023
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
33
4
0
30 Oct 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
35
18
0
15 Sep 2022
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin Walters
Lawson L. S. Wong
26
7
0
08 Jun 2022
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
38
1
0
04 Jun 2022
Transfer RL across Observation Feature Spaces via Model-Based Regularization
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Andrew Cohen
Furong Huang
OOD
OffRL
25
21
0
01 Jan 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Self-Consistent Models and Values
Gregory Farquhar
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
54
35
0
06 Jun 2021
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
30
70
0
06 Mar 2021