ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.03506
  4. Cited By
The Value Equivalence Principle for Model-Based Reinforcement Learning

The Value Equivalence Principle for Model-Based Reinforcement Learning

6 November 2020
Christopher Grimm
André Barreto
Satinder Singh
David Silver
    OffRL
ArXivPDFHTML

Papers citing "The Value Equivalence Principle for Model-Based Reinforcement Learning"

24 / 24 papers shown
Title
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
78
0
0
03 Jan 2025
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
46
2
0
11 Oct 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of
  Dyna-style Planning
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
40
0
0
27 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
54
1
0
15 Jun 2024
Feasibility Consistent Representation Learning for Safe Reinforcement
  Learning
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
45
3
0
20 May 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
31
21
0
17 Jan 2024
Pixel State Value Network for Combined Prediction and Planning in
  Interactive Environments
Pixel State Value Network for Combined Prediction and Planning in Interactive Environments
Sascha Rosbach
Stefan M. Leupold
S. Großjohann
Stefan Roth
29
0
0
11 Oct 2023
$λ$-models: Effective Decision-Aware Reinforcement Learning with
  Latent Models
λλλ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
61
0
0
30 Jun 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
34
6
0
22 May 2023
Policy Gradient Methods in the Presence of Symmetries and State
  Abstractions
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
38
2
0
09 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
39
8
0
05 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential
  Decision Making
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
39
3
0
20 Apr 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and
  Algorithms
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
46
13
0
01 Mar 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with
  Lipschitz Regularized Value Function
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
50
14
0
02 Feb 2023
Operator Splitting Value Iteration
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement
  Learning
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
33
4
0
30 Oct 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
35
18
0
15 Sep 2022
Integrating Symmetry into Differentiable Planning with Steerable
  Convolutions
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin Walters
Lawson L. S. Wong
26
7
0
08 Jun 2022
Between Rate-Distortion Theory & Value Equivalence in Model-Based
  Reinforcement Learning
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
38
1
0
04 Jun 2022
Transfer RL across Observation Feature Spaces via Model-Based
  Regularization
Transfer RL across Observation Feature Spaces via Model-Based Regularization
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Andrew Cohen
Furong Huang
OOD
OffRL
25
21
0
01 Jan 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Self-Consistent Models and Values
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit
  Differentiation
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
54
35
0
06 Jun 2021
Reinforcement Learning, Bit by Bit
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
30
70
0
06 Mar 2021
1