ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.12840
  4. Cited By
Self-Consistent Models and Values

Self-Consistent Models and Values

25 October 2021
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
ArXivPDFHTML

Papers citing "Self-Consistent Models and Values"

34 / 34 papers shown
Title
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for
  Reinforcement Learning
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
38
46
0
08 Jun 2021
Online and Offline Reinforcement Learning by Planning with a Learned
  Model
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser
Thomas Hubert
Amol Mandhane
M. Barekatain
Ioannis Antonoglou
David Silver
OffRL
45
114
0
13 Apr 2021
Podracer architectures for scalable Reinforcement Learning
Podracer architectures for scalable Reinforcement Learning
Matteo Hessel
M. Kroiss
Aidan Clark
Iurii Kemaev
John Quan
Thomas Keck
Fabio Viola
H. V. Hasselt
26
39
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
36
66
0
13 Apr 2021
PsiPhi-Learning: Reinforcement Learning with Demonstrations using
  Successor Features and Inverse Temporal Difference Learning
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Angelos Filos
Clare Lyle
Y. Gal
Sergey Levine
Natasha Jaques
Gregory Farquhar
28
22
0
24 Feb 2021
On the role of planning in model-based deep reinforcement learning
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
34
65
0
08 Nov 2020
The Value Equivalence Principle for Model-Based Reinforcement Learning
The Value Equivalence Principle for Model-Based Reinforcement Learning
Christopher Grimm
André Barreto
Satinder Singh
David Silver
OffRL
28
85
0
06 Nov 2020
Forethought and Hindsight in Credit Assignment
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
42
25
0
26 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
62
834
0
05 Oct 2020
Value-driven Hindsight Modelling
Value-driven Hindsight Modelling
A. Guez
Fabio Viola
T. Weber
Lars Buesing
Steven Kapturowski
Doina Precup
David Silver
N. Heess
OffRL
37
12
0
19 Feb 2020
Frequency-based Search-control in Dyna
Frequency-based Search-control in Dyna
Yangchen Pan
Jincheng Mei
Amir-massoud Farahmand
27
15
0
14 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
43
33
0
07 Feb 2020
Dream to Control: Learning Behaviors by Latent Imagination
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
81
1,333
0
03 Dec 2019
Imagined Value Gradients: Model-Based Policy Optimization with
  Transferable Latent Dynamics Models
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
38
41
0
09 Oct 2019
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a
  Latent Variable Model
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Alex X. Lee
Anusha Nagabandi
Pieter Abbeel
Sergey Levine
OffRL
BDL
47
377
0
01 Jul 2019
Shaping Belief States with Generative Environment Models for RL
Shaping Belief States with Generative Environment Models for RL
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRL
AI4CE
55
118
0
21 Jun 2019
When to use parametric models in reinforcement learning?
When to use parametric models in reinforcement learning?
H. V. Hasselt
Matteo Hessel
John Aslanides
55
192
0
12 Jun 2019
Deep Residual Reinforcement Learning
Deep Residual Reinforcement Learning
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
17
31
0
03 May 2019
Model-Based Reinforcement Learning for Atari
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
82
851
0
01 Mar 2019
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
59
471
0
14 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional
  State Spaces
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
G. Z. Holland
Erik Talvitie
Michael Bowling
AI4CE
22
43
0
05 Jun 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
39
121
0
29 May 2018
World Models
World Models
David R Ha
Jürgen Schmidhuber
SyDa
82
1,050
0
27 Mar 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
114
1,584
0
05 Feb 2018
Self-supervised Deep Reinforcement Learning with Generalized Computation
  Graphs for Robot Navigation
Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation
G. Kahn
Adam R. Villaflor
Bosen Ding
Pieter Abbeel
Sergey Levine
SSL
57
287
0
29 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
51
549
0
18 Sep 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
61
552
0
19 Jul 2017
Value Prediction Network
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
58
332
0
11 Jul 2017
The Predictron: End-To-End Learning and Planning
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
36
289
0
28 Dec 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
29
1,225
0
16 Nov 2016
Safe and Efficient Off-Policy Reinforcement Learning
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
101
611
0
08 Jun 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
48
650
0
09 Feb 2016
Action-Conditional Video Prediction using Deep Networks in Atari Games
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
62
852
0
31 Jul 2015
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
42
2,992
0
19 Jul 2012
1