ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.04142
  4. Cited By
Imagined Value Gradients: Model-Based Policy Optimization with
  Transferable Latent Dynamics Models

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

9 October 2019
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
    OffRL
ArXivPDFHTML

Papers citing "Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models"

34 / 34 papers shown
Title
Evaluating World Models with LLM for Decision Making
Evaluating World Models with LLM for Decision Making
Chang Yang
Xinrun Wang
Junzhe Jiang
Qinggang Zhang
Xiao Huang
LLMAG
ELM
36
2
0
13 Nov 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World
  Model Disentanglement
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Lyna Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
37
6
0
15 Oct 2024
Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich
  Differentiable Simulation
Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation
Ignat Georgiev
K. Srinivasan
Jie Xu
Eric Heiden
Animesh Garg
41
6
0
28 May 2024
Guided Cooperation in Hierarchical Reinforcement Learning via
  Model-based Rollout
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
30
2
0
24 Sep 2023
Diminishing Return of Value Expansion Methods in Model-Based
  Reinforcement Learning
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
Daniel Palenicek
M. Lutter
João Carvalho
Jan Peters
29
4
0
07 Mar 2023
Leveraging Jumpy Models for Planning and Fast Learning in Robotic
  Domains
Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains
Jingwei Zhang
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
A. Abdolmaleki
Dushyant Rao
N. Heess
Martin Riedmiller
31
5
0
24 Feb 2023
Investigating the role of model-based learning in exploration and
  transfer
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
55
8
0
23 Oct 2022
A model-based approach to meta-Reinforcement Learning: Transformers and
  tree search
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
29
3
0
24 Aug 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
44
101
0
19 Jun 2022
DreamingV2: Reinforcement Learning with Discrete World Models without
  Reconstruction
DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction
Masashi Okada
T. Taniguchi
3DV
OffRL
28
22
0
01 Mar 2022
GrASP: Gradient-Based Affordance Selection for Planning
GrASP: Gradient-Based Affordance Selection for Planning
Vivek Veeriah
Zeyu Zheng
Richard L. Lewis
Satinder Singh
17
4
0
08 Feb 2022
Tutorial on amortized optimization
Tutorial on amortized optimization
Brandon Amos
OffRL
75
43
0
01 Feb 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Self-Consistent Models and Values
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
Evaluating model-based planning and planner amortization for continuous
  control
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Learning Dynamics Models for Model Predictive Agents
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
57
26
0
29 Sep 2021
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Martin Riedmiller
Jost Tobias Springenberg
Roland Hafner
N. Heess
OffRL
23
17
0
23 Aug 2021
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
Luis Pineda
Brandon Amos
Amy Zhang
Nathan Lambert
Roberto Calandra
OffRL
22
46
0
20 Apr 2021
Learning and Planning in Complex Action Spaces
Learning and Planning in Complex Action Spaces
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
21
76
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
11
66
0
13 Apr 2021
Latent Skill Planning for Exploration and Transfer
Latent Skill Planning for Exploration and Transfer
Kevin Xie
Homanga Bharadhwaj
Danijar Hafner
Animesh Garg
Florian Shkurti
39
20
0
27 Nov 2020
On the role of planning in model-based deep reinforcement learning
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
19
65
0
08 Nov 2020
Representation Matters: Improving Perception and Exploration for
  Robotics
Representation Matters: Improving Perception and Exploration for Robotics
Markus Wulfmeier
Arunkumar Byravan
Tim Hertweck
I. Higgins
Ankush Gupta
...
Malcolm Reynolds
Denis Teplyashin
Roland Hafner
Thomas Lampe
Martin Riedmiller
29
15
0
03 Nov 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement
  Learning
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
71
17
0
23 Oct 2020
Iterative Amortized Policy Optimization
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
63
21
0
20 Oct 2020
Local Search for Policy Iteration in Continuous Control
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
16
15
0
12 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
48
810
0
05 Oct 2020
On the model-based stochastic value gradient for continuous
  reinforcement learning
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
12
71
0
28 Aug 2020
Goal-Aware Prediction: Learning to Model What Matters
Goal-Aware Prediction: Learning to Model What Matters
Suraj Nair
Silvio Savarese
Chelsea Finn
16
64
0
14 Jul 2020
Learning to Fly via Deep Model-Based Reinforcement Learning
Learning to Fly via Deep Model-Based Reinforcement Learning
Philip Becker-Ehmck
Maximilian Karl
Jan Peters
Patrick van der Smagt
SSL
35
37
0
19 Mar 2020
Dream to Control: Learning Behaviors by Latent Imagination
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
11
1,298
0
03 Dec 2019
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
134
928
0
07 Jul 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
338
11,684
0
09 Mar 2017
1