ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.04551
  4. Cited By
Learning Latent Dynamics for Planning from Pixels

Learning Latent Dynamics for Planning from Pixels

12 November 2018
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
    BDL
ArXivPDFHTML

Papers citing "Learning Latent Dynamics for Planning from Pixels"

50 / 391 papers shown
Title
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
36
1
0
28 May 2023
Black-Box vs. Gray-Box: A Case Study on Learning Table Tennis Ball
  Trajectory Prediction with Spin and Impacts
Black-Box vs. Gray-Box: A Case Study on Learning Table Tennis Ball Trajectory Prediction with Spin and Impacts
Jan Achterhold
Philip Tobuschat
Hao Ma
Dieter Buechler
Michael Muehlebach
Joerg Stueckler
14
6
0
24 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
32
6
0
22 May 2023
Understanding the World to Solve Social Dilemmas Using Multi-Agent
  Reinforcement Learning
Understanding the World to Solve Social Dilemmas Using Multi-Agent Reinforcement Learning
Manuel Rios
Nicanor Quijano
Luis Felipe Giraldo
36
1
0
19 May 2023
Policy Gradient Methods in the Presence of Symmetries and State
  Abstractions
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
33
2
0
09 May 2023
Cheap and Deterministic Inference for Deep State-Space Models of
  Interacting Dynamical Systems
Cheap and Deterministic Inference for Deep State-Space Models of Interacting Dynamical Systems
Andreas Look
M. Kandemir
Barbara Rakitsch
Jan Peters
BDL
38
6
0
02 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Geoffrey Cideron
B. Tabanpour
Sebastian Curi
Sertan Girgin
Léonard Hussenot
Gabriel Dulac-Arnold
M. Geist
Olivier Pietquin
Robert Dadashi
OOD
84
2
0
02 May 2023
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive
  Physics under Challenging Scenes
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes
Haotian Xue
Antonio Torralba
J. Tenenbaum
Daniel L. K. Yamins
Yunzhu Li
H. Tung
PINN
VGen
AI4CE
64
8
0
22 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
27
2
0
21 Apr 2023
Filter-Aware Model-Predictive Control
Filter-Aware Model-Predictive Control
Baris Kayalibay
Atanas Mirchev
Ahmed Agha
Patrick van der Smagt
Justin Bayer
44
0
0
20 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
13
0
0
14 Apr 2023
Learning Robot Manipulation from Cross-Morphology Demonstration
Learning Robot Manipulation from Cross-Morphology Demonstration
G. Salhotra
Isabella Liu
Gaurav Sukhatme
LM&Ro
25
9
0
07 Apr 2023
Tracker: Model-based Reinforcement Learning for Tracking Control of
  Human Finger Attached with Thin McKibben Muscles
Tracker: Model-based Reinforcement Learning for Tracking Control of Human Finger Attached with Thin McKibben Muscles
Daichi Saito
Eri Nagatomo
Jefferson Pardomuan
Hideki Koike
21
0
0
01 Apr 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Xiangming Zhu
Yitao Zheng
Yunbo Wang
Xiaokang Yang
34
0
0
27 Mar 2023
Learning Foresightful Dense Visual Affordance for Deformable Object
  Manipulation
Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation
Ruihai Wu
Chuanruo Ning
Hao Dong
AI4CE
24
28
0
20 Mar 2023
Discovering Predictable Latent Factors for Time Series Forecasting
Discovering Predictable Latent Factors for Time Series Forecasting
Jingyi Hou
Zhen Dong
Jiayu Zhou
Zhijie Liu
AI4TS
BDL
30
1
0
18 Mar 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
16
3
0
17 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
36
1
0
09 Mar 2023
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement
  Learning Perspective
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective
Xintong Yang
Ze Ji
Jing Wu
Yunyu Lai
46
12
0
09 Mar 2023
Model-based Constrained MDP for Budget Allocation in Sequential
  Incentive Marketing
Model-based Constrained MDP for Budget Allocation in Sequential Incentive Marketing
Shuai Xiao
Le Guo
Zaifan Jiang
Lei Lv
Yuanbo Chen
Jun Zhu
Shuang Yang
30
21
0
02 Mar 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
41
9
0
14 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
CRC-RL: A Novel Visual Feature Representation Architecture for
  Unsupervised Reinforcement Learning
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
34
1
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
34
3
0
30 Jan 2023
Variational Latent Branching Model for Off-Policy Evaluation
Variational Latent Branching Model for Off-Policy Evaluation
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
36
6
0
28 Jan 2023
Predictive World Models from Real-World Partial Observations
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
44
5
0
12 Jan 2023
Action Dynamics Task Graphs for Learning Plannable Representations of
  Procedural Tasks
Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks
Weichao Mao
Ruta Desai
Michael L. Iuzzolino
Nitin Kamra
36
5
0
11 Jan 2023
Multimodal Sequential Generative Models for Semi-Supervised Language
  Instruction Following
Multimodal Sequential Generative Models for Semi-Supervised Language Instruction Following
K. Akuzawa
Yusuke Iwasawa
Yutaka Matsuo
GAN
35
0
0
29 Dec 2022
A Simple Decentralized Cross-Entropy Method
A Simple Decentralized Cross-Entropy Method
Zichen Zhang
Jun Jin
Martin Jägersand
Jun Luo
Dale Schuurmans
15
8
0
16 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation
  Learning
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
28
70
0
12 Dec 2022
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various
  Robotic Manipulator Tasks
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks
Altun Rzayev
Vahid Tavakol Aghaei
OffRL
21
0
0
11 Dec 2022
A Rubric for Human-like Agents and NeuroAI
A Rubric for Human-like Agents and NeuroAI
Ida Momennejad
60
14
0
08 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models
PRISM: Probabilistic Real-Time Inference in Spatial World Models
Atanas Mirchev
Baris Kayalibay
Ahmed Agha
Patrick van der Smagt
Daniel Cremers
Justin Bayer
VGen
31
0
0
06 Dec 2022
Learning to Optimize in Model Predictive Control
Learning to Optimize in Model Predictive Control
Jacob Sacks
Byron Boots
29
22
0
05 Dec 2022
Learning Sampling Distributions for Model Predictive Control
Learning Sampling Distributions for Model Predictive Control
Jacob Sacks
Byron Boots
13
21
0
05 Dec 2022
Tackling Visual Control via Multi-View Exploration Maximization
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
30
1
0
28 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Powderworld: A Platform for Understanding Generalization via Rich Task
  Distributions
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
47
9
0
23 Nov 2022
Active Exploration based on Information Gain by Particle Filter for
  Efficient Spatial Concept Formation
Active Exploration based on Information Gain by Particle Filter for Efficient Spatial Concept Formation
Akira Taniguchi
Y. Tabuchi
Tomochika Ishikawa
Lotfi El Hafi
Y. Hagiwara
T. Taniguchi
26
4
0
20 Nov 2022
Joint Embedding Predictive Architectures Focus on Slow Features
Joint Embedding Predictive Architectures Focus on Slow Features
Vlad Sobal
V. JyothirS
Siddhartha Jalagam
Nicolas Carion
Kyunghyun Cho
Yann LeCun
24
8
0
20 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based
  Reinforcement Learning
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
38
4
0
12 Nov 2022
On learning history based policies for controlling Markov decision
  processes
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
21
5
0
06 Nov 2022
Disentangled (Un)Controllable Features
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
27
0
0
31 Oct 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward
  Long-Horizon Goal-Conditioned Reinforcement Learning
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
34
3
0
28 Oct 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for
  Industrial Insertion of Novel Connectors from Vision
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
28
15
0
27 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via
  Differentiable Physics-Based Simulation and Rendering
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
18
24
0
27 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
Random Actions vs Random Policies: Bootstrapping Model-Based Direct
  Policy Search
Random Actions vs Random Policies: Bootstrapping Model-Based Direct Policy Search
Elias Hanna
Alexandre Coninx
Stéphane Doncieux
OffRL
34
0
0
21 Oct 2022
Previous
12345678
Next