1811.04551
Learning Latent Dynamics for Planning from Pixels
12 November 2018
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
Papers citing "Learning Latent Dynamics for Planning from Pixels"
50 / 994 papers shown
Title
Behavior Prior Representation learning for Offline Reinforcement Learning
Hongyu Zang
Xin Li
Jie Yu
Chen Liu
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
OnRL
104
10
0
02 Nov 2022
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
53
0
0
31 Oct 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
66
4
0
28 Oct 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
125
16
0
27 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
75
26
0
27 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
88
23
0
24 Oct 2022
Active Exploration for Robotic Manipulation
Tim Schneider
Boris Belousov
Georgia Chalvatzaki
Diego Romeres
Devesh K. Jha
Jan Peters
129
11
0
23 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
112
10
0
23 Oct 2022
STAP: Sequencing Task-Agnostic Policies
Christopher Agia
Toki Migimatsu
Jiajun Wu
Jeannette Bohg
109
20
0
21 Oct 2022
Random Actions vs Random Policies: Bootstrapping Model-Based Direct Policy Search
Elias Hanna
Alexandre Coninx
Stéphane Doncieux
OffRL
58
0
0
21 Oct 2022
Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation
Chia-Man Hung
Shaohong Zhong
Walter Goodwin
Oiwi Parker Jones
Martin Engelcke
Ioannis Havoutis
Ingmar Posner
DRL
49
13
0
21 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
68
9
0
21 Oct 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
97
2
0
20 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
72
17
0
19 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
89
21
0
18 Oct 2022
On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning
P. Becker
Gerhard Neumann
68
9
0
17 Oct 2022
Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion
Lev Grossman
Brian Plancher
MQ
63
4
0
14 Oct 2022
Model-Based Imitation Learning for Urban Driving
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
179
142
0
14 Oct 2022
Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate
Dongjie Yu
Wenjun Zou
Yujie Yang
Haitong Ma
Sheng Li
Jingliang Duan
Jianyu Chen
86
15
0
14 Oct 2022
Reinforcement Learning with Automated Auxiliary Loss Search
Tairan He
Yuge Zhang
Kan Ren
Minghuan Liu
Che Wang
Weinan Zhang
Yuqing Yang
Dongsheng Li
108
16
0
12 Oct 2022
The Role of Exploration for Task Transfer in Reinforcement Learning
Jonathan C. Balloch
Julia Kim
Jessica B. Langebrake Inman
Mark O. Riedl
OffRL
114
3
0
11 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
87
28
0
10 Oct 2022
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
Zeyu Gao
Yao Mu
Chen Chen
Yangang Ren
Shengbo Eben Li
Ping Luo
Yanfeng Lu
79
30
0
08 Oct 2022
Learning the Dynamics of Compliant Tool-Environment Interaction for Visuo-Tactile Contact Servoing
Mark Van der Merwe
Dmitry Berenson
Nima Fazeli
79
13
0
07 Oct 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
LM&Ro
58
8
0
07 Oct 2022
Continuous Monte Carlo Graph Search
Kalle Kujanpää
Amin Babadi
Yi Zhao
Arno Solin
Alexander Ilin
Joni Pajarinen
LRM
390
2
0
04 Oct 2022
LOPR: Latent Occupancy PRediction using Generative Models
Bernard Lange
Masha Itkina
Mykel J. Kochenderfer
AI4CE
108
7
0
03 Oct 2022
CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
3DV
OffRL
21
0
0
03 Oct 2022
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang
Aaron Courville
Yoshua Bengio
Qinqing Zheng
Amy Zhang
Ricky T. Q. Chen
OOD
101
10
0
03 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
59
10
0
02 Oct 2022
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
128
14
0
02 Oct 2022
Visuo-Tactile Transformers for Manipulation
Yizhou Chen
A. Sipos
Mark Van der Merwe
Nima Fazeli
ViT
91
36
0
30 Sep 2022
PyPose: A Library for Robot Learning with Physics-based Optimization
Chen Wang
Dasong Gao
Kuan Xu
Junyi Geng
Yaoyu Hu
...
Jiajun Wu
Lihua Xie
Luca Carlone
Marco Hutter
Sebastian Scherer
PINN
AI4CE
140
46
0
30 Sep 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Daesol Cho
D. Shim
H. J. Kim
OffRL
96
11
0
30 Sep 2022
Hyperbolic VAE via Latent Gaussian Distributions
Seunghyuk Cho
Juyong Lee
Dongwoo Kim
DRL
116
8
0
30 Sep 2022
Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum
Eric Schulz
53
1
0
29 Sep 2022
Unified Control Framework for Real-Time Interception and Obstacle Avoidance of Fast-Moving Objects with Diffusion Variational Autoencoder
Apan Dastider
Hao Fang
Mingjie Lin
48
1
0
27 Sep 2022
Training Efficient Controllers via Analytic Policy Gradient
Nina Wiedemann
Valentin Wüest
Antonio Loquercio
M. Müller
Dario Floreano
Davide Scaramuzza
OffRL
74
20
0
26 Sep 2022
It Takes Two: Learning to Plan for Human-Robot Cooperative Carrying
Eley Ng
Ziang Liu
Monroe Kennedy
71
18
0
26 Sep 2022
Stochastic Gradient Descent Captures How Children Learn About Physics
Luca M. Schulze Buschoff
Eric Schulz
Marcel Binz
74
0
0
25 Sep 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Sai Rajeswar
Pietro Mazzaglia
Tim Verbelen
Alexandre Piché
Bart Dhoedt
Rameswar Panda
Alexandre Lacoste
SSL
95
21
0
24 Sep 2022
Partially Observable Markov Decision Processes in Robotics: A Survey
M. Lauri
David Hsu
Joni Pajarinen
144
107
0
21 Sep 2022
Locally Constrained Representations in Reinforcement Learning
Somjit Nath
Rushiv Arora
Samira Ebrahimi Kahou
OOD
OffRL
53
0
0
20 Sep 2022
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
88
8
0
19 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
96
37
0
19 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
106
27
0
18 Sep 2022
A Biologically-Inspired Dual Stream World Model
Arthur Juliani
Margaret E. Sereno
86
0
0
16 Sep 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
97
20
0
15 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
65
29
0
15 Sep 2022
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Augustine N. Mavor-Parker
Matthew J. Sargent
Christian Pehle
Andrea Banino
Lewis D. Griffin
Caswell Barry
62
1
0
14 Sep 2022