Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.01999
Cited By
Recurrent World Models Facilitate Policy Evolution
4 September 2018
David R Ha
Jürgen Schmidhuber
SyDa
TPM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Recurrent World Models Facilitate Policy Evolution"
50 / 325 papers shown
Title
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
121
3
0
02 Aug 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
167
8
0
27 Jul 2022
A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation
Wenqi Zhang
Kai Zhao
Peng Li
Xiao Zhu
Yongliang Shen
Yanna Ma
Yingfeng Chen
Weiming Lu
LRM
57
8
0
25 Jul 2022
A survey of multimodal deep generative models
Masahiro Suzuki
Y. Matsuo
SyDa
DRL
82
82
0
05 Jul 2022
Variational Causal Dynamics: Discovering Modular World Models from Interventions
Anson Lei
Bernhard Schölkopf
Ingmar Posner
CML
78
9
0
22 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
121
110
0
19 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
75
40
0
17 Jun 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Remo Sasso
M. Sabatelli
M. Wiering
104
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Geng Chen
Yunbo Wang
Xiaokang Yang
103
42
0
27 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
113
8
0
22 May 2022
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
35
7
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
331
706
0
20 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
69
2
0
18 May 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
76
89
0
08 May 2022
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
Yining Hong
Kaichun Mo
L. Yi
Leonidas Guibas
Antonio Torralba
J. Tenenbaum
Chuang Gan
91
5
0
05 May 2022
Data-driven control of spatiotemporal chaos with reduced-order neural ODE-based models and reinforcement learning
Kevin Zeng
Alec J. Linot
M. Graham
AI4CE
77
29
0
01 May 2022
Predicting Real-time Scientific Experiments Using Transformer models and Reinforcement Learning
J. M. Parrilla-Gutierrez
AI4CE
32
0
0
25 Apr 2022
Learning Sequential Latent Variable Models from Multimodal Time Series Data
Oliver Limoyo
Trevor Ablett
Jonathan Kelly
73
5
0
21 Apr 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
78
19
0
11 Apr 2022
Temporal Alignment for History Representation in Reinforcement Learning
Aleksandr Ermolov
E. Sangineto
N. Sebe
AI4TS
45
2
0
07 Apr 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Erdun Gao
Kun Zhang
Sara Magliacane
CML
OffRL
134
35
0
30 Mar 2022
NavDreams: Towards Camera-Only RL Navigation Among Humans
Daniel Dugas
Olov Andersson
Roland Siegwart
Jen Jen Chung
VGen
84
12
0
23 Mar 2022
Multi-View Dreaming: Multi-View World Model with Contrastive Learning
Akira Kinose
Masashi Okada
Ryogo Okumura
T. Taniguchi
OffRL
68
11
0
15 Mar 2022
Towards Self-Supervised Learning of Global and Object-Centric Representations
Federico Baldassarre
Hossein Azizpour
SSL
3DPC
OCL
103
13
0
11 Mar 2022
Tactile-Sensitive NewtonianVAE for High-Accuracy Industrial Connector Insertion
Ryogo Okumura
Nobuki Nishio
T. Taniguchi
68
8
0
10 Mar 2022
SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning
A. Chester
Michael Dann
Fabio Zambetta
John Thangarajah
39
0
0
09 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
101
255
0
09 Mar 2022
"If you could see me through my eyes": Predicting Pedestrian Perception
Julian Petzold
Mostafa Wahby
F. Stark
Ulrich Behrje
Heiko Hamann
85
3
0
28 Feb 2022
Gradient-free Multi-domain Optimization for Autonomous Systems
Hongrui Zheng
Johannes Betz
Rahul Mangharam
54
7
0
28 Feb 2022
Abstraction for Deep Reinforcement Learning
Murray Shanahan
Melanie Mitchell
OffRL
95
28
0
10 Feb 2022
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau
Jinke He
M. Spaan
F. Oliehoek
58
4
0
03 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
82
4
0
20 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
112
107
0
11 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
88
17
0
01 Jan 2022
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni
Kiana Ehsani
Luca Weihs
Jordi Salvador
120
9
0
17 Dec 2021
Compositional Learning-based Planning for Vision POMDPs
Sampada Deglurkar
M. H. Lim
Johnathan Tucker
Zachary Sunberg
Aleksandra Faust
Claire Tomlin
79
5
0
17 Dec 2021
Assistive Tele-op: Leveraging Transformers to Collect Robotic Task Demonstrations
Henry M. Clever
Ankur Handa
H. Mazhar
Kevin Parker
Omer Shapira
Qian Wan
Yashraj S. Narang
Iretiayo Akinola
Maya Cakmak
Dieter Fox
78
18
0
09 Dec 2021
Causal Imitative Model for Autonomous Driving
Mohammad Reza Samsami
Mohammadhossein Bahari
Saber Salehkaleybar
Alexandre Alahi
CML
67
12
0
07 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
89
1
0
06 Dec 2021
Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Konpat Preechakul
Nattanat Chatthee
Suttisak Wizadwongsa
Supasorn Suwajanakorn
SyDa
DiffM
131
434
0
30 Nov 2021
Collective Intelligence for Deep Learning: A Survey of Recent Developments
David R Ha
Yu Tang
AI4CE
127
70
0
29 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
86
43
0
04 Nov 2021
Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution
Irving G. B. Petrazzini
Eric A. Antonelo
OffRL
59
12
0
03 Nov 2021
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
72
19
0
03 Nov 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng
Ingook Jang
Sungjin Ahn
VLM
79
62
0
27 Oct 2021
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning
Shitao Xiao
V. Subramanian
74
9
0
25 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search Spaces
Marco Bagatella
Miroslav Olsák
Michal Rolínek
Georg Martius
OffRL
65
7
0
12 Oct 2021
Neural Algorithmic Reasoners are Implicit Planners
Andreea Deac
Petar Velivcković
Ognjen Milinković
Pierre-Luc Bacon
Jian Tang
Mladen Nikolic
OffRL
72
24
0
11 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
118
35
0
06 Oct 2021
Dropout's Dream Land: Generalization from Learned Simulators to Reality
Zac Wellmer
James T. Kwok
SyDa
69
9
0
17 Sep 2021
Previous
1
2
3
4
5
6
7
Next