Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.01999
Cited By
Recurrent World Models Facilitate Policy Evolution
4 September 2018
David R Ha
Jürgen Schmidhuber
SyDa
TPM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recurrent World Models Facilitate Policy Evolution"
50 / 505 papers shown
Title
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
48
25
0
18 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
27
1
0
07 Sep 2022
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLM
OffRL
19
162
0
01 Sep 2022
Intelligent problem-solving as integrated hierarchical reinforcement learning
Manfred Eppe
Christian Gumbsch
Matthias Kerzel
Phuong D. H. Nguyen
Martin Volker Butz
S. Wermter
31
75
0
18 Aug 2022
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
31
3
0
02 Aug 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
31
8
0
27 Jul 2022
A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation
Wenqi Zhang
Kai Zhao
Peng Li
Xiao Zhu
Yongliang Shen
Yanna Ma
Yingfeng Chen
Weiming Lu
LRM
29
8
0
25 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
30
31
0
13 Jul 2022
A survey of multimodal deep generative models
Masahiro Suzuki
Y. Matsuo
SyDa
DRL
57
76
0
05 Jul 2022
Variational Causal Dynamics: Discovering Modular World Models from Interventions
Anson Lei
Bernhard Schölkopf
Ingmar Posner
CML
19
8
0
22 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
50
101
0
19 Jun 2022
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Yao Mu
Shoufa Chen
Mingyu Ding
Jianyu Chen
Runjian Chen
Ping Luo
ViT
23
9
0
17 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
12
34
0
17 Jun 2022
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning
Xuefeng Jin
Xu-Hui Liu
Shengyi Jiang
Yang Yu
OffRL
31
4
0
04 Jun 2022
Offline Reinforcement Learning with Causal Structured World Models
Zhengbang Zhu
Xiong-Hui Chen
Hong Tian
Kun Zhang
Yang Yu
CML
OffRL
12
16
0
03 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
8
2
0
31 May 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Remo Sasso
M. Sabatelli
M. Wiering
49
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
26
39
0
27 May 2022
POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning
Frederik Schubert
C. Benjamins
Sebastian Dohler
Bodo Rosenhahn
Marius Lindauer
SSL
OffRL
62
4
0
23 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
29
8
0
22 May 2022
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
31
7
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
633
0
20 May 2022
Deterministic training of generative autoencoders using invertible layers
Gianluigi Silvestri
Daan Roos
L. Ambrogioni
TPM
21
2
0
19 May 2022
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous Driving Tasks
A. Pak
Hemanth Manjunatha
Dimitar Filev
Panagiotis Tsiotras
14
5
0
18 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
41
2
0
18 May 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
25
88
0
08 May 2022
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
Yining Hong
Kaichun Mo
L. Yi
Leonidas J. Guibas
Antonio Torralba
J. Tenenbaum
Chuang Gan
42
5
0
05 May 2022
RLFlow: Optimising Neural Network Subgraph Transformation with World Models
Sean Parker
Sami Alabed
Eiko Yoneki
16
0
0
03 May 2022
Data-driven control of spatiotemporal chaos with reduced-order neural ODE-based models and reinforcement learning
Kevin Zeng
Alec J. Linot
M. Graham
AI4CE
22
28
0
01 May 2022
TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning
Konstantin Sozykin
Andrei Chertkov
R. Schutski
Anh-Huy Phan
A. Cichocki
Ivan Oseledets
14
35
0
30 Apr 2022
Predicting Real-time Scientific Experiments Using Transformer models and Reinforcement Learning
J. M. Parrilla-Gutierrez
AI4CE
24
0
0
25 Apr 2022
Deep Reinforcement Learning Using a Low-Dimensional Observation Filter for Visual Complex Video Game Playing
V. A. Kich
J. C. Jesus
Ricardo B. Grando
A. H. Kolling
Gabriel V. Heisler
R. S. Guerra
OffRL
30
2
0
24 Apr 2022
Learning Sequential Latent Variable Models from Multimodal Time Series Data
Oliver Limoyo
Trevor Ablett
Jonathan Kelly
25
5
0
21 Apr 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
11
19
0
11 Apr 2022
Gradient-Based Trajectory Optimization With Learned Dynamics
Bhavya Sukhija
Nathanael Kohler
Miguel Zamora
Simon Zimmermann
Sebastian Curi
Andreas Krause
Stelian Coros
30
9
0
09 Apr 2022
Temporal Alignment for History Representation in Reinforcement Learning
Aleksandr Ermolov
E. Sangineto
N. Sebe
AI4TS
21
2
0
07 Apr 2022
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
19
4
0
02 Apr 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Erdun Gao
Kun Zhang
Sara Magliacane
CML
OffRL
47
32
0
30 Mar 2022
NavDreams: Towards Camera-Only RL Navigation Among Humans
Daniel Dugas
Olov Andersson
Roland Siegwart
Jen Jen Chung
VGen
23
12
0
23 Mar 2022
Multi-View Dreaming: Multi-View World Model with Contrastive Learning
Akira Kinose
Masashi Okada
Ryogo Okumura
T. Taniguchi
OffRL
21
10
0
15 Mar 2022
Towards Self-Supervised Learning of Global and Object-Centric Representations
Federico Baldassarre
Hossein Azizpour
SSL
3DPC
OCL
46
13
0
11 Mar 2022
Tactile-Sensitive NewtonianVAE for High-Accuracy Industrial Connector Insertion
Ryogo Okumura
Nobuki Nishio
T. Taniguchi
22
8
0
10 Mar 2022
SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning
A. Chester
Michael Dann
Fabio Zambetta
John Thangarajah
11
0
0
09 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
36
224
0
09 Mar 2022
"If you could see me through my eyes": Predicting Pedestrian Perception
Julian Petzold
Mostafa Wahby
F. Stark
Ulrich Behrje
Heiko Hamann
45
3
0
28 Feb 2022
Gradient-free Multi-domain Optimization for Autonomous Systems
Hongrui Zheng
Johannes Betz
Rahul Mangharam
14
7
0
28 Feb 2022
Abstraction for Deep Reinforcement Learning
Murray Shanahan
Melanie Mitchell
OffRL
35
28
0
10 Feb 2022
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau
Jinke He
M. Spaan
F. Oliehoek
27
4
0
03 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
32
4
0
20 Jan 2022
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
35
0
0
14 Jan 2022
Previous
1
2
3
...
5
6
7
...
9
10
11
Next