1809.01999
Cited By
Recurrent World Models Facilitate Policy Evolution
4 September 2018
David R Ha
Jürgen Schmidhuber
SyDa
TPM
Papers citing
"Recurrent World Models Facilitate Policy Evolution"
50 / 505 papers shown
Title
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
45
52
0
07 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents
Nicholas Conlon
Nisar R. Ahmed
D. Szafir
34
3
0
03 Mar 2023
K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs
Andrea Coletta
Svitlana Vyetrenko
T. Balch
OffRL
21
7
0
23 Feb 2023
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
Jaesik Yoon
Yi-Fu Wu
Heechul Bae
Sungjin Ahn
OCL
35
41
0
09 Feb 2023
A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions
G. Pihlgren
Konstantina Nikolaidou
Prakash Chandra Chhipa
Nosheen Abid
Rajkumar Saini
Fredrik Sandin
Marcus Liwicki
29
10
0
08 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
43
5
0
07 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Aligning Robot and Human Representations
Andreea Bobu
Andi Peng
Pulkit Agrawal
Julie A. Shah
Anca D. Dragan
48
10
0
03 Feb 2023
Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
18
0
0
31 Jan 2023
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Renders
Maarten de Rijke
27
23
0
20 Jan 2023
Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning
Zifan Wu
Chao Yu
Chong Chen
Jianye Hao
H. Zhuo
27
16
0
20 Jan 2023
Neuro-Symbolic World Models for Adapting to Open World Novelty
Jonathan C. Balloch
Zhiyu Lin
Robert Wright
Xiangyu Peng
Mustafa Hussain
Aarun Srinivas
Julia Kim
Mark O. Riedl
18
10
0
16 Jan 2023
World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges
T. Taniguchi
Shingo Murata
Masahiro Suzuki
D. Ognibene
Pablo Lanillos
...
L. Jamone
Tomoaki Nakamura
Alejandra Ciria
B. Lara
G. Pezzulo
32
52
0
14 Jan 2023
Estimation of User's World Model Using Graph2vec
Tatsuya Sakai
Takayuki Nagai
19
2
0
10 Jan 2023
Annotated History of Modern AI and Deep Learning
Juergen Schmidhuber
MLAU
AI4TS
AI4CE
33
22
0
21 Dec 2022
Towards Smooth Video Composition
Qihang Zhang
Ceyuan Yang
Yujun Shen
Yinghao Xu
Bolei Zhou
VGen
44
14
0
14 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Decentralized cooperative perception for autonomous vehicles: Learning to value the unknown
Maxime Chaveroche
Franck Davoine
V. Berge-Cherfaoui
17
1
0
12 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models
Atanas Mirchev
Baris Kayalibay
Ahmed Agha
Patrick van der Smagt
Daniel Cremers
Justin Bayer
VGen
31
0
0
06 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents
Kunal Pratap Singh
Jordi Salvador
Luca Weihs
Aniruddha Kembhavi
SSL
26
3
0
01 Dec 2022
Adaptive Scenario Subset Selection for Worst-Case Optimization and its Application to Well Placement Optimization
Atsuhiro Miyagi
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
27
1
0
29 Nov 2022
Learning and Understanding a Disentangled Feature Representation for Hidden Parameters in Reinforcement Learning
Christopher P. Reale
Rebecca L. Russell
17
1
0
29 Nov 2022
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
47
9
0
23 Nov 2022
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning
Junjie Wang
Yao Mu
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Ping Luo
Bin Wang
Jianye Hao
OffRL
30
3
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
23
4
0
22 Nov 2022
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
Burcu Küçükoglu
Walraaf Borkent
Bodo Rueckauer
Nasir Ahmad
Umut Güçlü
Marcel van Gerven
39
2
0
11 Nov 2022
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Jianye Hao
Pengyi Li
Hongyao Tang
Yan Zheng
Xian Fu
Zhaopeng Meng
29
23
0
26 Oct 2022
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
19
0
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
31
16
0
19 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
35
8
0
17 Oct 2022
Model-Based Imitation Learning for Urban Driving
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
112
135
0
14 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
18
16
0
13 Oct 2022
ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based Characters
Heyuan Yao
Zhenhua Song
B. Chen
Libin Liu
DRL
VGen
16
41
0
12 Oct 2022
The Role of Exploration for Task Transfer in Reinforcement Learning
Jonathan C. Balloch
Julia Kim
Jessica B. Langebrake Inman
Mark O. Riedl
OffRL
31
3
0
11 Oct 2022
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
Zeyu Gao
Yao Mu
Chen Chen
Yangang Ren
Shengbo Eben Li
Ping Luo
Yanfeng Lu
22
27
0
08 Oct 2022
EgoTaskQA: Understanding Human Tasks in Egocentric Videos
Baoxiong Jia
Ting Lei
Song-Chun Zhu
Siyuan Huang
EgoV
30
61
0
08 Oct 2022
LOPR: Latent Occupancy PRediction using Generative Models
Bernard Lange
Masha Itkina
Mykel J. Kochenderfer
AI4CE
46
5
0
03 Oct 2022
CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
3DV
OffRL
14
0
0
03 Oct 2022
Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders
Per-Arne Andersen
Ole-Christoffer Granmo
Morten Goodwin
OOD
28
0
0
03 Oct 2022
Improving Policy Learning via Language Dynamics Distillation
Victor Zhong
Jesse Mu
Luke Zettlemoyer
Edward Grefenstette
Tim Rocktäschel
OffRL
50
15
0
30 Sep 2022
Contrastive Unsupervised Learning of World Model with Invariant Causal Features
Rudra P. K. Poudel
Harit Pandya
R. Cipolla
SSL
CML
21
3
0
29 Sep 2022
Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum
Eric Schulz
26
1
0
29 Sep 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Sai Rajeswar
Pietro Mazzaglia
Tim Verbelen
Alexandre Piché
Bart Dhoedt
Rameswar Panda
Alexandre Lacoste
SSL
28
21
0
24 Sep 2022
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
40
7
0
19 Sep 2022