ResearchTrend.AI
Recurrent World Models Facilitate Policy Evolution


4 September 2018
David R Ha
Jürgen Schmidhuber
    SyDa
    TPM

Papers citing "Recurrent World Models Facilitate Policy Evolution"

50 / 505 papers shown
Title
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
45
52
0
07 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents
Nicholas Conlon
Nisar R. Ahmed
D. Szafir
34
3
0
03 Mar 2023
K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs
Andrea Coletta
Svitlana Vyetrenko
T. Balch
OffRL
21
7
0
23 Feb 2023
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
Jaesik Yoon
Yi-Fu Wu
Heechul Bae
Sungjin Ahn
OCL
35
41
0
09 Feb 2023
A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions
G. Pihlgren
Konstantina Nikolaidou
Prakash Chandra Chhipa
Nosheen Abid
Rajkumar Saini
Fredrik Sandin
Marcus Liwicki
29
10
0
08 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
43
5
0
07 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Aligning Robot and Human Representations
Andreea Bobu
Andi Peng
Pulkit Agrawal
Julie A. Shah
Anca D. Dragan
48
10
0
03 Feb 2023
Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
18
0
0
31 Jan 2023
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Renders
Maarten de Rijke
27
23
0
20 Jan 2023
Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning
Zifan Wu
Chao Yu
Chong Chen
Jianye Hao
H. Zhuo
27
16
0
20 Jan 2023
Neuro-Symbolic World Models for Adapting to Open World Novelty
Jonathan C. Balloch
Zhiyu Lin
Robert Wright
Xiangyu Peng
Mustafa Hussain
Aarun Srinivas
Julia Kim
Mark O. Riedl
18
10
0
16 Jan 2023
World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges
T. Taniguchi
Shingo Murata
Masahiro Suzuki
D. Ognibene
Pablo Lanillos
...
L. Jamone
Tomoaki Nakamura
Alejandra Ciria
B. Lara
G. Pezzulo
32
52
0
14 Jan 2023
Estimation of User's World Model Using Graph2vec
Tatsuya Sakai
Takayuki Nagai
19
2
0
10 Jan 2023
Annotated History of Modern AI and Deep Learning
Juergen Schmidhuber
MLAU
AI4TS
AI4CE
33
22
0
21 Dec 2022
Towards Smooth Video Composition
Qihang Zhang
Ceyuan Yang
Yujun Shen
Yinghao Xu
Bolei Zhou
VGen
44
14
0
14 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Decentralized cooperative perception for autonomous vehicles: Learning to value the unknown
Maxime Chaveroche
Franck Davoine
V. Berge-Cherfaoui
17
1
0
12 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models
Atanas Mirchev
Baris Kayalibay
Ahmed Agha
Patrick van der Smagt
Daniel Cremers
Justin Bayer
VGen
31
0
0
06 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents
Kunal Pratap Singh
Jordi Salvador
Luca Weihs
Aniruddha Kembhavi
SSL
26
3
0
01 Dec 2022
Adaptive Scenario Subset Selection for Worst-Case Optimization and its Application to Well Placement Optimization
Atsuhiro Miyagi
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
27
1
0
29 Nov 2022
Learning and Understanding a Disentangled Feature Representation for Hidden Parameters in Reinforcement Learning
Christopher P. Reale
Rebecca L. Russell
17
1
0
29 Nov 2022
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
47
9
0
23 Nov 2022
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning
Junjie Wang
Yao Mu
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Ping Luo
Bin Wang
Jianye Hao
OffRL
30
3
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
23
4
0
22 Nov 2022
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
Burcu Küçükoglu
Walraaf Borkent
Bodo Rueckauer
Nasir Ahmad
Umut Güçlü
Marcel van Gerven
39
2
0
11 Nov 2022
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Jianye Hao
Pengyi Li
Hongyao Tang
Yan Zheng
Xian Fu
Zhaopeng Meng
29
23
0
26 Oct 2022
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
19
0
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
31
16
0
19 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
35
8
0
17 Oct 2022
Model-Based Imitation Learning for Urban Driving
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
112
135
0
14 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
18
16
0
13 Oct 2022
ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based Characters
Heyuan Yao
Zhenhua Song
B. Chen
Libin Liu
DRL
VGen
16
41
0
12 Oct 2022
The Role of Exploration for Task Transfer in Reinforcement Learning
Jonathan C. Balloch
Julia Kim
Jessica B. Langebrake Inman
Mark O. Riedl
OffRL
31
3
0
11 Oct 2022
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
Zeyu Gao
Yao Mu
Chen Chen
Yangang Ren
Shengbo Eben Li
Ping Luo
Yanfeng Lu
22
27
0
08 Oct 2022
EgoTaskQA: Understanding Human Tasks in Egocentric Videos
Baoxiong Jia
Ting Lei
Song-Chun Zhu
Siyuan Huang
EgoV
30
61
0
08 Oct 2022
LOPR: Latent Occupancy PRediction using Generative Models
Bernard Lange
Masha Itkina
Mykel J. Kochenderfer
AI4CE
46
5
0
03 Oct 2022
CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
3DV
OffRL
14
0
0
03 Oct 2022
Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders
Per-Arne Andersen
Ole-Christoffer Granmo
Morten Goodwin
OOD
28
0
0
03 Oct 2022
Improving Policy Learning via Language Dynamics Distillation
Victor Zhong
Jesse Mu
Luke Zettlemoyer
Edward Grefenstette
Tim Rocktaschel
OffRL
50
15
0
30 Sep 2022
Contrastive Unsupervised Learning of World Model with Invariant Causal Features
Rudra P. K. Poudel
Harit Pandya
R. Cipolla
SSL
CML
21
3
0
29 Sep 2022
Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum
Eric Schulz
26
1
0
29 Sep 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Sai Rajeswar
Pietro Mazzaglia
Tim Verbelen
Alexandre Piché
Bart Dhoedt
Rameswar Panda
Alexandre Lacoste
SSL
28
21
0
24 Sep 2022
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
40
7
0
19 Sep 2022