ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.01999
  4. Cited By
Recurrent World Models Facilitate Policy Evolution

Recurrent World Models Facilitate Policy Evolution

4 September 2018
David R Ha
Jürgen Schmidhuber
    SyDa
    TPM
ArXivPDFHTML

Papers citing "Recurrent World Models Facilitate Policy Evolution"

50 / 505 papers shown
Title
Simplifying Model-based RL: Learning Representations, Latent-space
  Models, and Policies with One Objective
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
48
25
0
18 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid
  generalization
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
27
1
0
07 Sep 2022
Transformers are Sample-Efficient World Models
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLM
OffRL
19
162
0
01 Sep 2022
Intelligent problem-solving as integrated hierarchical reinforcement
  learning
Intelligent problem-solving as integrated hierarchical reinforcement learning
Manfred Eppe
Christian Gumbsch
Matthias Kerzel
Phuong D. H. Nguyen
Martin Volker Butz
S. Wermter
31
75
0
18 Aug 2022
Implicit Two-Tower Policies
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
31
3
0
02 Aug 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting
  Uncertain Outcomes
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
31
8
0
27 Jul 2022
A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for
  Human-Like Navigation
A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation
Wenqi Zhang
Kai Zhao
Peng Li
Xiao Zhu
Yongliang Shen
Yanna Ma
Yingfeng Chen
Weiming Lu
LRM
29
8
0
25 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning
  Perspective
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
30
31
0
13 Jul 2022
A survey of multimodal deep generative models
A survey of multimodal deep generative models
Masahiro Suzuki
Y. Matsuo
SyDa
DRL
57
76
0
05 Jul 2022
Variational Causal Dynamics: Discovering Modular World Models from
  Interventions
Variational Causal Dynamics: Discovering Modular World Models from Interventions
Anson Lei
Bernhard Schölkopf
Ingmar Posner
CML
19
8
0
22 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
50
101
0
19 Jun 2022
CtrlFormer: Learning Transferable State Representation for Visual
  Control via Transformer
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Yao Mu
Shoufa Chen
Mingyu Ding
Jianyu Chen
Runjian Chen
Ping Luo
ViT
23
9
0
17 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
12
34
0
17 Jun 2022
Hybrid Value Estimation for Off-policy Evaluation and Offline
  Reinforcement Learning
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning
Xuefeng Jin
Xu-Hui Liu
Shengyi Jiang
Yang Yu
OffRL
31
4
0
04 Jun 2022
Offline Reinforcement Learning with Causal Structured World Models
Offline Reinforcement Learning with Causal Structured World Models
Zhengbang Zhu
Xiong-Hui Chen
Hong Tian
Kun Zhang
Yang Yu
CML
OffRL
12
16
0
03 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
8
2
0
31 May 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement
  Learning
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Remo Sasso
M. Sabatelli
M. Wiering
49
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in
  World Models
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
26
39
0
27 May 2022
POLTER: Policy Trajectory Ensemble Regularization for Unsupervised
  Reinforcement Learning
POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning
Frederik Schubert
C. Benjamins
Sebastian Dohler
Bodo Rosenhahn
Marius Lindauer
SSL
OffRL
62
4
0
23 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge
  Consolidation
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
29
8
0
22 May 2022
Towards biologically plausible Dreaming and Planning in recurrent
  spiking networks
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
31
7
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
633
0
20 May 2022
Deterministic training of generative autoencoders using invertible
  layers
Deterministic training of generative autoencoders using invertible layers
Gianluigi Silvestri
Daan Roos
L. Ambrogioni
TPM
21
2
0
19 May 2022
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous
  Driving Tasks
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous Driving Tasks
A. Pak
Hemanth Manjunatha
Dimitar Filev
Panagiotis Tsiotras
14
5
0
18 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for
  Improved Sample Efficiency in Continuous Control Tasks
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
41
2
0
18 May 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by
  Reconfigurable Intelligent Surfaces
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
25
88
0
08 May 2022
Fixing Malfunctional Objects With Learned Physical Simulation and
  Functional Prediction
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
Yining Hong
Kaichun Mo
L. Yi
Leonidas J. Guibas
Antonio Torralba
J. Tenenbaum
Chuang Gan
42
5
0
05 May 2022
RLFlow: Optimising Neural Network Subgraph Transformation with World
  Models
RLFlow: Optimising Neural Network Subgraph Transformation with World Models
Sean Parker
Sami Alabed
Eiko Yoneki
16
0
0
03 May 2022
Data-driven control of spatiotemporal chaos with reduced-order neural
  ODE-based models and reinforcement learning
Data-driven control of spatiotemporal chaos with reduced-order neural ODE-based models and reinforcement learning
Kevin Zeng
Alec J. Linot
M. Graham
AI4CE
22
28
0
01 May 2022
TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and
  its Application to Reinforcement Learning
TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning
Konstantin Sozykin
Andrei Chertkov
R. Schutski
Anh-Huy Phan
A. Cichocki
Ivan Oseledets
14
35
0
30 Apr 2022
Predicting Real-time Scientific Experiments Using Transformer models and
  Reinforcement Learning
Predicting Real-time Scientific Experiments Using Transformer models and Reinforcement Learning
J. M. Parrilla-Gutierrez
AI4CE
24
0
0
25 Apr 2022
Deep Reinforcement Learning Using a Low-Dimensional Observation Filter
  for Visual Complex Video Game Playing
Deep Reinforcement Learning Using a Low-Dimensional Observation Filter for Visual Complex Video Game Playing
V. A. Kich
J. C. Jesus
Ricardo B. Grando
A. H. Kolling
Gabriel V. Heisler
R. S. Guerra
OffRL
30
2
0
24 Apr 2022
Learning Sequential Latent Variable Models from Multimodal Time Series
  Data
Learning Sequential Latent Variable Models from Multimodal Time Series Data
Oliver Limoyo
Trevor Ablett
Jonathan Kelly
25
5
0
21 Apr 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning
  from Pixels
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
11
19
0
11 Apr 2022
Gradient-Based Trajectory Optimization With Learned Dynamics
Gradient-Based Trajectory Optimization With Learned Dynamics
Bhavya Sukhija
Nathanael Kohler
Miguel Zamora
Simon Zimmermann
Sebastian Curi
Andreas Krause
Stelian Coros
30
9
0
09 Apr 2022
Temporal Alignment for History Representation in Reinforcement Learning
Temporal Alignment for History Representation in Reinforcement Learning
Aleksandr Ermolov
E. Sangineto
N. Sebe
AI4TS
21
2
0
07 Apr 2022
Learning List-wise Representation in Reinforcement Learning for Ads
  Allocation with Multiple Auxiliary Tasks
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
19
4
0
02 Apr 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Erdun Gao
Kun Zhang
Sara Magliacane
CML
OffRL
47
32
0
30 Mar 2022
NavDreams: Towards Camera-Only RL Navigation Among Humans
NavDreams: Towards Camera-Only RL Navigation Among Humans
Daniel Dugas
Olov Andersson
Roland Siegwart
Jen Jen Chung
VGen
23
12
0
23 Mar 2022
Multi-View Dreaming: Multi-View World Model with Contrastive Learning
Multi-View Dreaming: Multi-View World Model with Contrastive Learning
Akira Kinose
Masashi Okada
Ryogo Okumura
T. Taniguchi
OffRL
21
10
0
15 Mar 2022
Towards Self-Supervised Learning of Global and Object-Centric
  Representations
Towards Self-Supervised Learning of Global and Object-Centric Representations
Federico Baldassarre
Hossein Azizpour
SSL
3DPC
OCL
46
13
0
11 Mar 2022
Tactile-Sensitive NewtonianVAE for High-Accuracy Industrial Connector
  Insertion
Tactile-Sensitive NewtonianVAE for High-Accuracy Industrial Connector Insertion
Ryogo Okumura
Nobuki Nishio
T. Taniguchi
22
8
0
10 Mar 2022
SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement
  Learning
SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning
A. Chester
Michael Dann
Fabio Zambetta
John Thangarajah
11
0
0
09 Mar 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
36
224
0
09 Mar 2022
"If you could see me through my eyes": Predicting Pedestrian Perception
"If you could see me through my eyes": Predicting Pedestrian Perception
Julian Petzold
Mostafa Wahby
F. Stark
Ulrich Behrje
Heiko Hamann
45
3
0
28 Feb 2022
Gradient-free Multi-domain Optimization for Autonomous Systems
Gradient-free Multi-domain Optimization for Autonomous Systems
Hongrui Zheng
Johannes Betz
Rahul Mangharam
14
7
0
28 Feb 2022
Abstraction for Deep Reinforcement Learning
Abstraction for Deep Reinforcement Learning
Murray Shanahan
Melanie Mitchell
OffRL
35
28
0
10 Feb 2022
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep
  RL in Large Networked Systems
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau
Jinke He
M. Spaan
F. Oliehoek
27
4
0
03 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
32
4
0
20 Jan 2022
Bayesian sense of time in biological and artificial brains
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
35
0
0
14 Jan 2022
Previous
123...567...91011
Next