ResearchTrend.AI
Recurrent World Models Facilitate Policy Evolution

4 September 2018
David R Ha
Jürgen Schmidhuber
    SyDa
    TPM

Papers citing "Recurrent World Models Facilitate Policy Evolution"

50 / 505 papers shown
Learning of feature points without additional supervision improves reinforcement learning from images
Rinu Boney
Alexander Ilin
Arno Solin
SSL
20
2
0
15 Jun 2021
Temporal Predictive Coding For Model-Based Planning In Latent Space
Tung D. Nguyen
Rui Shu
Tu Pham
Hung Bui
Stefano Ermon
OffRL
32
56
0
14 Jun 2021
Vector Quantized Models for Planning
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
16
49
0
08 Jun 2021
Detecting and Adapting to Novelty in Games
Xiangyu Peng
Jonathan C. Balloch
Mark O. Riedl
TTA
13
10
0
04 Jun 2021
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Mingde Zhao
Zhen Liu
Sitao Luan
Shuyuan Zhang
Doina Precup
Yoshua Bengio
47
37
0
03 Jun 2021
A unified view of likelihood ratio and reparameterization gradients
Paavo Parmas
Masashi Sugiyama
22
9
0
31 May 2021
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning
M. Tashman
John Hoffman
Jiayi Xie
Feng Ye
Atefeh Morsali
Lee Winikor
Rouzbeh Gerami
OffRL
29
0
0
21 May 2021
Generic Itemset Mining Based on Reinforcement Learning
Kazuma Fujioka
Kimiaki Shirahama
6
3
0
17 May 2021
A Framework of Explanation Generation toward Reliable Autonomous Robots
Tatsuya Sakai
Kazuki Miyazawa
Takato Horii
Takayuki Nagai
22
8
0
06 May 2021
Explainable Autonomous Robots: A Survey and Perspective
Tatsuya Sakai
Takayuki Nagai
20
67
0
06 May 2021
Data-Efficient Reinforcement Learning for Malaria Control
Lixin Zou
Long Xia
Linfang Hou
Xiangyu Zhao
Dawei Yin
OffRL
14
7
0
04 May 2021
Learning to drive from a world on rails
Dian Chen
V. Koltun
Philipp Krahenbuhl
98
116
0
03 May 2021
DriveGAN: Towards a Controllable High-Quality Neural Simulation
S. Kim
Jonah Philion
Antonio Torralba
Sanja Fidler
29
109
0
30 Apr 2021
Capability Iteration Network for Robot Path Planning
Buqing Nie
Yue Gao
Yi Mei
Feng Gao
3DV
16
7
0
29 Apr 2021
Comparing Correspondences: Video Prediction with Correspondence-wise Losses
Daniel Geng
Max Hamilton
Andrew Owens
3DH
32
16
0
19 Apr 2021
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings
Eltayeb Ahmed
L. Zintgraf
Christian Schroeder de Witt
Nicolas Usunier
SSL
24
0
0
17 Apr 2021
Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
Wonkwang Lee
Whie Jung
Han Zhang
Ting Chen
Jing Yu Koh
Thomas E. Huang
Hyungsuk Yoon
Honglak Lee
Seunghoon Hong
32
29
0
14 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
24
66
0
13 Apr 2021
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
Philip J. Ball
Cong Lu
Jack Parker-Holder
Stephen J. Roberts
OffRL
32
40
0
12 Apr 2021
Causal Reasoning in Simulation for Structure and Transfer Learning of Robot Manipulation Policies
Timothy E. Lee
Jialiang Zhao
A. Sawhney
Siddharth Girdhar
Oliver Kroemer
CML
23
32
0
31 Mar 2021
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
Yunbo Wang
Haixu Wu
Jianjin Zhang
Zhifeng Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
28
380
0
17 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
33
77
0
11 Mar 2021
Understanding the Origin of Information-Seeking Exploration in Probabilistic Objectives for Control
Beren Millidge
A. Seth
Christopher L. Buckley
31
11
0
11 Mar 2021
Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing
Axel Brunnbauer
Luigi Berducci
Andreas Brandstätter
Mathias Lechner
Ramin Hasani
Daniela Rus
Radu Grosu
LM&Ro
38
38
0
08 Mar 2021
Learning a State Representation and Navigation in Cluttered and Dynamic Environments
David Hoeller
Lorenz Wellhausen
Farbod Farshidian
Marco Hutter
SSL
22
72
0
07 Mar 2021
Convergence Rate of the (1+1)-Evolution Strategy with Success-Based Step-Size Adaptation on Convex Quadratic Functions
Daiki Morinaga
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
11
8
0
02 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
38
25
0
24 Feb 2021
Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
Lucas Liebenwein
Ryan M Sander
S. Karaman
Daniela Rus
BDL
22
37
0
19 Feb 2021
Learning Memory-Dependent Continuous Control from Demonstrations
Siqing Hou
Dongqi Han
Jun Tani
16
0
0
18 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
25
39
0
16 Feb 2021
Planning and Learning Using Adaptive Entropy Tree Search
Piotr Kozakowski
Mikolaj Pacek
Piotr Miłoś
19
2
0
12 Feb 2021
Machine Learning for Mechanical Ventilation Control
Daniel Suo
Naman Agarwal
Wenhan Xia
Xinyi Chen
Udaya Ghai
...
J. LaChance
Tom Zajdel
Manuel Schottdorf
Daniel J. Cohen
Elad Hazan
OOD
AI4CE
54
10
0
12 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
26
42
0
10 Feb 2021
Environment Predictive Coding for Embodied Agents
Santhosh Kumar Ramakrishnan
Tushar Nagarajan
Ziad Al-Halah
Kristen Grauman
8
14
0
03 Feb 2021
Evaluating the Interpretability of Generative Models by Interactive Reconstruction
A. Ross
Nina Chen
Elisa Zhao Hang
Elena L. Glassman
Finale Doshi-Velez
105
49
0
02 Feb 2021
Meta-Reinforcement Learning for Adaptive Motor Control in Changing Robot Dynamics and Environments
Timothée Anne
Jack Wilkinson
Zhibin Li
26
1
0
19 Jan 2021
Causal World Models by Unsupervised Deconfounding of Physical Dynamics
Minne Li
Girish A. Koushik
Furui Liu
Xu Chen
Zhitang Chen
Jun Wang
SyDa
CML
33
12
0
28 Dec 2020
Hierarchical principles of embodied reinforcement learning: A review
Manfred Eppe
Christian Gumbsch
Matthias Kerzel
Phuong D. H. Nguyen
Martin Volker Butz
S. Wermter
39
9
0
18 Dec 2020
Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting Agent
Peter Schaldenbrand
Jean Oh
14
36
0
18 Dec 2020
Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling
D. Klotz
Frederik Kratzert
M. Gauch
A. Sampson
Günter Klambauer
Sepp Hochreiter
G. Nearing
BDL
UQCV
18
106
0
15 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments
Daniel Dugas
Juan I. Nieto
Roland Siegwart
Jen Jen Chung
SSL
24
51
0
08 Dec 2020
Deep Learning and the Global Workspace Theory
R. V. Rullen
Ryota Kanai
45
66
0
04 Dec 2020
Planning from Pixels using Inverse Dynamics Models
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
BDL
12
41
0
04 Dec 2020
Detection of False-Reading Attacks in the AMI Net-Metering System
Mahmoud M. Badr
Mohamed I. Ibrahem
Mohamed Mahmoud
M. Fouda
Waleed S. Alasmary
41
3
0
02 Dec 2020
World Model as a Graph: Learning Latent Landmarks for Planning
Lunjun Zhang
Ge Yang
Bradly C. Stadie
DRL
20
73
0
25 Nov 2020
Generative Adversarial Simulator
Jonathan Raiman
GAN
13
0
0
23 Nov 2020
Distilling a Hierarchical Policy for Planning and Control via Representation and Reinforcement Learning
Jung-Su Ha
Young-Jin Park
Hyeok-Joo Chae
Soon-Seo Park
Han-Lim Choi
30
3
0
16 Nov 2020
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
27
65
0
08 Nov 2020
Privacy-Preserving and Efficient Data Collection Scheme for AMI Networks Using Deep Learning
Mohamed I. Ibrahem
Mohamed Mahmoud
M. Fouda
F. Alsolami
Waleed S. Alasmary
Xuemin Shen
31
28
0
07 Nov 2020