ResearchTrend.AI
Recurrent World Models Facilitate Policy Evolution

4 September 2018
David R Ha
Jürgen Schmidhuber
    SyDa
    TPM

Papers citing "Recurrent World Models Facilitate Policy Evolution"

50 / 504 papers shown
UniWorld: Autonomous Driving Pre-training via World Models
Chen Min
Dawei Zhao
Liang Xiao
Yiming Nie
Bin Dai
VGen
39
22
0
14 Aug 2023
Bayesian Inverse Transition Learning for Offline Settings
Leo Benac
S. Parbhoo
Finale Doshi-Velez
OffRL
16
0
0
09 Aug 2023
World-Model-Based Control for Industrial box-packing of Multiple Objects using NewtonianVAE
Yusuke Kato
Ryogo Okumura
T. Taniguchi
DRL
27
1
0
04 Aug 2023
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
40
20
0
31 Jul 2023
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&Ro
SyDa
38
51
0
31 Jul 2023
Thinker: Learning to Plan and Act
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAG
OffRL
LRM
30
5
0
27 Jul 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
Alexander W. Goodall
Francesco Belardinelli
16
0
0
27 Jul 2023
Facing Off World Model Backbones: RNNs, Transformers, and S4
Fei Deng
Junyeong Park
Sungjin Ahn
32
24
0
05 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
23
24
0
05 Jul 2023
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li
Varun Belagali
Jinghuan Shang
Michael S. Ryoo
40
28
0
04 Jul 2023
End-to-end Autonomous Driving: Challenges and Frontiers
Li Chen
Peng Wu
Kashyap Chitta
Bernhard Jaeger
Andreas Geiger
Hongyang Li
3DV
64
264
0
29 Jun 2023
Curious Replay for Model-based Adaptation
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
23
11
0
28 Jun 2023
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
H.J. Terry Suh
Glen Chou
Hongkai Dai
Lujie Yang
Abhishek Gupta
Russ Tedrake
DiffM
OffRL
39
7
0
24 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li
Peizhong Ju
Ness B. Shroff
31
0
0
22 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
31
7
0
20 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
29
10
0
15 Jun 2023
Reward-Free Curricula for Training Robust World Models
Marc Rigter
Minqi Jiang
Ingmar Posner
VLM
OffRL
39
6
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
34
1
0
15 Jun 2023
Approximate information state based convergence analysis of recurrent Q-learning
Erfan Seyedsalehi
N. Akbarzadeh
Amit Sinha
Aditya Mahajan
27
6
0
09 Jun 2023
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
34
2
0
09 Jun 2023
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
31
9
0
01 Jun 2023
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
33
4
0
01 Jun 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
34
25
0
29 May 2023
Reasoning with Language Model is Planning with World Model
Shibo Hao
Yi Gu
Haodi Ma
Joshua Jiahua Hong
Zhen Wang
D. Wang
Zhiting Hu
ReLM
LRM
LLMAG
68
519
0
24 May 2023
KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning World Models in Autonomous Driving Tasks
Hemanth Manjunatha
A. Pak
Dimitar Filev
Panagiotis Tsiotras
30
5
0
24 May 2023
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
24
2
0
23 May 2023
Understanding the World to Solve Social Dilemmas Using Multi-Agent Reinforcement Learning
Manuel Rios
Nicanor Quijano
Luis Felipe Giraldo
31
1
0
19 May 2023
A Generalist Dynamics Model for Control
Ingmar Schubert
Jingwei Zhang
Jake Bruce
Sarah Bechtle
Emilio Parisotto
Martin Riedmiller
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
N. Heess
AI4CE
41
28
0
18 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
36
94
0
18 May 2023
Posterior Sampling for Deep Reinforcement Learning
Remo Sasso
Michelangelo Conserva
Paulo E. Rauber
OffRL
BDL
37
6
0
30 Apr 2023
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation
Guoliang He
Sean Parker
Eiko Yoneki
24
2
0
28 Apr 2023
Causal Semantic Communication for Digital Twins: A Generalizable Imitation Learning Approach
Christo Kurisummoottil Thomas
Walid Saad
Yong Xiao
37
20
0
25 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
27
2
0
21 Apr 2023
Observer-Feedback-Feedforward Controller Structures in Reinforcement Learning
Ruoqing Zhang
Per Mattsson
T. Wigren
27
0
0
20 Apr 2023
Neuromorphic computing for attitude estimation onboard quadrotors
S. Stroobants
Julien Dupeyroux
Guido C. H. E. de Croon
38
4
0
18 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
13
0
0
14 Apr 2023
Habits and goals in synergy: a variational Bayesian framework for behavior
Dongqi Han
Kenji Doya
Dongsheng Li
Jun Tani
BDL
28
220
0
11 Apr 2023
End-to-end Manipulator Calligraphy Planning via Variational Imitation Learning
Fangping Xie
P. Meur
C. Fernando
14
1
0
06 Apr 2023
Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions
Chen Feng Tsai
Xiaochen Zhou
Sierra S. Liu
Jing Li
Mo Yu
Hongyuan Mei
LLMAG
ELM
AI4MH
LM&MA
11
30
0
06 Apr 2023
Self-Supervised Multimodal Learning: A Survey
Yongshuo Zong
Oisin Mac Aodha
Timothy M. Hospedales
SSL
24
44
0
31 Mar 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Xiangming Zhu
Yitao Zheng
Yunbo Wang
Xiaokang Yang
31
0
0
27 Mar 2023
Persistent Nature: A Generative Model of Unbounded 3D Worlds
Lucy Chai
Richard Tucker
Zhengqi Li
Phillip Isola
Noah Snavely
VGen
26
30
0
23 Mar 2023
InCrowdFormer: On-Ground Pedestrian World Model From Egocentric Views
Mai Nishimura
S. Nobuhara
Ko Nishino
EgoV
ViT
42
0
0
16 Mar 2023
Reliable Beamforming at Terahertz Bands: Are Causal Representations the Way Forward?
Christo Kurisummoottil Thomas
Walid Saad
29
4
0
14 Mar 2023
Fast exploration and learning of latent graphs with aliased observations
Miguel Lazaro-Gredilla
Ishani Deshpande
Siva K. Swaminathan
Meet Dave
Dileep George
23
3
0
13 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
27
71
0
13 Mar 2023
Predictive Experience Replay for Continual Visual Control and Forecasting
Wendong Zhang
Geng Chen
Xiangming Zhu
Siyu Gao
Yunbo Wang
Xiaokang Yang
CLL
55
3
0
12 Mar 2023
Deep Occupancy-Predictive Representations for Autonomous Driving
Eivind Meyer
Lars Frederik Peiss
Matthias Althoff
37
3
0
07 Mar 2023
TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
82
42
0
07 Mar 2023