Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.09237
Cited By
Shaping Belief States with Generative Environment Models for RL
21 June 2019
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Shaping Belief States with Generative Environment Models for RL"
50 / 86 papers shown
Title
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Andrew Levy
A. Allievi
George Konidaris
76
0
0
15 Oct 2024
Reinforcement Learning via Auxiliary Task Distillation
Abhinav Harish
Larry Heck
Josiah P. Hanna
Z. Kira
Andrew Szot
42
0
0
24 Jun 2024
Scaling Instructable Agents Across Many Simulated Worlds
Sima Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
115
39
0
13 Mar 2024
Spatially-Aware Transformer for Embodied Agents
Junmo Cho
Jaesik Yoon
Sungjin Ahn
41
0
0
23 Feb 2024
Self-evolving Autoencoder Embedded Q-Network
Ieee J. Senthilnath Senior Member
Zhen Bangjian Zhou
Wei Ng
Deeksha Aggarwal
Rajdeep Dutta
Ji Wei Yoon
Phyu Aung
Keyu Wu
Ieee Li Fellow
Xiaoli Li
64
1
0
18 Feb 2024
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. DÓro
Pierre-Luc Bacon
42
4
0
07 Feb 2024
Spatial and Temporal Hierarchy for Autonomous Navigation using Active Inference in Minigrid Environment
Daria de Tinguy
Toon Van de Maele
Tim Verbelen
Bart Dhoedt
38
6
0
08 Dec 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
4
0
20 Nov 2023
Selective Visual Representations Improve Convergence and Generalization for Embodied AI
Ainaz Eftekhar
Kuo-Hao Zeng
Jiafei Duan
Ali Farhadi
Aniruddha Kembhavi
Ranjay Krishna
40
13
0
07 Nov 2023
Learning to Navigate from Scratch using World Models and Curiosity: the Good, the Bad, and the Ugly
Daria de Tinguy
Sven Remmery
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
34
0
0
30 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
35
28
0
14 Aug 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
29
3
0
29 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
31
7
0
20 Jun 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
24
2
0
01 May 2023
Fast exploration and learning of latent graphs with aliased observations
Miguel Lazaro-Gredilla
Ishani Deshpande
Siva K. Swaminathan
Meet Dave
Dileep George
28
3
0
13 Mar 2023
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphael Avalos
Florent Delgrange
Ann Nowé
Guillermo A. Pérez
D. Roijers
39
2
0
06 Mar 2023
Graph schemas as abstractions for transfer learning, inference, and planning
J. S. Guntupalli
Rajkumar Vasudeva Raju
Shrinu Kushagra
Carter Wendelken
Daniel P. Sawyer
Ishani Deshpande
Guangyao Zhou
Miguel Lazaro-Gredilla
Dileep George
42
9
0
14 Feb 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
38
5
0
17 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
28
70
0
12 Dec 2022
Decentralized cooperative perception for autonomous vehicles: Learning to value the unknown
Maxime Chaveroche
Franck Davoine
V. Berge-Cherfaoui
17
1
0
12 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models
Atanas Mirchev
Baris Kayalibay
Ahmed Agha
Patrick van der Smagt
Daniel Cremers
Justin Bayer
VGen
31
0
0
06 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents
Kunal Pratap Singh
Jordi Salvador
Luca Weihs
Aniruddha Kembhavi
SSL
31
3
0
01 Dec 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
28
12
0
04 Nov 2022
Towards Versatile Embodied Navigation
H. Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
58
20
0
30 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
21
21
0
24 Oct 2022
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang
Aaron Courville
Yoshua Bengio
Qinqing Zheng
Amy Zhang
Ricky T. Q. Chen
OOD
38
9
0
03 Oct 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSL
OffRL
36
11
0
27 Aug 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Deep Hierarchical Planning from Pixels
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
42
93
0
08 Jun 2022
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
45
18
0
23 May 2022
Deterministic training of generative autoencoders using invertible layers
Gianluigi Silvestri
Daan Roos
L. Ambrogioni
TPM
21
2
0
19 May 2022
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL
Homanga Bharadhwaj
Mohammad Babaeizadeh
D. Erhan
Sergey Levine
38
31
0
18 Apr 2022
Optimizing Sequential Experimental Design with Deep Reinforcement Learning
Tom Blau
Edwin V. Bonilla
Iadine Chadès
Amir Dezfouli
BDL
OffRL
27
37
0
02 Feb 2022
Tracking and Planning with Spatial World Models
Baris Kayalibay
Atanas Mirchev
Patrick van der Smagt
Justin Bayer
46
2
0
25 Jan 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
32
4
0
20 Jan 2022
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni
Kiana Ehsani
Luca Weihs
Jordi Salvador
28
9
0
17 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Tell me why! Explanations support learning relational and causal structure
Andrew Kyle Lampinen
Nicholas A. Roy
Ishita Dasgupta
Stephanie C. Y. Chan
Allison C. Tam
...
Chen Yan
Adam Santoro
Neil C. Rabinowitz
Jane X. Wang
Felix Hill
35
45
0
07 Dec 2021
Differentiable Spatial Planning using Transformers
Devendra Singh Chaplot
Deepak Pathak
Jitendra Malik
27
37
0
02 Dec 2021
Attention Approximates Sparse Distributed Memory
Trenton Bricken
Cengiz Pehlevan
35
34
0
10 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
37
41
0
04 Nov 2021
Procedural Generalization by Planning with Self-Supervised World Models
Ankesh Anand
Jacob Walker
Yazhe Li
Eszter Vértes
Julian Schrittwieser
Sherjil Ozair
T. Weber
Jessica B. Hamrick
31
31
0
02 Nov 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng
Ingook Jang
Sungjin Ahn
VLM
29
62
0
27 Oct 2021
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
26
103
0
11 Oct 2021
Temporally Abstract Partial Models
Khimya Khetarpal
Zafarali Ahmed
Gheorghe Comanici
Doina Precup
26
14
0
06 Aug 2021
Structured World Belief for Reinforcement Learning in POMDP
Gautam Singh
Skand Peri
Junghyun Kim
Hyunseok Kim
Sungjin Ahn
OCL
27
27
0
19 Jul 2021
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
11
4
0
14 May 2021
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings
Eltayeb Ahmed
L. Zintgraf
Christian Schroeder de Witt
Nicolas Usunier
SSL
24
0
0
17 Apr 2021
1
2
Next