ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.09237
  4. Cited By
Shaping Belief States with Generative Environment Models for RL

Shaping Belief States with Generative Environment Models for RL

21 June 2019
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "Shaping Belief States with Generative Environment Models for RL"

50 / 86 papers shown
Title
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Andrew Levy
A. Allievi
George Konidaris
76
0
0
15 Oct 2024
Reinforcement Learning via Auxiliary Task Distillation
Reinforcement Learning via Auxiliary Task Distillation
Abhinav Harish
Larry Heck
Josiah P. Hanna
Z. Kira
Andrew Szot
42
0
0
24 Jun 2024
Scaling Instructable Agents Across Many Simulated Worlds
Scaling Instructable Agents Across Many Simulated Worlds
Sima Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
115
39
0
13 Mar 2024
Spatially-Aware Transformer for Embodied Agents
Spatially-Aware Transformer for Embodied Agents
Junmo Cho
Jaesik Yoon
Sungjin Ahn
41
0
0
23 Feb 2024
Self-evolving Autoencoder Embedded Q-Network
Self-evolving Autoencoder Embedded Q-Network
Ieee J. Senthilnath Senior Member
Zhen Bangjian Zhou
Wei Ng
Deeksha Aggarwal
Rajdeep Dutta
Ji Wei Yoon
Phyu Aung
Keyu Wu
Ieee Li Fellow
Xiaoli Li
64
1
0
18 Feb 2024
Do Transformer World Models Give Better Policy Gradients?
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. DÓro
Pierre-Luc Bacon
42
4
0
07 Feb 2024
Spatial and Temporal Hierarchy for Autonomous Navigation using Active
  Inference in Minigrid Environment
Spatial and Temporal Hierarchy for Autonomous Navigation using Active Inference in Minigrid Environment
Daria de Tinguy
Toon Van de Maele
Tim Verbelen
Bart Dhoedt
38
6
0
08 Dec 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
4
0
20 Nov 2023
Selective Visual Representations Improve Convergence and Generalization
  for Embodied AI
Selective Visual Representations Improve Convergence and Generalization for Embodied AI
Ainaz Eftekhar
Kuo-Hao Zeng
Jiafei Duan
Ali Farhadi
Aniruddha Kembhavi
Ranjay Krishna
40
13
0
07 Nov 2023
Learning to Navigate from Scratch using World Models and Curiosity: the
  Good, the Bad, and the Ugly
Learning to Navigate from Scratch using World Models and Curiosity: the Good, the Bad, and the Ugly
Daria de Tinguy
Sven Remmery
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
34
0
0
30 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
35
28
0
14 Aug 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
29
3
0
29 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
31
7
0
20 Jun 2023
Representations and Exploration for Deep Reinforcement Learning using
  Singular Value Decomposition
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
24
2
0
01 May 2023
Fast exploration and learning of latent graphs with aliased observations
Fast exploration and learning of latent graphs with aliased observations
Miguel Lazaro-Gredilla
Ishani Deshpande
Siva K. Swaminathan
Meet Dave
Dileep George
28
3
0
13 Mar 2023
The Wasserstein Believer: Learning Belief Updates for Partially
  Observable Environments through Reliable Latent Space Models
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphael Avalos
Florent Delgrange
Ann Nowé
Guillermo A. Pérez
D. Roijers
39
2
0
06 Mar 2023
Graph schemas as abstractions for transfer learning, inference, and
  planning
Graph schemas as abstractions for transfer learning, inference, and planning
J. S. Guntupalli
Rajkumar Vasudeva Raju
Shrinu Kushagra
Carter Wendelken
Daniel P. Sawyer
Ishani Deshpande
Guangyao Zhou
Miguel Lazaro-Gredilla
Dileep George
42
9
0
14 Feb 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward
  Functions for Policy Learning
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
38
5
0
17 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation
  Learning
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
28
70
0
12 Dec 2022
Decentralized cooperative perception for autonomous vehicles: Learning
  to value the unknown
Decentralized cooperative perception for autonomous vehicles: Learning to value the unknown
Maxime Chaveroche
Franck Davoine
V. Berge-Cherfaoui
17
1
0
12 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models
PRISM: Probabilistic Real-Time Inference in Spatial World Models
Atanas Mirchev
Baris Kayalibay
Ahmed Agha
Patrick van der Smagt
Daniel Cremers
Justin Bayer
VGen
31
0
0
06 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents
A General Purpose Supervisory Signal for Embodied Agents
Kunal Pratap Singh
Jordi Salvador
Luca Weihs
Aniruddha Kembhavi
SSL
31
3
0
01 Dec 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
The Benefits of Model-Based Generalization in Reinforcement Learning
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
28
12
0
04 Nov 2022
Towards Versatile Embodied Navigation
Towards Versatile Embodied Navigation
Han Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
60
20
0
30 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
21
21
0
24 Oct 2022
Latent State Marginalization as a Low-cost Approach for Improving
  Exploration
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang
Aaron Courville
Yoshua Bengio
Qinqing Zheng
Amy Zhang
Ricky T. Q. Chen
OOD
38
9
0
03 Oct 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A
  Review
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSL
OffRL
41
11
0
27 Aug 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Deep Hierarchical Planning from Pixels
Deep Hierarchical Planning from Pixels
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
44
93
0
08 Jun 2022
Flow-based Recurrent Belief State Learning for POMDPs
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
45
18
0
23 May 2022
Deterministic training of generative autoencoders using invertible
  layers
Deterministic training of generative autoencoders using invertible layers
Gianluigi Silvestri
Daan Roos
L. Ambrogioni
TPM
21
2
0
19 May 2022
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL
Homanga Bharadhwaj
Mohammad Babaeizadeh
D. Erhan
Sergey Levine
38
31
0
18 Apr 2022
Optimizing Sequential Experimental Design with Deep Reinforcement
  Learning
Optimizing Sequential Experimental Design with Deep Reinforcement Learning
Tom Blau
Edwin V. Bonilla
Iadine Chadès
Amir Dezfouli
BDL
OffRL
27
37
0
02 Feb 2022
Tracking and Planning with Spatial World Models
Tracking and Planning with Spatial World Models
Baris Kayalibay
Atanas Mirchev
Patrick van der Smagt
Justin Bayer
46
2
0
25 Jan 2022
Safe Deep RL in 3D Environments using Human Feedback
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
32
4
0
20 Jan 2022
Towards Disturbance-Free Visual Mobile Manipulation
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni
Kiana Ehsani
Luca Weihs
Jordi Salvador
28
9
0
17 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Tell me why! Explanations support learning relational and causal
  structure
Tell me why! Explanations support learning relational and causal structure
Andrew Kyle Lampinen
Nicholas A. Roy
Ishita Dasgupta
Stephanie C. Y. Chan
Allison C. Tam
...
Chen Yan
Adam Santoro
Neil C. Rabinowitz
Jane X. Wang
Felix Hill
35
45
0
07 Dec 2021
Differentiable Spatial Planning using Transformers
Differentiable Spatial Planning using Transformers
Devendra Singh Chaplot
Deepak Pathak
Jitendra Malik
27
37
0
02 Dec 2021
Attention Approximates Sparse Distributed Memory
Attention Approximates Sparse Distributed Memory
Trenton Bricken
Cengiz Pehlevan
35
34
0
10 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon
  Reasoning
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
37
41
0
04 Nov 2021
Procedural Generalization by Planning with Self-Supervised World Models
Procedural Generalization by Planning with Self-Supervised World Models
Ankesh Anand
Jacob Walker
Yazhe Li
Eszter Vértes
Julian Schrittwieser
Sherjil Ozair
T. Weber
Jessica B. Hamrick
31
31
0
02 Nov 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with
  Prototypical Representations
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng
Ingook Jang
Sungjin Ahn
VLM
29
62
0
27 Oct 2021
Self-Consistent Models and Values
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
26
103
0
11 Oct 2021
Temporally Abstract Partial Models
Temporally Abstract Partial Models
Khimya Khetarpal
Zafarali Ahmed
Gheorghe Comanici
Doina Precup
26
14
0
06 Aug 2021
Structured World Belief for Reinforcement Learning in POMDP
Structured World Belief for Reinforcement Learning in POMDP
Gautam Singh
Skand Peri
Junghyun Kim
Hyunseok Kim
Sungjin Ahn
OCL
27
27
0
19 Jul 2021
Estimating Disentangled Belief about Hidden State and Hidden Task for
  Meta-RL
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
13
4
0
14 May 2021
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable
  Settings
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings
Eltayeb Ahmed
L. Zintgraf
Christian Schroeder de Witt
Nicolas Usunier
SSL
24
0
0
17 Apr 2021
12
Next