Shaping Belief States with Generative Environment Models for RL

21 June 2019

Karol Gregor

Danilo Jimenez Rezende

Papers citing "Shaping Belief States with Generative Environment Models for RL"

50 / 86 papers shown

Title
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator Andrew Levy A. Allievi George Konidaris 76 0 0 15 Oct 2024
Reinforcement Learning via Auxiliary Task Distillation Abhinav Harish Larry Heck Josiah P. Hanna Z. Kira Andrew Szot 42 0 0 24 Jun 2024
Scaling Instructable Agents Across Many Simulated Worlds Sima Team Maria Abi Raad Arun Ahuja Catarina Barros F. Besse ... Daan Wierstra Duncan Williams Nathaniel Wong Sarah York Nick Young LM&Ro 115 39 0 13 Mar 2024
Spatially-Aware Transformer for Embodied Agents Junmo Cho Jaesik Yoon Sungjin Ahn 41 0 0 23 Feb 2024
Self-evolving Autoencoder Embedded Q-Network Ieee J. Senthilnath Senior Member Zhen Bangjian Zhou Wei Ng Deeksha Aggarwal Rajdeep Dutta Ji Wei Yoon Phyu Aung Keyu Wu Ieee Li Fellow Xiaoli Li 64 1 0 18 Feb 2024
Do Transformer World Models Give Better Policy Gradients? Michel Ma Tianwei Ni Clement Gehring P. DÓro Pierre-Luc Bacon 42 4 0 07 Feb 2024
Spatial and Temporal Hierarchy for Autonomous Navigation using Active Inference in Minigrid Environment Daria de Tinguy Toon Van de Maele Tim Verbelen Bart Dhoedt 38 6 0 08 Dec 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning Hongming Zhang Tongzheng Ren Chenjun Xiao Dale Schuurmans Bo Dai 45 4 0 20 Nov 2023
Selective Visual Representations Improve Convergence and Generalization for Embodied AI Ainaz Eftekhar Kuo-Hao Zeng Jiafei Duan Ali Farhadi Aniruddha Kembhavi Ranjay Krishna 40 13 0 07 Nov 2023
Learning to Navigate from Scratch using World Models and Curiosity: the Good, the Bad, and the Ugly Daria de Tinguy Sven Remmery Pietro Mazzaglia Tim Verbelen Bart Dhoedt 34 0 0 30 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation Hanqing Wang Wei Liang Luc Van Gool Wenguan Wang LM&Ro 35 28 0 14 Aug 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis Alexander Meulemans Simon Schug Seijin Kobayashi Nathaniel D. Daw Gregory Wayne 29 3 0 29 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL Gaspard Lambrechts Adrien Bolland D. Ernst 31 7 0 20 Jun 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition Yash Chandak S. Thakoor Z. Guo Yunhao Tang Rémi Munos Will Dabney Diana Borsa 24 2 0 01 May 2023
Fast exploration and learning of latent graphs with aliased observations Miguel Lazaro-Gredilla Ishani Deshpande Siva K. Swaminathan Meet Dave Dileep George 28 3 0 13 Mar 2023
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models Raphael Avalos Florent Delgrange Ann Nowé Guillermo A. Pérez D. Roijers 39 2 0 06 Mar 2023
Graph schemas as abstractions for transfer learning, inference, and planning J. S. Guntupalli Rajkumar Vasudeva Raju Shrinu Kushagra Carter Wendelken Daniel P. Sawyer Ishani Deshpande Guangyao Zhou Miguel Lazaro-Gredilla Dileep George 42 9 0 14 Feb 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning Kun-Yen Huang E. Hu Dinesh Jayaraman OffRL 38 5 0 17 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning Zhao Mandi Homanga Bharadhwaj Vincent Moens Shuran Song Aravind Rajeswaran Vikash Kumar LM&Ro 28 70 0 12 Dec 2022
Decentralized cooperative perception for autonomous vehicles: Learning to value the unknown Maxime Chaveroche Franck Davoine V. Berge-Cherfaoui 17 1 0 12 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models Atanas Mirchev Baris Kayalibay Ahmed Agha Patrick van der Smagt Daniel Cremers Justin Bayer VGen 31 0 0 06 Dec 2022
A General Purpose Supervisory Signal for Embodied Agents Kunal Pratap Singh Jordi Salvador Luca Weihs Aniruddha Kembhavi SSL 31 3 0 01 Dec 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments Daniel Jarrett Corentin Tallec Florent Altché Thomas Mesnard Rémi Munos Michal Valko 48 5 0 18 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning K. Young Aditya A. Ramesh Louis Kirsch Jürgen Schmidhuber OffRL 28 12 0 04 Nov 2022
Towards Versatile Embodied Navigation H. Wang Wei Liang Luc Van Gool Wenguan Wang LM&Ro 58 20 0 30 Oct 2022
Evaluating Long-Term Memory in 3D Mazes J. Pašukonis Timothy Lillicrap Danijar Hafner 3DV 21 21 0 24 Oct 2022
Latent State Marginalization as a Low-cost Approach for Improving Exploration Dinghuai Zhang Aaron Courville Yoshua Bengio Qinqing Zheng Amy Zhang Ricky T. Q. Chen OOD 38 9 0 03 Oct 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review N. Botteghi M. Poel C. Brune SSL OffRL 36 11 0 27 Aug 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Deep Hierarchical Planning from Pixels Danijar Hafner Kuang-Huei Lee Ian S. Fischer Pieter Abbeel 42 93 0 08 Jun 2022
Flow-based Recurrent Belief State Learning for POMDPs Xiaoyu Chen Yao Mu Ping Luo Sheng Li Jianyu Chen 45 18 0 23 May 2022
Deterministic training of generative autoencoders using invertible layers Gianluigi Silvestri Daan Roos L. Ambrogioni TPM 21 2 0 19 May 2022
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL Homanga Bharadhwaj Mohammad Babaeizadeh D. Erhan Sergey Levine 38 31 0 18 Apr 2022
Optimizing Sequential Experimental Design with Deep Reinforcement Learning Tom Blau Edwin V. Bonilla Iadine Chadès Amir Dezfouli BDL OffRL 27 37 0 02 Feb 2022
Tracking and Planning with Spatial World Models Baris Kayalibay Atanas Mirchev Patrick van der Smagt Justin Bayer 46 2 0 25 Jan 2022
Safe Deep RL in 3D Environments using Human Feedback Matthew Rahtz Vikrant Varma Ramana Kumar Zachary Kenton Shane Legg Jan Leike 32 4 0 20 Jan 2022
Towards Disturbance-Free Visual Mobile Manipulation Tianwei Ni Kiana Ehsani Luca Weihs Jordi Salvador 28 9 0 17 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos Eszter Vértes Zita Marinho Gregory Farquhar Diana Borsa A. Friesen Feryal M. P. Behbahani Tom Schaul André Barreto Simon Osindero 44 7 0 08 Dec 2021
Tell me why! Explanations support learning relational and causal structure Andrew Kyle Lampinen Nicholas A. Roy Ishita Dasgupta Stephanie C. Y. Chan Allison C. Tam ... Chen Yan Adam Santoro Neil C. Rabinowitz Jane X. Wang Felix Hill 35 45 0 07 Dec 2021
Differentiable Spatial Planning using Transformers Devendra Singh Chaplot Deepak Pathak Jitendra Malik 27 37 0 02 Dec 2021
Attention Approximates Sparse Distributed Memory Trenton Bricken Cengiz Pehlevan 35 34 0 10 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Dhruv Shah Peng Xu Yao Lu Ted Xiao Alexander Toshev Sergey Levine Brian Ichter OffRL 37 41 0 04 Nov 2021
Procedural Generalization by Planning with Self-Supervised World Models Ankesh Anand Jacob Walker Yazhe Li Eszter Vértes Julian Schrittwieser Sherjil Ozair T. Weber Jessica B. Hamrick 31 31 0 02 Nov 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations Fei Deng Ingook Jang Sungjin Ahn VLM 29 62 0 27 Oct 2021
Self-Consistent Models and Values Roy Miles Kate Baumli Zita Marinho Angelos Filos Matteo Hessel Hado van Hasselt David Silver 38 8 0 25 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs Tianwei Ni Benjamin Eysenbach Ruslan Salakhutdinov 26 103 0 11 Oct 2021
Temporally Abstract Partial Models Khimya Khetarpal Zafarali Ahmed Gheorghe Comanici Doina Precup 26 14 0 06 Aug 2021
Structured World Belief for Reinforcement Learning in POMDP Gautam Singh Skand Peri Junghyun Kim Hyunseok Kim Sungjin Ahn OCL 27 27 0 19 Jul 2021
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL K. Akuzawa Yusuke Iwasawa Y. Matsuo 11 4 0 14 May 2021
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings Eltayeb Ahmed L. Zintgraf Christian Schroeder de Witt Nicolas Usunier SSL 24 0 0 17 Apr 2021