Unifying Count-Based Exploration and Intrinsic Motivation

6 June 2016

Papers citing "Unifying Count-Based Exploration and Intrinsic Motivation"

50 / 333 papers shown

Title, authors, topic tags (where present), three counts, date
Reward-Mixing MDPs with a Few Latent Contexts are Learnable Jeongyeol Kwon Yonathan Efroni C. Caramanis Shie Mannor 31 5 0 05 Oct 2022
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation Julian Alverio Boris Katz Andrei Barbu 35 0 0 05 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations Fengdi Che Xiru Zhu Doina Precup D. Meger Gregory Dudek 19 2 0 01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States C. Banerjee Zhiyong Chen N. Noman 26 3 0 01 Oct 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training Gang Chen Victoria Huang OffRL 40 0 0 29 Sep 2022
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning Firas Jarboui Ahmed Akakzia 19 0 0 26 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 37 35 0 19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning Mingqi Yuan Bo Li Xin Jin Wenjun Zeng 36 12 0 19 Sep 2022
Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization Leo Feng Padideh Nouri Aneri Muni Yoshua Bengio Pierre-Luc Bacon 116 4 0 13 Sep 2022
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs Zixuan Dong Che Wang Keith Ross 33 3 0 07 Sep 2022
Go-Explore Complex 3D Game Environments for Automated Reachability Testing Cong Lu Raluca Georgescu J. Verwey 27 7 0 01 Sep 2022
Cell-Free Latent Go-Explore Quentin Gallouedec Emmanuel Dellandrea 14 1 0 31 Aug 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning C. Steinparz Thomas Schmied Fabian Paischer Marius-Constantin Dinu Vihang Patil Angela Bitto-Nemling Hamid Eghbalzadeh Sepp Hochreiter CLL 29 11 0 12 Jul 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses Eliza Kosoy David M. Chan Adrian Liu Jasmine Collins Bryanna Kaufmann Sandy Han Huang Jessica B. Hamrick John F. Canny Nan Rosemary Ke Alison Gopnik CML AI4CE 28 18 0 16 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Xinran Liang Katherine Shu Kimin Lee Pieter Abbeel 21 58 0 24 May 2022
Nuclear Norm Maximization Based Curiosity-Driven Learning Chao Chen Zijian Gao Kele Xu Sen Yang Yiying Li Bo Ding Dawei Feng Huaimin Wang 166 5 0 21 May 2022
Image Augmentation Based Momentum Memory Intrinsic Reward for Sparse Reward Visual Scenes Zheng Fang Biao Zhao Guizhong Liu 16 2 0 19 May 2022
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments Ryan Sullivan J. K. Terry Benjamin Black John P. Dickerson 27 8 0 14 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language Iou-Jen Liu Xingdi Yuan Marc-Alexandre Côté Pierre-Yves Oudeyer A. Schwing RALM 21 12 0 12 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters Xue Bin Peng Yunrong Guo L. Halper Sergey Levine Sanja Fidler 28 15 0 04 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning Chenyu Sun Hangwei Qian Chunyan Miao OffRL 32 12 0 02 May 2022
Exploration in Deep Reinforcement Learning: A Survey Pawel Ladosz Lilian Weng Minwoo Kim H. Oh OffRL 26 324 0 02 May 2022
Discovering Intrinsic Reward with Contrastive Random Walk Zixuan Pan Zihao Wei Yidong Huang Aditya Gupta 32 0 0 23 Apr 2022
Divide & Conquer Imitation Learning Alexandre Chenu Nicolas Perrin-Gilbert Olivier Sigaud 16 5 0 15 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained Representations Allison C. Tam Neil C. Rabinowitz Andrew Kyle Lampinen Nicholas A. Roy Stephanie C. Y. Chan D. Strouse Jane X. Wang Andrea Banino Felix Hill LM&Ro 39 67 0 08 Apr 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale Ram Ramrakhya Eric Undersander Dhruv Batra Abhishek Das LM&Ro 39 109 0 07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments Yafei Hu Junyi Geng Chen Wang John Keller Sebastian Scherer OffRL 28 15 0 07 Apr 2022
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects Yujie Lu Jianren Wang Vikash Kumar 31 4 0 31 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 18 118 0 25 Mar 2022
Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act Alexis Jacq Johan Ferret Olivier Pietquin M. Geist 32 9 0 16 Mar 2022
Zipfian environments for Reinforcement Learning Stephanie C. Y. Chan Andrew Kyle Lampinen Pierre Harvey Richemond Felix Hill OffRL 15 15 0 15 Mar 2022
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning Mingqi Yuan Man-On Pun Dong Wang 24 23 0 08 Mar 2022
On Credit Assignment in Hierarchical Reinforcement Learning Joery A. de Vries Thomas M. Moerland Aske Plaat 13 0 0 07 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data Cultural General Intelligence Team Avishkar Bhoopchand Bethanie Brownfield Adrian Collister Agustin Dal Lago ... Alex Platonov Evan Senter Sukhdeep Singh Alexander Zacherl Lei M. Zhang VLM 46 11 0 01 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share? Alain Andres Esther Villar-Rodriguez Javier Del Ser 22 9 0 24 Feb 2022
FedCAT: Towards Accurate Federated Learning via Device Concatenation Ming Hu Tian Liu Zhiwei Ling Zhihao Yue Mingsong Chen FedML 24 1 0 23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 38 9 0 23 Feb 2022
ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints Dhruv Shah Sergey Levine 132 66 0 23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and Computational Models Eliza Kosoy Adrian Liu Jasmine Collins David M. Chan Jessica B. Hamrick Nan Rosemary Ke Sandy H Huang Bryanna Kaufmann John F. Canny Alison Gopnik CML 22 9 0 21 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions Robert Meier Asier Mujika 37 7 0 16 Feb 2022
Active Audio-Visual Separation of Dynamic Sound Sources Sagnik Majumder Kristen Grauman 27 21 0 02 Feb 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach Xuezhou Zhang Yuda Song Masatoshi Uehara Mengdi Wang Alekh Agarwal Wen Sun OffRL 29 57 0 31 Jan 2022
Generative Adversarial Exploration for Reinforcement Learning Weijun Hong Menghui Zhu Minghuan Liu Weinan Zhang Ming Zhou Yong Yu Peng Sun OnRL 39 7 0 27 Jan 2022
Learning to Act with Affordance-Aware Multimodal Neural SLAM Zhiwei Jia Kaixiang Lin Yizhou Zhao Qiaozi Gao Govind Thattai Gaurav Sukhatme LM&Ro 31 15 0 24 Jan 2022
Physical Derivatives: Computing policy gradients by physical forward-propagation Arash Mehrjou Ashkan Soleymani Stefan Bauer Bernhard Schölkopf 38 0 0 15 Jan 2022
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Trevor Ablett Bryan Chan Jonathan Kelly 31 4 0 16 Dec 2021
Programmatic Reward Design by Example Weichao Zhou Wenchao Li 34 15 0 14 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos Eszter Vértes Zita Marinho Gregory Farquhar Diana Borsa A. Friesen Feryal M. P. Behbahani Tom Schaul André Barreto Simon Osindero 44 7 0 08 Dec 2021
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration Simone Parisi Victoria Dean Deepak Pathak Abhinav Gupta LM&Ro 40 50 0 25 Nov 2021