Provably Efficient Maximum Entropy Exploration

6 December 2018

Papers citing "Provably Efficient Maximum Entropy Exploration"

50 / 76 papers shown

Title
Online Episodic Convex Reinforcement Learning B. Moreno Khaled Eldowa Pierre Gaillard Margaux Brégère Nadia Oudjane OffRL 29 0 0 12 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story Vincenzo De Paola Riccardo Zamboni Mirco Mutti Marcello Restelli 19 0 0 02 May 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning Wesley A. Suttle A. Suresh Carlos Nieto-Granda OffRL 97 0 0 06 Feb 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems Se-Wook Yoo Seung-Woo Seo 55 0 0 30 Jan 2025
NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations Myunsoo Kim Hayeong Lee Seong-Woong Shim JunHo Seo Byung-Jun Lee LLMAG 37 0 0 22 Jan 2025
Autoregressive Action Sequence Learning for Robotic Manipulation Xinyu Zhang Yuhan Liu Haonan Chang Liam Schramm Abdeslam Boularias 41 10 0 04 Oct 2024
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference Qining Zhang Lei Ying OffRL 37 2 0 25 Sep 2024
Random Latent Exploration for Deep Reinforcement Learning Srinath Mahankali Zhang-Wei Hong Ayush Sekhari Alexander Rakhlin Pulkit Agrawal 33 3 0 18 Jul 2024
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods Ric De Santi Manish Prajapat Andreas Krause 36 3 0 13 Jul 2024
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization Liam Schramm Abdeslam Boularias 28 1 0 07 Jul 2024
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang Honghao Wei Lei Ying OffRL 67 1 0 11 Jun 2024
MetaCURL: Non-stationary Concave Utility Reinforcement Learning B. Moreno Margaux Brégère Pierre Gaillard Nadia Oudjane OffRL 39 0 0 30 May 2024
Koopman-Assisted Reinforcement Learning Preston Rozwood Edward Mehrez Ludger Paehler Wen Sun Steven L. Brunton 40 6 0 04 Mar 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization Tianying Ji Yongyuan Liang Yan Zeng Yu-Juan Luo Guowei Xu Jiawei Guo Ruijie Zheng Furong Huang Gang Hua Huazhe Xu CML 48 11 0 22 Feb 2024
Iteratively Learn Diverse Strategies with State Distance Information Wei Fu Weihua Du Jingwei Li Sunli Chen Jingzhao Zhang Yi Wu 51 3 0 23 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction Seohong Park Oleh Rybkin Sergey Levine OffRL 33 34 0 13 Oct 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning David Yunis Justin Jung Falcon Z. Dai Matthew R. Walter OffRL 47 0 0 08 Sep 2023
Reinforcement Learning by Guided Safe Exploration Qisong Yang T. D. Simão N. Jansen Simon Tindemans M. Spaan OffRL OnRL 34 5 0 26 Jul 2023
Submodular Reinforcement Learning Manish Prajapat Mojmír Mutný M. Zeilinger Andreas Krause OffRL 30 12 0 25 Jul 2023
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions Tongxin Li Yiheng Lin Shaolei Ren Adam Wierman AAML OffRL 34 6 0 20 Jul 2023
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data Ruiqi Zhang Andrea Zanette OffRL OnRL 40 7 0 10 Jul 2023
Active Sensing with Predictive Coding and Uncertainty Minimization A. Sharafeldin N. Imam Hannah Choi 20 3 0 02 Jul 2023
Optimal Exploration for Model-Based RL in Nonlinear Systems Andrew Wagenmaker Guanya Shi Kevin G. Jamieson 36 14 0 15 Jun 2023
A Cover Time Study of a non-Markovian Algorithm Guanhua Fang G. Samorodnitsky Zhiqiang Xu 18 0 0 08 Jun 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation Adrien Bolland Gilles Louppe D. Ernst 39 3 0 11 May 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs Yuan Cheng Ruiquan Huang J. Yang Yitao Liang OffRL 41 8 0 20 Mar 2023
Fast Rates for Maximum Entropy Exploration D. Tiapkin Denis Belomestny Daniele Calandriello Eric Moulines Rémi Munos A. Naumov Pierre Perrault Yunhao Tang Michal Valko Pierre Menard 44 17 0 14 Mar 2023
Scalable Multi-Agent Reinforcement Learning with General Utilities Donghao Ying Yuhao Ding Alec Koppel Javad Lavaei 38 1 0 15 Feb 2023
Distributional GFlowNets with Quantile Flows Dinghuai Zhang L. Pan Ricky T. Q. Chen Aaron Courville Yoshua Bengio 29 25 0 11 Feb 2023
Investigating the role of model-based learning in exploration and transfer Jacob Walker Eszter Vértes Yazhe Li Gabriel Dulac-Arnold Ankesh Anand T. Weber Jessica B. Hamrick OffRL 36 7 0 08 Feb 2023
Layered State Discovery for Incremental Autonomous Exploration Liyu Chen Andrea Tirinzoni A. Lazaric Matteo Pirotta 34 0 0 07 Feb 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization D. Grytskyy Jorge Ramírez-Ruiz R. Moreno-Bote 22 3 0 02 Feb 2023
The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making Shujian Yu Hongming Li Sigurd Løkse Robert Jenssen José C. Príncipe BDL 28 6 0 21 Jan 2023
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control Xiang Zheng Xingjun Ma Cong Wang 28 1 0 28 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination Pietro Mazzaglia Tim Verbelen Bart Dhoedt Alexandre Lacoste Sai Rajeswar 29 22 0 23 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments Daniel Jarrett Corentin Tallec Florent Altché Thomas Mesnard Rémi Munos Michal Valko 48 5 0 18 Nov 2022
Learning General World Models in a Handful of Reward-Free Deployments Yingchen Xu Jack Parker-Holder Aldo Pacchiano Philip J. Ball Oleh Rybkin Stephen J. Roberts Tim Rocktaschel Edward Grefenstette OffRL 57 9 0 23 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 37 35 0 19 Sep 2022
Active Exploration via Experiment Design in Markov Chains Mojmír Mutný Tadeusz Janik Andreas Krause 41 14 0 29 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting Tomasz Korbak Hady ElSahar Germán Kruszewski Marc Dymetman CLL 25 51 0 01 Jun 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Xinran Liang Katherine Shu Kimin Lee Pieter Abbeel 21 58 0 24 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters Xue Bin Peng Yunrong Guo L. Halper Sergey Levine Sanja Fidler 28 15 0 04 May 2022
Reinforcement Learning with Action-Free Pre-Training from Videos Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 18 118 0 25 Mar 2022
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning Mingqi Yuan Man-On Pun Dong Wang 19 23 0 08 Mar 2022
A Differential Entropy Estimator for Training Neural Networks Georg Pichler Pierre Colombo Malik Boudiaf Günther Koliander Pablo Piantanida 25 21 0 14 Feb 2022
Challenging Common Assumptions in Convex Reinforcement Learning Mirco Mutti Ric De Santi Piersilvio De Bartolomeis Marcello Restelli OffRL 32 21 0 03 Feb 2022
The Impact of Data Distribution on Q-learning with Function Approximation Pedro P. Santos Diogo S. Carvalho A. Sardinha Francisco S. Melo OffRL 19 2 0 23 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning Kimin Lee Laura M. Smith Anca Dragan Pieter Abbeel OffRL 40 93 0 04 Nov 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching Pierre-Alexandre Kamienny Jean Tarbouriech Sylvain Lamprier A. Lazaric Ludovic Denoyer SSL 40 18 0 27 Oct 2021