Gap-Dependent Unsupervised Exploration for Reinforcement Learning

11 August 2021

Papers citing "Gap-Dependent Unsupervised Exploration for Reinforcement Learning"

9 / 9 papers shown

Title
Provable Offline Preference-Based Reinforcement Learning Wenhao Zhan Masatoshi Uehara Nathan Kallus Jason D. Lee Wen Sun OffRL 37 24 0 24 May 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning Dingwen Kong Lin F. Yang 31 9 0 18 Apr 2023
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage Masatoshi Uehara Nathan Kallus Jason D. Lee Wen Sun OffRL 39 5 0 05 Feb 2023
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings Masatoshi Uehara Ayush Sekhari Jason D. Lee Nathan Kallus Wen Sun 58 6 0 24 Jun 2022
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL Jinglin Chen Aditya Modi A. Krishnamurthy Nan Jiang Alekh Agarwal 38 25 0 21 Jun 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps Jinglin Chen Nan Jiang OffRL 21 33 0 25 Mar 2022
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes Andrew Wagenmaker Yifang Chen Max Simchowitz S. Du Kevin G. Jamieson 19 48 0 26 Jan 2022
Adaptive Multi-Goal Exploration Jean Tarbouriech O. D. Domingues Pierre Ménard Matteo Pirotta Michal Valko A. Lazaric 18 2 0 23 Nov 2021
Reward-Free Exploration for Reinforcement Learning Chi Jin A. Krishnamurthy Max Simchowitz Tiancheng Yu OffRL 112 194 0 07 Feb 2020