An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap

23 March 2021

Papers citing "An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap"

34 / 34 papers shown

Title
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes Dongruo Zhou Quanquan Gu Csaba Szepesvári 66 206 0 15 Dec 2020
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL Andrea Zanette OffRL 122 71 0 14 Dec 2020
A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted Setting Philip Amortila Nan Jiang Tengyang Xie OffRL 90 23 0 02 Nov 2020
What are the Statistical Limits of Offline RL with Linear Function Approximation? Ruosong Wang Dean Phillips Foster Sham Kakade OffRL 127 163 0 22 Oct 2020
Exponential Lower Bounds for Planning in MDPs With Linearly-Realizable Optimal Action-Value Functions Gellert Weisz Philip Amortila Csaba Szepesvári OffRL 122 80 0 03 Oct 2020
Efficient Planning in Large MDPs with Weak Linear Function Approximation R. Shariff Csaba Szepesvári 49 22 0 13 Jul 2020
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design Yufei Ruan Jiaqi Yang Yuanshuo Zhou OffRL 147 52 0 04 Jul 2020
Robust Linear Regression: Optimal Rates in Polynomial Time Ainesh Bakshi Adarsh Prasad 54 59 0 29 Jun 2020
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping Dongruo Zhou Jiafan He Quanquan Gu 55 135 0 23 Jun 2020
On Reward-Free Reinforcement Learning with Linear Function Approximation Ruosong Wang S. Du Lin F. Yang Ruslan Salakhutdinov OffRL 55 106 0 19 Jun 2020
$Q$ -learning with Logarithmic Regret Kunhe Yang Lin F. Yang S. Du 64 59 0 16 Jun 2020
Model-Based Reinforcement Learning with Value-Targeted Regression Alex Ayoub Zeyu Jia Csaba Szepesvári Mengdi Wang Lin F. Yang OffRL 83 303 0 01 Jun 2020
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition Zihan Zhang Yuanshuo Zhou Xiangyang Ji OffRL 60 156 0 21 Apr 2020
DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction Aviral Kumar Abhishek Gupta Sergey Levine OffRL 39 101 0 16 Mar 2020
Learning Near Optimal Policies with Low Inherent Bellman Error Andrea Zanette A. Lazaric Mykel Kochenderfer Emma Brunskill OffRL 68 222 0 29 Feb 2020
Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles Dylan J. Foster Alexander Rakhlin 275 207 0 12 Feb 2020
Provably Efficient Exploration in Policy Optimization Qi Cai Zhuoran Yang Chi Jin Zhaoran Wang 49 280 0 12 Dec 2019
Optimism in Reinforcement Learning with Generalized Linear Function Approximation Yining Wang Ruosong Wang S. Du A. Krishnamurthy 168 136 0 09 Dec 2019
Comments on the Du-Kakade-Wang-Yang Lower Bounds Benjamin Van Roy Shi Dong 111 38 0 18 Nov 2019
Learning with Good Feature Representations in Bandits and in RL with a Generative Model Tor Lattimore Csaba Szepesvári Gellert Weisz OffRL 131 170 0 18 Nov 2019
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning Dipendra Kumar Misra Mikael Henaff A. Krishnamurthy John Langford 69 151 0 13 Nov 2019
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles Aditya Modi Nan Jiang Ambuj Tewari Satinder Singh 61 131 0 23 Oct 2019
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning? S. Du Sham Kakade Ruosong Wang Lin F. Yang 153 192 0 07 Oct 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation Chi Jin Zhuoran Yang Zhaoran Wang Michael I. Jordan 86 555 0 11 Jul 2019
Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal Alekh Agarwal Sham Kakade Lin F. Yang OffRL 81 170 0 10 Jun 2019
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound Lin F. Yang Mengdi Wang OffRL GP 55 285 0 24 May 2019
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs Max Simchowitz Kevin Jamieson 63 145 0 09 May 2019
Information-Theoretic Considerations in Batch Reinforcement Learning Jinglin Chen Nan Jiang OOD OffRL 127 375 0 01 May 2019
Provably efficient RL with Rich Observations via Latent State Decoding S. Du A. Krishnamurthy Nan Jiang Alekh Agarwal Miroslav Dudík John Langford OffRL 60 230 0 25 Jan 2019
Efficient Algorithms for Outlier-Robust Regression Adam R. Klivans Pravesh Kothari Raghu Meka AAML 46 154 0 08 Mar 2018
Outlier-robust moment-estimation via sum-of-squares Pravesh Kothari David Steurer 48 65 0 30 Nov 2017
Better Agnostic Clustering Via Relaxed Tensor Norms Pravesh Kothari Jacob Steinhardt 72 60 0 20 Nov 2017
An Introduction to Matrix Concentration Inequalities J. Tropp 129 1,148 0 07 Jan 2015
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization Zheng Wen Benjamin Van Roy 55 42 0 18 Jul 2013