Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.12690
Cited By
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
23 March 2021
Yuanhao Wang
Ruosong Wang
Sham Kakade
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap"
34 / 34 papers shown
Title
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Dongruo Zhou
Quanquan Gu
Csaba Szepesvári
66
206
0
15 Dec 2020
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL
Andrea Zanette
OffRL
122
71
0
14 Dec 2020
A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted Setting
Philip Amortila
Nan Jiang
Tengyang Xie
OffRL
90
23
0
02 Nov 2020
What are the Statistical Limits of Offline RL with Linear Function Approximation?
Ruosong Wang
Dean Phillips Foster
Sham Kakade
OffRL
127
163
0
22 Oct 2020
Exponential Lower Bounds for Planning in MDPs With Linearly-Realizable Optimal Action-Value Functions
Gellert Weisz
Philip Amortila
Csaba Szepesvári
OffRL
122
80
0
03 Oct 2020
Efficient Planning in Large MDPs with Weak Linear Function Approximation
R. Shariff
Csaba Szepesvári
49
22
0
13 Jul 2020
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
OffRL
147
52
0
04 Jul 2020
Robust Linear Regression: Optimal Rates in Polynomial Time
Ainesh Bakshi
Adarsh Prasad
54
59
0
29 Jun 2020
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
Dongruo Zhou
Jiafan He
Quanquan Gu
55
135
0
23 Jun 2020
On Reward-Free Reinforcement Learning with Linear Function Approximation
Ruosong Wang
S. Du
Lin F. Yang
Ruslan Salakhutdinov
OffRL
55
106
0
19 Jun 2020
Q
Q
Q
-learning with Logarithmic Regret
Kunhe Yang
Lin F. Yang
S. Du
64
59
0
16 Jun 2020
Model-Based Reinforcement Learning with Value-Targeted Regression
Alex Ayoub
Zeyu Jia
Csaba Szepesvári
Mengdi Wang
Lin F. Yang
OffRL
83
303
0
01 Jun 2020
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition
Zihan Zhang
Yuanshuo Zhou
Xiangyang Ji
OffRL
60
156
0
21 Apr 2020
DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Aviral Kumar
Abhishek Gupta
Sergey Levine
OffRL
39
101
0
16 Mar 2020
Learning Near Optimal Policies with Low Inherent Bellman Error
Andrea Zanette
A. Lazaric
Mykel Kochenderfer
Emma Brunskill
OffRL
68
222
0
29 Feb 2020
Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
Dylan J. Foster
Alexander Rakhlin
275
207
0
12 Feb 2020
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
49
280
0
12 Dec 2019
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
168
136
0
09 Dec 2019
Comments on the Du-Kakade-Wang-Yang Lower Bounds
Benjamin Van Roy
Shi Dong
111
38
0
18 Nov 2019
Learning with Good Feature Representations in Bandits and in RL with a Generative Model
Tor Lattimore
Csaba Szepesvári
Gellert Weisz
OffRL
131
170
0
18 Nov 2019
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
69
151
0
13 Nov 2019
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles
Aditya Modi
Nan Jiang
Ambuj Tewari
Satinder Singh
61
131
0
23 Oct 2019
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
S. Du
Sham Kakade
Ruosong Wang
Lin F. Yang
153
192
0
07 Oct 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
86
555
0
11 Jul 2019
Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal
Alekh Agarwal
Sham Kakade
Lin F. Yang
OffRL
81
170
0
10 Jun 2019
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
Lin F. Yang
Mengdi Wang
OffRL
GP
55
285
0
24 May 2019
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs
Max Simchowitz
Kevin Jamieson
63
145
0
09 May 2019
Information-Theoretic Considerations in Batch Reinforcement Learning
Jinglin Chen
Nan Jiang
OOD
OffRL
127
375
0
01 May 2019
Provably efficient RL with Rich Observations via Latent State Decoding
S. Du
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
Miroslav Dudík
John Langford
OffRL
60
230
0
25 Jan 2019
Efficient Algorithms for Outlier-Robust Regression
Adam R. Klivans
Pravesh Kothari
Raghu Meka
AAML
46
154
0
08 Mar 2018
Outlier-robust moment-estimation via sum-of-squares
Pravesh Kothari
David Steurer
48
65
0
30 Nov 2017
Better Agnostic Clustering Via Relaxed Tensor Norms
Pravesh Kothari
Jacob Steinhardt
72
60
0
20 Nov 2017
An Introduction to Matrix Concentration Inequalities
J. Tropp
129
1,148
0
07 Jan 2015
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization
Zheng Wen
Benjamin Van Roy
55
42
0
18 Jul 2013
1