ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.12690
  4. Cited By
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant
  Suboptimality Gap

An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap

23 March 2021
Yuanhao Wang
Ruosong Wang
Sham Kakade
    OffRL
ArXivPDFHTML

Papers citing "An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap"

34 / 34 papers shown
Title
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov
  Decision Processes
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Dongruo Zhou
Quanquan Gu
Csaba Szepesvári
66
206
0
15 Dec 2020
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can
  be Exponentially Harder than Online RL
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL
Andrea Zanette
OffRL
122
71
0
14 Dec 2020
A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted
  Setting
A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted Setting
Philip Amortila
Nan Jiang
Tengyang Xie
OffRL
90
23
0
02 Nov 2020
What are the Statistical Limits of Offline RL with Linear Function
  Approximation?
What are the Statistical Limits of Offline RL with Linear Function Approximation?
Ruosong Wang
Dean Phillips Foster
Sham Kakade
OffRL
127
163
0
22 Oct 2020
Exponential Lower Bounds for Planning in MDPs With Linearly-Realizable
  Optimal Action-Value Functions
Exponential Lower Bounds for Planning in MDPs With Linearly-Realizable Optimal Action-Value Functions
Gellert Weisz
Philip Amortila
Csaba Szepesvári
OffRL
122
80
0
03 Oct 2020
Efficient Planning in Large MDPs with Weak Linear Function Approximation
Efficient Planning in Large MDPs with Weak Linear Function Approximation
R. Shariff
Csaba Szepesvári
49
22
0
13 Jul 2020
Linear Bandits with Limited Adaptivity and Learning Distributional
  Optimal Design
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
OffRL
147
52
0
04 Jul 2020
Robust Linear Regression: Optimal Rates in Polynomial Time
Robust Linear Regression: Optimal Rates in Polynomial Time
Ainesh Bakshi
Adarsh Prasad
54
59
0
29 Jun 2020
Provably Efficient Reinforcement Learning for Discounted MDPs with
  Feature Mapping
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
Dongruo Zhou
Jiafan He
Quanquan Gu
55
135
0
23 Jun 2020
On Reward-Free Reinforcement Learning with Linear Function Approximation
On Reward-Free Reinforcement Learning with Linear Function Approximation
Ruosong Wang
S. Du
Lin F. Yang
Ruslan Salakhutdinov
OffRL
55
106
0
19 Jun 2020
$Q$-learning with Logarithmic Regret
QQQ-learning with Logarithmic Regret
Kunhe Yang
Lin F. Yang
S. Du
64
59
0
16 Jun 2020
Model-Based Reinforcement Learning with Value-Targeted Regression
Model-Based Reinforcement Learning with Value-Targeted Regression
Alex Ayoub
Zeyu Jia
Csaba Szepesvári
Mengdi Wang
Lin F. Yang
OffRL
83
303
0
01 Jun 2020
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage
  Decomposition
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition
Zihan Zhang
Yuanshuo Zhou
Xiangyang Ji
OffRL
60
156
0
21 Apr 2020
DisCor: Corrective Feedback in Reinforcement Learning via Distribution
  Correction
DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Aviral Kumar
Abhishek Gupta
Sergey Levine
OffRL
39
101
0
16 Mar 2020
Learning Near Optimal Policies with Low Inherent Bellman Error
Learning Near Optimal Policies with Low Inherent Bellman Error
Andrea Zanette
A. Lazaric
Mykel Kochenderfer
Emma Brunskill
OffRL
68
222
0
29 Feb 2020
Beyond UCB: Optimal and Efficient Contextual Bandits with Regression
  Oracles
Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
Dylan J. Foster
Alexander Rakhlin
275
207
0
12 Feb 2020
Provably Efficient Exploration in Policy Optimization
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
49
280
0
12 Dec 2019
Optimism in Reinforcement Learning with Generalized Linear Function
  Approximation
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
168
136
0
09 Dec 2019
Comments on the Du-Kakade-Wang-Yang Lower Bounds
Comments on the Du-Kakade-Wang-Yang Lower Bounds
Benjamin Van Roy
Shi Dong
111
38
0
18 Nov 2019
Learning with Good Feature Representations in Bandits and in RL with a
  Generative Model
Learning with Good Feature Representations in Bandits and in RL with a Generative Model
Tor Lattimore
Csaba Szepesvári
Gellert Weisz
OffRL
131
170
0
18 Nov 2019
Kinematic State Abstraction and Provably Efficient Rich-Observation
  Reinforcement Learning
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
69
151
0
13 Nov 2019
Sample Complexity of Reinforcement Learning using Linearly Combined
  Model Ensembles
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles
Aditya Modi
Nan Jiang
Ambuj Tewari
Satinder Singh
61
131
0
23 Oct 2019
Is a Good Representation Sufficient for Sample Efficient Reinforcement
  Learning?
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
S. Du
Sham Kakade
Ruosong Wang
Lin F. Yang
153
192
0
07 Oct 2019
Provably Efficient Reinforcement Learning with Linear Function
  Approximation
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
86
555
0
11 Jul 2019
Model-Based Reinforcement Learning with a Generative Model is Minimax
  Optimal
Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal
Alekh Agarwal
Sham Kakade
Lin F. Yang
OffRL
81
170
0
10 Jun 2019
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and
  Regret Bound
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
Lin F. Yang
Mengdi Wang
OffRL
GP
55
285
0
24 May 2019
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs
Max Simchowitz
Kevin Jamieson
63
145
0
09 May 2019
Information-Theoretic Considerations in Batch Reinforcement Learning
Information-Theoretic Considerations in Batch Reinforcement Learning
Jinglin Chen
Nan Jiang
OOD
OffRL
127
375
0
01 May 2019
Provably efficient RL with Rich Observations via Latent State Decoding
Provably efficient RL with Rich Observations via Latent State Decoding
S. Du
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
Miroslav Dudík
John Langford
OffRL
60
230
0
25 Jan 2019
Efficient Algorithms for Outlier-Robust Regression
Efficient Algorithms for Outlier-Robust Regression
Adam R. Klivans
Pravesh Kothari
Raghu Meka
AAML
46
154
0
08 Mar 2018
Outlier-robust moment-estimation via sum-of-squares
Outlier-robust moment-estimation via sum-of-squares
Pravesh Kothari
David Steurer
48
65
0
30 Nov 2017
Better Agnostic Clustering Via Relaxed Tensor Norms
Better Agnostic Clustering Via Relaxed Tensor Norms
Pravesh Kothari
Jacob Steinhardt
72
60
0
20 Nov 2017
An Introduction to Matrix Concentration Inequalities
An Introduction to Matrix Concentration Inequalities
J. Tropp
129
1,148
0
07 Jan 2015
Efficient Reinforcement Learning in Deterministic Systems with Value
  Function Generalization
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization
Zheng Wen
Benjamin Van Roy
55
42
0
18 Jul 2013
1