Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.05388
Cited By
v1
v2 (latest)
Provably Efficient Reinforcement Learning with Linear Function Approximation
11 July 2019
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Provably Efficient Reinforcement Learning with Linear Function Approximation"
50 / 417 papers shown
Title
Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen
Jiachen Hu
Chi Jin
Lihong Li
Liwei Wang
202
118
0
07 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML
FAtt
OffRL
68
2
0
06 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
Tong Zhang
87
65
0
02 Oct 2021
Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning
Carlo Alfano
Patrick Rebeschini
91
5
0
23 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
Washim Uddin Mondal
Mridul Agarwal
Vaneet Aggarwal
S. Ukkusuri
134
44
0
09 Sep 2021
Adaptive Control of Differentially Private Linear Quadratic Systems
Sayak Ray Chowdhury
Xingyu Zhou
Ness B. Shroff
75
10
0
26 Aug 2021
A Boosting Approach to Reinforcement Learning
Nataly Brukhim
Elad Hazan
Karan Singh
84
14
0
22 Aug 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
104
119
0
19 Aug 2021
Towards General Function Approximation in Zero-Sum Markov Games
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
88
47
0
30 Jul 2021
Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses
Haipeng Luo
Chen-Yu Wei
Chung-Wei Lee
124
45
0
18 Jul 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
194
150
0
13 Jul 2021
Model Selection for Generic Reinforcement Learning
Avishek Ghosh
Sayak Ray Chowdhury
Kannan Ramchandran
54
1
0
13 Jul 2021
Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning
Barna Pásztor
Ilija Bogunovic
Andreas Krause
88
41
0
08 Jul 2021
Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Yifei Min
Tianhao Wang
Dongruo Zhou
Quanquan Gu
OffRL
89
38
0
22 Jun 2021
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Weitong Zhang
Jiafan He
Dongruo Zhou
Amy Zhang
Quanquan Gu
OffRL
72
11
0
22 Jun 2021
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation
Jiafan He
Dongruo Zhou
Quanquan Gu
55
13
0
22 Jun 2021
Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
OffRL
57
11
0
22 Jun 2021
On the Power of Multitask Representation Learning in Linear MDP
Rui Lu
Gao Huang
S. Du
85
29
0
15 Jun 2021
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq
Qiwen Cui
V. Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin F. Yang
96
48
0
15 Jun 2021
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Dhruv Malik
Aldo Pacchiano
Vishwak Srinivasan
Yuanzhi Li
57
6
0
15 Jun 2021
Online Sub-Sampling for Reinforcement Learning with General Function Approximation
Dingwen Kong
Ruslan Salakhutdinov
Ruosong Wang
Lin F. Yang
OffRL
75
1
0
14 Jun 2021
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
240
280
0
13 Jun 2021
Corruption-Robust Offline Reinforcement Learning
Xuezhou Zhang
Yiding Chen
Jerry Zhu
Wen Sun
OffRL
94
43
0
11 Jun 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRL
OnRL
109
165
0
09 Jun 2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes
Honghao Wei
Xin Liu
Lei Ying
112
23
0
03 Jun 2021
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model
Bingyan Wang
Yuling Yan
Jianqing Fan
110
20
0
28 May 2021
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting
Gen Li
Yuxin Chen
Yuejie Chi
Yuantao Gu
Yuting Wei
OffRL
92
30
0
17 May 2021
Online Algorithms and Policies Using Adaptive and Machine Learning Approaches
Anuradha M. Annaswamy
A. Guha
Yingnan Cui
Sunbochen Tang
Peter A. Fisher
Joseph E. Gaudio
81
19
0
13 May 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Ming Yin
Yu Wang
OffRL
106
19
0
13 May 2021
Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
81
35
0
09 May 2021
Learning Good State and Action Representations via Tensor Decomposition
Chengzhuo Ni
Yaqi Duan
M. Dahleh
Anru R. Zhang
Mengdi Wang
94
7
0
03 May 2021
An
L
2
L^2
L
2
Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation
Jihao Long
Jiequn Han null
Weinan E
OffRL
75
15
0
15 Apr 2021
Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space Reduction
Yuanyang Zhu
Zhi Wang
Chunlin Chen
D. Dong
42
37
0
15 Apr 2021
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
73
27
0
08 Apr 2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Hiroki Furuta
T. Matsushima
Tadashi Kozuno
Y. Matsuo
Sergey Levine
Ofir Nachum
S. Gu
OffRL
58
14
0
23 Mar 2021
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Yuanhao Wang
Ruosong Wang
Sham Kakade
OffRL
193
43
0
23 Mar 2021
Provably Correct Optimization and Exploration with Non-linear Policies
Fei Feng
W. Yin
Alekh Agarwal
Lin F. Yang
156
13
0
22 Mar 2021
Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm
Lin Chen
B. Scherrer
Peter L. Bartlett
OffRL
213
16
0
17 Mar 2021
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks
Thanh Nguyen-Tang
Sunil R. Gupta
Hung The Tran
Svetha Venkatesh
OffRL
137
7
0
11 Mar 2021
Online Learning for Unknown Partially Observable MDPs
Mehdi Jafarnia-Jahromi
Rahul Jain
A. Nayyar
108
20
0
25 Feb 2021
Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
Zhihan Xiong
Ruoqi Shen
Qiwen Cui
Maryam Fazel
S. Du
92
10
0
19 Feb 2021
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
Jiafan He
Dongruo Zhou
Quanquan Gu
139
24
0
17 Feb 2021
Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
Zixiang Chen
Dongruo Zhou
Quanquan Gu
75
25
0
15 Feb 2021
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Yue Wu
Dongruo Zhou
Quanquan Gu
62
21
0
15 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
76
18
0
13 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
189
108
0
10 Feb 2021
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
Shi Dong
Benjamin Van Roy
Zhengyuan Zhou
110
32
0
10 Feb 2021
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Kefan Dong
Jiaqi Yang
Tengyu Ma
97
33
0
08 Feb 2021
Near-optimal Representation Learning for Linear Bandits and Linear RL
Jiachen Hu
Xiaoyu Chen
Chi Jin
Lihong Li
Liwei Wang
OffRL
165
53
0
08 Feb 2021
Previous
1
2
3
4
5
6
7
8
9
Next