Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1807.03765
Cited By
Is Q-learning Provably Efficient?
10 July 2018
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Is Q-learning Provably Efficient?"
50 / 225 papers shown
Title
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments
Runlong Zhou
Zihan Zhang
S. Du
49
10
0
31 Jan 2023
Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation
Uri Sherman
Tomer Koren
Yishay Mansour
37
12
0
30 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
24
8
0
28 Jan 2023
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse over Wireless Communications
Wen-li Yu
Terence Jie Chua
Jun Zhao
OffRL
19
20
0
30 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
42
14
0
24 Dec 2022
Latent Variable Representation for Reinforcement Learning
Tongzheng Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
35
10
0
17 Dec 2022
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
Jiafan He
Heyang Zhao
Dongruo Zhou
Quanquan Gu
OffRL
62
55
0
12 Dec 2022
Frugal Reinforcement-based Active Learning
Sebastien Deschamps
H. Sahbi
25
0
0
09 Dec 2022
Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP
Jinghan Wang
Meng-Xian Wang
Lin F. Yang
42
16
0
01 Dec 2022
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Yizhou Zhang
Guannan Qu
Pan Xu
Yiheng Lin
Zaiwei Chen
Adam Wierman
44
26
0
30 Nov 2022
Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal Dynamic Regret, Adaptive Detection, and Separation Design
Yuhao Ding
Ming Jin
Javad Lavaei
34
6
0
19 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
36
4
0
30 Oct 2022
Hardness in Markov Decision Processes: Theory and Practice
Michelangelo Conserva
Paulo E. Rauber
39
3
0
24 Oct 2022
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
37
6
0
19 Oct 2022
A Unified Algorithm for Stochastic Path Problems
Christoph Dann
Chen-Yu Wei
Julian Zimmert
40
0
0
17 Oct 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning
Zihan Zhang
Yuhang Jiang
Yuanshuo Zhou
Xiangyang Ji
OffRL
26
9
0
15 Oct 2022
Reward Imputation with Sketching for Contextual Batched Bandits
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
28
1
0
13 Oct 2022
Square-root regret bounds for continuous-time episodic Markov decision processes
Xuefeng Gao
X. Zhou
51
6
0
03 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations
Fengdi Che
Xiru Zhu
Doina Precup
David Meger
Gregory Dudek
19
2
0
01 Oct 2022
O
(
T
−
1
)
O(T^{-1})
O
(
T
−
1
)
Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games
Yuepeng Yang
Cong Ma
42
14
0
26 Sep 2022
Multi-armed Bandit Learning on a Graph
Tianpeng Zhang
Kasper Johansson
Na Li
35
6
0
20 Sep 2022
Socially Fair Reinforcement Learning
Debmalya Mandal
Jiarui Gan
OffRL
30
13
0
26 Aug 2022
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model
Gen Li
Yuejie Chi
Yuting Wei
Yuxin Chen
37
18
0
22 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
42
27
0
19 Aug 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
49
13
0
03 Aug 2022
Making Linear MDPs Practical via Contrastive Representation Learning
Tianjun Zhang
Tongzheng Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
27
44
0
14 Jul 2022
Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies
Boxiao Chen
Jiashuo Jiang
Jiawei Zhang
Zhengyuan Zhou
26
10
0
10 Jul 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
A. Ghosh
Xingyu Zhou
Ness B. Shroff
75
23
0
23 Jun 2022
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information
Yonathan Efroni
Dylan J. Foster
Dipendra Kumar Misra
A. Krishnamurthy
John Langford
OffRL
31
25
0
09 Jun 2022
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path
Yihan Du
Siwei Wang
Longbo Huang
OOD
37
13
0
06 Jun 2022
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
36
26
0
06 Jun 2022
Learning in Congestion Games with Bandit Feedback
Qiwen Cui
Zhihan Xiong
Maryam Fazel
S. Du
31
12
0
04 Jun 2022
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games
Qinghua Liu
Csaba Szepesvári
Chi Jin
45
20
0
02 Jun 2022
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
Andrea Zanette
Martin J. Wainwright
OOD
45
5
0
01 Jun 2022
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
Yiding Chen
Xuezhou Zhang
Kai Zhang
Mengdi Wang
Xiaojin Zhu
OffRL
34
16
0
01 Jun 2022
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu
Qi Zeng
Gagandeep Singh
AAML
42
6
0
30 May 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D. Tiapkin
Denis Belomestny
Eric Moulines
A. Naumov
S. Samsonov
Yunhao Tang
Michal Valko
Pierre Menard
34
17
0
16 May 2022
Provably Efficient Kernelized Q-Learning
Shuang Liu
H. Su
MLT
34
4
0
21 Apr 2022
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
Qi Cai
Zhuoran Yang
Zhaoran Wang
38
14
0
20 Apr 2022
When Is Partially Observable Reinforcement Learning Not Scary?
Qinghua Liu
Alan Chung
Csaba Szepesvári
Chi Jin
22
94
0
19 Apr 2022
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
44
109
0
05 Apr 2022
Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies
Zihan Zhang
Xiangyang Ji
S. Du
35
21
0
24 Mar 2022
Reinforcement-based frugal learning for satellite image change detection
Sebastien Deschamps
H. Sahbi
37
1
0
22 Mar 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
78
40
0
14 Mar 2022
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets
Yifei Min
Tianhao Wang
Ruitu Xu
Zhaoran Wang
Michael I. Jordan
Zhuoran Yang
38
21
0
07 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in
Q
Q
Q
-Learning
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
28
19
0
05 Mar 2022
Branching Reinforcement Learning
Yihan Du
Wei Chen
32
0
0
16 Feb 2022
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
Dan Qiao
Ming Yin
Ming Min
Yu Wang
43
28
0
13 Feb 2022
Transferred Q-learning
Elynn Y. Chen
Michael I. Jordan
Sai Li
OffRL
OnRL
36
4
0
09 Feb 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
72
25
0
31 Jan 2022
Previous
1
2
3
4
5
Next