Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1901.00137
Cited By
A Theoretical Analysis of Deep Q-Learning
1 January 2019
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Theoretical Analysis of Deep Q-Learning"
49 / 99 papers shown
Title
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning
Zehao Dou
J. Kuba
Yaodong Yang
FAtt
27
5
0
10 Feb 2022
Deep Q-learning: a robust control approach
B. Varga
Balázs Kulcsár
M. Chehreghani
OOD
35
9
0
21 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
34
30
0
27 Dec 2021
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
40
168
0
08 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
50
4
0
29 Nov 2021
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
Alberto Sardinha
Francisco S. Melo
OffRL
24
2
0
23 Nov 2021
Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space
Jihao Long
Jiequn Han
34
6
0
05 Nov 2021
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
44
9
0
24 Oct 2021
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
45
22
0
20 Oct 2021
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
OffRL
49
51
0
09 Oct 2021
Optimal policy evaluation using kernel-based temporal difference methods
Yaqi Duan
Mengdi Wang
Martin J. Wainwright
OffRL
35
27
0
24 Sep 2021
Data Analytics for Smart cities: Challenges and Promises
F. Mohammadi
Farzan Shenavarmasouleh
M. Amini
H. Arabnia
19
8
0
12 Sep 2021
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games
Xiaotie Deng
Ningyuan Li
D. Mguni
Jun Wang
Yaodong Yang
37
46
0
04 Sep 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
34
115
0
19 Aug 2021
Towards General Function Approximation in Zero-Sum Markov Games
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
38
47
0
30 Jul 2021
UAV Swarm Path Planning with Reinforcement Learning for Field prospecting
Alejandro Puente-Castro
Daniel Rivero
A. Pazos
Enrique Fernández-Blanco
11
36
0
04 Jun 2021
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
30
36
0
10 May 2021
Learning to reflect: A unifying approach for data-driven stochastic control strategies
Soren Christensen
Claudia Strauch
Lukas Trottner
21
10
0
23 Apr 2021
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Yaqi Duan
Chi Jin
Zhiyuan Li
OffRL
36
48
0
25 Mar 2021
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks
Thanh Nguyen-Tang
Sunil R. Gupta
Hung The Tran
Svetha Venkatesh
OffRL
88
7
0
11 Mar 2021
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Gen Li
Changxiao Cai
Ee
Yuting Wei
Yuejie Chi
OffRL
55
75
0
12 Feb 2021
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
105
54
0
02 Feb 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
32
30
0
31 Jan 2021
Coding for Distributed Multi-Agent Reinforcement Learning
Baoqian Wang
Junfei Xie
Nikolay Atanasov
38
4
0
07 Jan 2021
A novel policy for pre-trained Deep Reinforcement Learning for Speech Emotion Recognition
Thejan Rajapakshe
R. Rana
Sara Khalifa
Björn W. Schuller
Jiajun Liu
OffRL
44
11
0
04 Jan 2021
Is Pessimism Provably Efficient for Offline RL?
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
32
350
0
30 Dec 2020
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER
Markus Holzleitner
Lukas Gruber
Jose A. Arjona-Medina
Johannes Brandstetter
Sepp Hochreiter
33
38
0
02 Dec 2020
Logistic Q-Learning
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
14
40
0
21 Oct 2020
Deep Reinforcement Learning for Adaptive Network Slicing in 5G for Intelligent Vehicular Systems and Smart Cities
A. Nassar
Y. Yilmaz
AI4CE
19
55
0
19 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
26
42
0
02 Aug 2020
The Kolmogorov-Arnold representation theorem revisited
Johannes Schmidt-Hieber
30
129
0
31 Jul 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kai Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
73
120
0
15 Jul 2020
Curriculum learning for multilevel budgeted combinatorial problems
Adel Nabli
Margarida Carvalho
AI4CE
11
4
0
07 Jul 2020
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning
Lingxiao Wang
Zhuoran Yang
Zhaoran Wang
34
26
0
21 Jun 2020
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
60
125
0
26 May 2020
Deep Reinforcement Learning for QoS-Constrained Resource Allocation in Multiservice Networks
J. Saraiva
I. M. Braga
V. F. Monteiro
F. Lima
T. Maciel
W. Freitas
F. Cavalcanti
14
9
0
03 Mar 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
21
11
0
25 Feb 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
41
125
0
17 Feb 2020
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms
Dong-hwan Lee
Niao He
31
28
0
04 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
70
1,187
0
24 Nov 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
30
108
0
25 Jun 2019
Feature-Based Q-Learning for Two-Player Stochastic Games
Zeyu Jia
Lin F. Yang
Mengdi Wang
27
45
0
02 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kai Zhang
Zhuoran Yang
Tamer Basar
37
125
0
31 May 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
19
9
0
14 Feb 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou
Tengyu Xu
Yingbin Liang
34
146
0
06 Feb 2019
Finite-Sample Analysis For Decentralized Batch Multi-Agent Reinforcement Learning With Networked Agents
Kai Zhang
Zhuoran Yang
Han Liu
Tong Zhang
Tamer Basar
OffRL
29
26
0
06 Dec 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Norm-Based Capacity Control in Neural Networks
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
127
579
0
27 Feb 2015
Q-learning with censored data
Y. Goldberg
Michael R. Kosorok
OffRL
68
137
0
30 May 2012
Previous
1
2