ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.00137
  4. Cited By
A Theoretical Analysis of Deep Q-Learning

A Theoretical Analysis of Deep Q-Learning

1 January 2019
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
ArXivPDFHTML

Papers citing "A Theoretical Analysis of Deep Q-Learning"

49 / 99 papers shown
Title
Understanding Value Decomposition Algorithms in Deep Cooperative
  Multi-Agent Reinforcement Learning
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning
Zehao Dou
J. Kuba
Yaodong Yang
FAtt
27
5
0
10 Feb 2022
Deep Q-learning: a robust control approach
Deep Q-learning: a robust control approach
B. Varga
Balázs Kulcsár
M. Chehreghani
OOD
35
9
0
21 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in
  General-Sum Markov Games with Myopic Followers?
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
34
30
0
27 Dec 2021
Recent Advances in Reinforcement Learning in Finance
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
40
168
0
08 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
50
4
0
29 Nov 2021
The Impact of Data Distribution on Q-learning with Function
  Approximation
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
Alberto Sardinha
Francisco S. Melo
OffRL
24
2
0
23 Nov 2021
Perturbational Complexity by Distribution Mismatch: A Systematic
  Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space
Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space
Jihao Long
Jiequn Han
34
6
0
05 Nov 2021
False Correlation Reduction for Offline Reinforcement Learning
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
44
9
0
24 Oct 2021
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via
  pT-Learning
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
45
22
0
20 Oct 2021
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free
  Reinforcement Learning
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
OffRL
49
51
0
09 Oct 2021
Optimal policy evaluation using kernel-based temporal difference methods
Optimal policy evaluation using kernel-based temporal difference methods
Yaqi Duan
Mengdi Wang
Martin J. Wainwright
OffRL
35
27
0
24 Sep 2021
Data Analytics for Smart cities: Challenges and Promises
Data Analytics for Smart cities: Challenges and Promises
F. Mohammadi
Farzan Shenavarmasouleh
M. Amini
H. Arabnia
19
8
0
12 Sep 2021
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum
  Stochastic Games
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games
Xiaotie Deng
Ningyuan Li
D. Mguni
Jun Wang
Yaodong Yang
37
46
0
04 Sep 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement
  Learning
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
34
115
0
19 Aug 2021
Towards General Function Approximation in Zero-Sum Markov Games
Towards General Function Approximation in Zero-Sum Markov Games
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
38
47
0
30 Jul 2021
UAV Swarm Path Planning with Reinforcement Learning for Field
  prospecting
UAV Swarm Path Planning with Reinforcement Learning for Field prospecting
Alejandro Puente-Castro
Daniel Rivero
A. Pazos
Enrique Fernández-Blanco
11
36
0
04 Jun 2021
Deeply-Debiased Off-Policy Interval Estimation
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
30
36
0
10 May 2021
Learning to reflect: A unifying approach for data-driven stochastic
  control strategies
Learning to reflect: A unifying approach for data-driven stochastic control strategies
Soren Christensen
Claudia Strauch
Lukas Trottner
21
10
0
23 Apr 2021
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Yaqi Duan
Chi Jin
Zhiyuan Li
OffRL
36
48
0
25 Mar 2021
Sample Complexity of Offline Reinforcement Learning with Deep ReLU
  Networks
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks
Thanh Nguyen-Tang
Sunil R. Gupta
Hung The Tran
Svetha Venkatesh
OffRL
88
7
0
11 Mar 2021
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Gen Li
Changxiao Cai
Ee
Yuting Wei
Yuejie Chi
OffRL
55
75
0
12 Feb 2021
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous
  Q-Learning and TD-Learning Variants
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
105
54
0
02 Feb 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
32
30
0
31 Jan 2021
Coding for Distributed Multi-Agent Reinforcement Learning
Coding for Distributed Multi-Agent Reinforcement Learning
Baoqian Wang
Junfei Xie
Nikolay Atanasov
38
4
0
07 Jan 2021
A novel policy for pre-trained Deep Reinforcement Learning for Speech
  Emotion Recognition
A novel policy for pre-trained Deep Reinforcement Learning for Speech Emotion Recognition
Thejan Rajapakshe
R. Rana
Sara Khalifa
Björn W. Schuller
Jiajun Liu
OffRL
44
11
0
04 Jan 2021
Is Pessimism Provably Efficient for Offline RL?
Is Pessimism Provably Efficient for Offline RL?
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
32
350
0
30 Dec 2020
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER
Markus Holzleitner
Lukas Gruber
Jose A. Arjona-Medina
Johannes Brandstetter
Sepp Hochreiter
33
38
0
02 Dec 2020
Logistic Q-Learning
Logistic Q-Learning
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
14
40
0
21 Oct 2020
Deep Reinforcement Learning for Adaptive Network Slicing in 5G for
  Intelligent Vehicular Systems and Smart Cities
Deep Reinforcement Learning for Adaptive Network Slicing in 5G for Intelligent Vehicular Systems and Smart Cities
A. Nassar
Y. Yilmaz
AI4CE
19
55
0
19 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
26
42
0
02 Aug 2020
The Kolmogorov-Arnold representation theorem revisited
The Kolmogorov-Arnold representation theorem revisited
Johannes Schmidt-Hieber
30
129
0
31 Jul 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal
  Sample Complexity
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kai Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
73
120
0
15 Jul 2020
Curriculum learning for multilevel budgeted combinatorial problems
Curriculum learning for multilevel budgeted combinatorial problems
Adel Nabli
Margarida Carvalho
AI4CE
11
4
0
07 Jul 2020
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration
  for Mean-Field Reinforcement Learning
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning
Lingxiao Wang
Zhuoran Yang
Zhaoran Wang
34
26
0
21 Jun 2020
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning
  with a Generative Model
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
60
125
0
26 May 2020
Deep Reinforcement Learning for QoS-Constrained Resource Allocation in
  Multiservice Networks
Deep Reinforcement Learning for QoS-Constrained Resource Allocation in Multiservice Networks
J. Saraiva
I. M. Braga
V. F. Monteiro
F. Lima
T. Maciel
W. Freitas
F. Cavalcanti
14
9
0
03 Mar 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
21
11
0
25 Feb 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function
  Approximation and Correlated Equilibrium
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
41
125
0
17 Feb 2020
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning
  Algorithms
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms
Dong-hwan Lee
Niao He
31
28
0
04 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and
  Algorithms
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
70
1,187
0
24 Nov 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally
  Optimal Policy
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
30
108
0
25 Jun 2019
Feature-Based Q-Learning for Two-Player Stochastic Games
Feature-Based Q-Learning for Two-Player Stochastic Games
Zeyu Jia
Lin F. Yang
Mengdi Wang
27
45
0
02 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum
  Linear Quadratic Games
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kai Zhang
Zhuoran Yang
Tamer Basar
37
125
0
31 May 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
19
9
0
14 Feb 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou
Tengyu Xu
Yingbin Liang
34
146
0
06 Feb 2019
Finite-Sample Analysis For Decentralized Batch Multi-Agent Reinforcement
  Learning With Networked Agents
Finite-Sample Analysis For Decentralized Batch Multi-Agent Reinforcement Learning With Networked Agents
Kai Zhang
Zhuoran Yang
Han Liu
Tong Zhang
Tamer Basar
OffRL
29
26
0
06 Dec 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Norm-Based Capacity Control in Neural Networks
Norm-Based Capacity Control in Neural Networks
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
127
579
0
27 Feb 2015
Q-learning with censored data
Q-learning with censored data
Y. Goldberg
Michael R. Kosorok
OffRL
68
137
0
30 May 2012
Previous
12