ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.00137
  4. Cited By
A Theoretical Analysis of Deep Q-Learning

A Theoretical Analysis of Deep Q-Learning

1 January 2019
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
ArXivPDFHTML

Papers citing "A Theoretical Analysis of Deep Q-Learning"

50 / 98 papers shown
Title
Embodied Intelligence: The Key to Unblocking Generalized Artificial Intelligence
Embodied Intelligence: The Key to Unblocking Generalized Artificial Intelligence
Jinhao Jiang
Changlin Chen
Shile Feng
Wanru Geng
Zesheng Zhou
Ni Wang
Shuai Li
Feng-Qi Cui
Erbao Dong
AI4CE
36
0
0
11 May 2025
Universal Approximation Theorem of Deep Q-Networks
Universal Approximation Theorem of Deep Q-Networks
Qian Qi
45
1
0
04 May 2025
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Jianya Lu
Yingjun Mo
38
0
0
01 May 2025
Low-altitude Friendly-Jamming for Satellite-Maritime Communications via Generative AI-enabled Deep Reinforcement Learning
Jiawei Huang
Aimin Wang
Geng Sun
Jiahui Li
Jiacheng Wang
Dusit Niyato
Victor C. M. Leung
67
0
0
28 Jan 2025
Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics
Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics
Neil De La Fuente
Miquel Noguer i Alonso
Guim Casadellà
45
0
0
31 Dec 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
52
4
0
24 Oct 2024
Process Reward Model with Q-Value Rankings
Process Reward Model with Q-Value Rankings
W. Li
Yixuan Li
LRM
68
15
0
15 Oct 2024
Deflated Dynamics Value Iteration
Deflated Dynamics Value Iteration
Jongmin Lee
Amin Rakhsha
Ernest K. Ryu
Amir-massoud Farahmand
46
2
0
15 Jul 2024
Causal inference through multi-stage learning and doubly robust deep
  neural networks
Causal inference through multi-stage learning and doubly robust deep neural networks
Yuqian Zhang
Jelena Bradic
OOD
CML
32
0
0
11 Jul 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
18
0
05 Jul 2024
An Improved Finite-time Analysis of Temporal Difference Learning with
  Deep Neural Networks
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
37
0
0
07 May 2024
Federated Offline Reinforcement Learning: Collaborative Single-Policy
  Coverage Suffices
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
Jiin Woo
Laixi Shi
Gauri Joshi
Yuejie Chi
OffRL
39
3
0
08 Feb 2024
RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient
  Minimum Radiation Exposure Pathway
RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway
B. Sadhu
Trijit Sadhu
S. Anand
AI4CE
27
0
0
01 Feb 2024
Regularized Q-Learning with Linear Function Approximation
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
52
2
0
26 Jan 2024
BET: Explaining Deep Reinforcement Learning through The Error-Prone
  Decisions
BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions
Xiao Liu
Jie Zhao
Wubing Chen
Mao Tan
Yongxin Su
OffRL
FAtt
35
0
0
14 Jan 2024
Neural Network Approximation for Pessimistic Offline Reinforcement
  Learning
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu
Yuling Jiao
Li Shen
Haizhao Yang
Xiliang Lu
OffRL
43
1
0
19 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
85
5
0
13 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
40
13
0
03 Oct 2023
TD Convergence: An Optimization Perspective
TD Convergence: An Optimization Perspective
Kavosh Asadi
Shoham Sabach
Yao Liu
Omer Gottesman
Rasool Fakoor
MU
25
8
0
30 Jun 2023
Fitted Value Iteration Methods for Bicausal Optimal Transport
Fitted Value Iteration Methods for Bicausal Optimal Transport
Erhan Bayraktar
Bingyan Han
OT
37
6
0
22 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum
  Markov Games: Switching System Approach
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
38
2
0
09 Jun 2023
Agent Performing Autonomous Stock Trading under Good and Bad Situations
Agent Performing Autonomous Stock Trading under Good and Bad Situations
Yunfei Luo
Zhangqi Duan
AIFin
36
0
0
06 Jun 2023
High-probability sample complexities for policy evaluation with linear
  function approximation
High-probability sample complexities for policy evaluation with linear function approximation
Gen Li
Weichen Wu
Yuejie Chi
Cong Ma
Alessandro Rinaldo
Yuting Wei
OffRL
40
7
0
30 May 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving
  Reward Functions
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
Xi Yang
Ge Gao
Min Chi
OffRL
32
2
0
15 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent
  Reinforcement Learning
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
48
3
0
08 May 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function
  Approximation
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
24
0
0
25 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes
Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes
Di Wang
Yao Wang
Shaojie Tang
OffRL
26
1
0
21 Feb 2023
Reinforcement Learning with Function Approximation: From Linear to
  Nonlinear
Reinforcement Learning with Function Approximation: From Linear to Nonlinear
Jihao Long
Jiequn Han
39
5
0
20 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
A Review of Off-Policy Evaluation in Reinforcement Learning
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
49
69
0
13 Dec 2022
CT-DQN: Control-Tutored Deep Reinforcement Learning
CT-DQN: Control-Tutored Deep Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
M. D. Bernardo
24
4
0
02 Dec 2022
Operator Splitting Value Iteration
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
29
14
0
10 Nov 2022
Can maker-taker fees prevent algorithmic cooperation in market making?
Can maker-taker fees prevent algorithmic cooperation in market making?
Bingyan Han
53
1
0
01 Nov 2022
Strategic Decision-Making in the Presence of Information Asymmetry:
  Provably Efficient RL with Algorithmic Instruments
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
36
8
0
23 Aug 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
39
13
0
22 Jul 2022
q-Learning in Continuous Time
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
58
70
0
02 Jul 2022
Analysis of Stochastic Processes through Replay Buffers
Analysis of Stochastic Processes through Replay Buffers
Shirli Di-Castro Shashua
Shie Mannor
Dotan Di-Castro
36
6
0
26 Jun 2022
Stabilizing Q-learning with Linear Architectures for Provably Efficient
  Learning
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
Andrea Zanette
Martin J. Wainwright
OOD
45
5
0
01 Jun 2022
CoNSoLe: Convex Neural Symbolic Learning
CoNSoLe: Convex Neural Symbolic Learning
Haoran Li
Yang Weng
Yangqiu Song
34
9
0
01 Jun 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by
  Reconfigurable Intelligent Surfaces
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
36
88
0
08 May 2022
Chemoreception and chemotaxis of a three-sphere swimmer
Chemoreception and chemotaxis of a three-sphere swimmer
S. Paz
R. Ausas
J. P. Carbajal
G. Buscaglia
18
4
0
05 May 2022
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement
  Learning
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning
Jingqi Li
Donggun Lee
Somayeh Sojoudi
Claire Tomlin
27
11
0
18 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in $Q$-Learning
Target Network and Truncation Overcome The Deadly Triad in QQQ-Learning
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
28
19
0
05 Mar 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
47
9
0
03 Mar 2022
Off-Policy Confidence Interval Estimation with Confounded Markov
  Decision Process
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
C. Shi
Jin Zhu
Ye Shen
Shuang Luo
Hong Zhu
R. Song
OffRL
38
30
0
22 Feb 2022
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation
  in Two-sided Markets
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets
C. Shi
Runzhe Wan
Ge Song
Shuang Luo
R. Song
Hongtu Zhu
OffRL
43
6
0
21 Feb 2022
Off-Policy Fitted Q-Evaluation with Differentiable Function
  Approximators: Z-Estimation and Inference Theory
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang
Xuezhou Zhang
Chengzhuo Ni
Mengdi Wang
OffRL
40
16
0
10 Feb 2022
Understanding Value Decomposition Algorithms in Deep Cooperative
  Multi-Agent Reinforcement Learning
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning
Zehao Dou
J. Kuba
Yaodong Yang
FAtt
27
5
0
10 Feb 2022
12
Next