1705.09322
Cited By
Convergent Tree Backup and Retrace with Function Approximation
25 May 2017
Ahmed Touati
Pierre-Luc Bacon
Doina Precup
Pascal Vincent
Papers citing
"Convergent Tree Backup and Retrace with Function Approximation"
9 / 9 papers shown
Title
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
37
0
0
07 May 2024
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Brett Daley
Martha White
Chris Amato
Marlos C. Machado
OffRL
22
3
0
26 Jan 2023
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
27
12
0
29 Jun 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task
Sina Ghiassian
R. Sutton
AAML
OffRL
19
5
0
02 Jun 2021
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Shuang Qiu
Zhuoran Yang
Xiaohan Wei
Jieping Ye
Zhaoran Wang
33
38
0
23 Aug 2020
Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning
Long Yang
Yu Zhang
Qian Zheng
Pengfei Li
Gang Pan
15
1
0
06 Sep 2019
Modified Actor-Critics
Erinc Merdivan
S. Hanke
M. Geist
21
2
0
02 Jul 2019
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
470
0
14 Jun 2018
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation
Jalaj Bhandari
Daniel Russo
Raghav Singal
20
334
0
06 Jun 2018