Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.00107
Cited By
Revisiting Peng's Q(
λ
λ
λ
) for Modern Reinforcement Learning
27 February 2021
Tadashi Kozuno
Yunhao Tang
Mark Rowland
Rémi Munos
Steven Kapturowski
Will Dabney
Michal Valko
David Abel
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Revisiting Peng's Q($λ$) for Modern Reinforcement Learning"
12 / 12 papers shown
Title
Two-Step Q-Learning
Antony Vijesh
Shreyas Sumithra Rudresha
OffRL
93
1
0
02 Jul 2024
Demystifying the Recency Heuristic in Temporal-Difference Learning
Brett Daley
Marlos C. Machado
Martha White
63
1
0
18 Jun 2024
Off-policy Distributional Q(
λ
λ
λ
): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
54
1
0
08 Feb 2024
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Brett Daley
Martha White
Chris Amato
Marlos C. Machado
OffRL
138
3
0
26 Jan 2023
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation
Vittorio Giammarino
James Queeney
Lucas C. Carstensen
Michael Hasselmo
I. Paschalidis
OffRL
78
5
0
25 Sep 2022
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
Marc G. Bellemare
OffRL
67
12
0
15 Jul 2022
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation for High-Freq Stock Trading
Zitao Song
Xuyang Jin
Chenliang Li
OffRL
AIFin
46
1
0
13 Jun 2022
The Phenomenon of Policy Churn
Tom Schaul
André Barreto
John Quan
Georg Ostrovski
89
28
0
01 Jun 2022
On Credit Assignment in Hierarchical Reinforcement Learning
Joery A. de Vries
Thomas M. Moerland
Aske Plaat
23
0
0
07 Mar 2022
Improving the Efficiency of Off-Policy Reinforcement Learning by Accounting for Past Decisions
Brett Daley
Chris Amato
OffRL
99
1
0
23 Dec 2021
Virtual Replay Cache
Brett Daley
Chris Amato
24
0
0
06 Dec 2021
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Siyang Jiang
Seth Austin Harding
Haibin Wu
Shihua Liao
201
90
0
06 Feb 2021
1