Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1512.04087
Cited By
v1
v2 (latest)
True Online Temporal-Difference Learning
13 December 2015
H. V. Seijen
A. R. Mahmood
P. Pilarski
Marlos C. Machado
R. Sutton
OffRL
CLL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"True Online Temporal-Difference Learning"
17 / 17 papers shown
Title
Two-Step Q-Learning
Antony Vijesh
Shreyas Sumithra Rudresha
OffRL
88
1
0
02 Jul 2024
DeepADMR: A Deep Learning based Anomaly Detection for MANET Routing
Alex Yahja
Saeed Kaviani
Bo Ryu
Jae H. Kim
Kevin Larson
43
2
0
24 Jan 2023
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
83
13
0
24 Jun 2021
Sim-Env: Decoupling OpenAI Gym Environments from Simulation Models
Andreas Schuderer
Stefano Bromuri
M. V. Eekelen
AI4CE
34
2
0
19 Feb 2021
An Analysis of Frame-skipping in Reinforcement Learning
Shivaram Kalyanakrishnan
Siddharth Aravindan
Vishwajeet Bagdawat
Varun Bhatt
Harshith Goka
Archit Gupta
Kalpesh Krishna
Vihari Piratla
60
20
0
07 Feb 2021
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning
Taisuke Kobayashi
OffRL
44
8
0
23 Aug 2020
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
H. V. Seijen
Hadi Nekoei
Evan Racah
A. Chandar
OffRL
57
14
0
07 Jul 2020
Learning and Planning in Average-Reward Markov Decision Processes
Yi Wan
A. Naik
R. Sutton
OffRL
76
61
0
29 Jun 2020
META-Learning Eligibility Traces for More Sample Efficient Temporal Difference Learning
Mingde Zhao
20
0
0
16 Jun 2020
A reinforcement learning approach to rare trajectory sampling
Dominic C. Rose
Jamie F. Mair
J. P. Garrahan
76
52
0
26 May 2020
Market Making via Reinforcement Learning
Thomas Spooner
John Fearnley
Rahul Savani
Andreas Koukorinis
98
115
0
11 Apr 2018
General Video Game AI: a Multi-Track Framework for Evaluating Agents, Games and Content Generation Algorithms
Diego Perez-Liebana
Jialin Liu
Ahmed Khalifa
Raluca D. Gaina
Julian Togelius
Simon Lucas
135
175
0
28 Feb 2018
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning
Long Yang
Minhao Shi
Qian Zheng
Wenjia Meng
Gang Pan
78
24
0
09 Feb 2018
Multi-step Off-policy Learning Without Importance Sampling Ratios
A. R. Mahmood
Huizhen Yu
R. Sutton
OffRL
143
54
0
09 Feb 2017
Accelerated Gradient Temporal Difference Learning
Yangchen Pan
Adam White
Martha White
40
27
0
28 Nov 2016
Effective Multi-step Temporal-Difference Learning for Non-Linear Function Approximation
H. V. Seijen
44
15
0
18 Aug 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
212
8,889
0
04 Feb 2016
1