ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.04087
  4. Cited By
True Online Temporal-Difference Learning
v1v2 (latest)

True Online Temporal-Difference Learning

13 December 2015
H. V. Seijen
A. R. Mahmood
P. Pilarski
Marlos C. Machado
R. Sutton
    OffRLCLL
ArXiv (abs)PDFHTML

Papers citing "True Online Temporal-Difference Learning"

17 / 17 papers shown
Title
Two-Step Q-Learning
Two-Step Q-Learning
Antony Vijesh
Shreyas Sumithra Rudresha
OffRL
88
1
0
02 Jul 2024
DeepADMR: A Deep Learning based Anomaly Detection for MANET Routing
DeepADMR: A Deep Learning based Anomaly Detection for MANET Routing
Alex Yahja
Saeed Kaviani
Bo Ryu
Jae H. Kim
Kevin Larson
43
2
0
24 Jan 2023
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman
  Operators
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
83
13
0
24 Jun 2021
Sim-Env: Decoupling OpenAI Gym Environments from Simulation Models
Sim-Env: Decoupling OpenAI Gym Environments from Simulation Models
Andreas Schuderer
Stefano Bromuri
M. V. Eekelen
AI4CE
34
2
0
19 Feb 2021
An Analysis of Frame-skipping in Reinforcement Learning
An Analysis of Frame-skipping in Reinforcement Learning
Shivaram Kalyanakrishnan
Siddharth Aravindan
Vishwajeet Bagdawat
Varun Bhatt
Harshith Goka
Archit Gupta
Kalpesh Krishna
Vihari Piratla
60
20
0
07 Feb 2021
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep
  Reinforcement Learning
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning
Taisuke Kobayashi
OffRL
44
8
0
23 Aug 2020
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in
  Reinforcement Learning
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
H. V. Seijen
Hadi Nekoei
Evan Racah
A. Chandar
OffRL
57
14
0
07 Jul 2020
Learning and Planning in Average-Reward Markov Decision Processes
Learning and Planning in Average-Reward Markov Decision Processes
Yi Wan
A. Naik
R. Sutton
OffRL
76
61
0
29 Jun 2020
META-Learning Eligibility Traces for More Sample Efficient Temporal
  Difference Learning
META-Learning Eligibility Traces for More Sample Efficient Temporal Difference Learning
Mingde Zhao
20
0
0
16 Jun 2020
A reinforcement learning approach to rare trajectory sampling
A reinforcement learning approach to rare trajectory sampling
Dominic C. Rose
Jamie F. Mair
J. P. Garrahan
76
52
0
26 May 2020
Market Making via Reinforcement Learning
Market Making via Reinforcement Learning
Thomas Spooner
John Fearnley
Rahul Savani
Andreas Koukorinis
98
115
0
11 Apr 2018
General Video Game AI: a Multi-Track Framework for Evaluating Agents,
  Games and Content Generation Algorithms
General Video Game AI: a Multi-Track Framework for Evaluating Agents, Games and Content Generation Algorithms
Diego Perez-Liebana
Jialin Liu
Ahmed Khalifa
Raluca D. Gaina
Julian Togelius
Simon Lucas
135
175
0
28 Feb 2018
A Unified Approach for Multi-step Temporal-Difference Learning with
  Eligibility Traces in Reinforcement Learning
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning
Long Yang
Minhao Shi
Qian Zheng
Wenjia Meng
Gang Pan
78
24
0
09 Feb 2018
Multi-step Off-policy Learning Without Importance Sampling Ratios
Multi-step Off-policy Learning Without Importance Sampling Ratios
A. R. Mahmood
Huizhen Yu
R. Sutton
OffRL
143
54
0
09 Feb 2017
Accelerated Gradient Temporal Difference Learning
Accelerated Gradient Temporal Difference Learning
Yangchen Pan
Adam White
Martha White
40
27
0
28 Nov 2016
Effective Multi-step Temporal-Difference Learning for Non-Linear
  Function Approximation
Effective Multi-step Temporal-Difference Learning for Non-Linear Function Approximation
H. V. Seijen
44
15
0
18 Aug 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
212
8,889
0
04 Feb 2016
1