ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.09732
  4. Cited By
Making Deep Q-learning methods robust to time discretization

Making Deep Q-learning methods robust to time discretization

28 January 2019
Corentin Tallec
Léonard Blier
Yann Ollivier
    OOD
    OffRL
ArXivPDFHTML

Papers citing "Making Deep Q-learning methods robust to time discretization"

19 / 19 papers shown
Title
Reinforcement Learning for Jump-Diffusions, with Financial Applications
Reinforcement Learning for Jump-Diffusions, with Financial Applications
Xuefeng Gao
Lingfei Li
X. Zhou
48
1
0
08 Jan 2025
Reducing the Cost of Cycle-Time Tuning for Real-World Policy
  Optimization
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
34
5
0
09 May 2023
Managing Temporal Resolution in Continuous Value Estimation: A
  Fundamental Trade-off
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
26
3
0
17 Dec 2022
Dynamic Decision Frequency with Continuous Options
Dynamic Decision Frequency with Continuous Options
Amir-Hossein Karimi
Jun Jin
Jun Luo
A. R. Mahmood
Martin Jägersand
Samuele Tosatto
20
9
0
06 Dec 2022
Square-root regret bounds for continuous-time episodic Markov decision
  processes
Square-root regret bounds for continuous-time episodic Markov decision processes
Xuefeng Gao
X. Zhou
48
6
0
03 Oct 2022
Offline Reinforcement Learning at Multiple Frequencies
Offline Reinforcement Learning at Multiple Frequencies
Kaylee Burns
Tianhe Yu
Chelsea Finn
Karol Hausman
OffRL
35
6
0
26 Jul 2022
q-Learning in Continuous Time
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
58
70
0
02 Jul 2022
Logarithmic regret bounds for continuous-time average-reward Markov
  decision processes
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
Xuefeng Gao
X. Zhou
41
8
0
23 May 2022
Linear convergence of a policy gradient method for some finite horizon
  continuous time control problems
Linear convergence of a policy gradient method for some finite horizon continuous time control problems
C. Reisinger
Wolfgang Stockinger
Yufei Zhang
28
5
0
22 Mar 2022
Policy Gradient and Actor-Critic Learning in Continuous Time and Space:
  Theory and Algorithms
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia
X. Zhou
OffRL
36
79
0
22 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient
  Methods
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
38
23
0
06 Nov 2021
Continuous-Time Fitted Value Iteration for Robust Policies
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
15
9
0
05 Oct 2021
Continuous Homeostatic Reinforcement Learning for Self-Regulated
  Autonomous Agents
Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Hugo Laurençon
Charbel-Raphaël Ségerie
J. Lussange
Boris Gutkin
41
6
0
14 Sep 2021
State-Dependent Temperature Control for Langevin Diffusions
State-Dependent Temperature Control for Langevin Diffusions
Xuefeng Gao
Z. Xu
X. Zhou
36
27
0
15 Nov 2020
POMDPs in Continuous Time and Discrete Spaces
POMDPs in Continuous Time and Discrete Spaces
Bastian Alt
M. Schultheis
Heinz Koeppl
21
9
0
02 Oct 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes
  with Neural ODEs
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
32
50
0
29 Jun 2020
Thinking While Moving: Deep Reinforcement Learning with Concurrent
  Control
Thinking While Moving: Deep Reinforcement Learning with Concurrent Control
Ted Xiao
Eric Jang
Dmitry Kalashnikov
Sergey Levine
Julian Ibarz
Karol Hausman
Alexander Herzog
25
37
0
13 Apr 2020
Real-Time Reinforcement Learning
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
19
62
0
11 Nov 2019
Autoregressive Policies for Continuous Control Deep Reinforcement
  Learning
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
32
28
0
27 Mar 2019
1