Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods

6 November 2021
Seohong Park
Jaekyeom Kim
Gunhee Kim

Papers citing "Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods"

18 / 18 papers shown
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
95
852
0
05 Oct 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
61
51
0
29 Jun 2020
Improving Robustness via Risk Averse Distributional Reinforcement Learning
Rahul Singh
Qinsheng Zhang
Yongxin Chen
OOD
47
43
0
01 May 2020
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning
Alberto Maria Metelli
Flavio Mazzolini
L. Bisi
Luca Sabbioni
Marcello Restelli
35
41
0
17 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
439
42,393
0
03 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
113
1,354
0
03 Dec 2019
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
56
28
0
27 Mar 2019
Making Deep Q-learning methods robust to time discretization
Corentin Tallec
Léonard Blier
Yann Ollivier
OOD
OffRL
31
91
0
28 Jan 2019
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
113
1,460
0
27 Jun 2018
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
73
160
0
01 Dec 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
463
19,006
0
20 Jul 2017
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
OffRL
SSL
114
1,480
0
03 Oct 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
191
8,850
0
04 Feb 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
318
13,234
0
09 Sep 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
104
1,679
0
23 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
84
3,406
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,764
0
19 Feb 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
119
12,223
0
19 Dec 2013