Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.03941
Cited By
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
6 November 2021
Seohong Park
Jaekyeom Kim
Gunhee Kim
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods"
18 / 18 papers shown
Title
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
95
852
0
05 Oct 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
61
51
0
29 Jun 2020
Improving Robustness via Risk Averse Distributional Reinforcement Learning
Rahul Singh
Qinsheng Zhang
Yongxin Chen
OOD
47
43
0
01 May 2020
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning
Alberto Maria Metelli
Flavio Mazzolini
L. Bisi
Luca Sabbioni
Marcello Restelli
35
41
0
17 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
439
42,393
0
03 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
113
1,354
0
03 Dec 2019
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
56
28
0
27 Mar 2019
Making Deep Q-learning methods robust to time discretization
Corentin Tallec
Léonard Blier
Yann Ollivier
OOD
OffRL
31
91
0
28 Jan 2019
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
113
1,460
0
27 Jun 2018
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
73
160
0
01 Dec 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
463
19,006
0
20 Jul 2017
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
OffRL
SSL
114
1,480
0
03 Oct 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
191
8,850
0
04 Feb 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
318
13,234
0
09 Sep 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
104
1,679
0
23 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
84
3,406
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,764
0
19 Feb 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
119
12,223
0
19 Dec 2013
1