1811.02553
A Closer Look at Deep Policy Gradients
6 November 2018
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
Papers citing
"A Closer Look at Deep Policy Gradients"
15 / 15 papers shown
Title
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and Landing at Urban Air Mobility Vertiports
Prajit K. Kumar
Jhoel Witter
Steve Paul
Karthik Dantu
Souma Chowdhury
19
3
0
12 Feb 2023
Entropy Augmented Reinforcement Learning
Jianfei Ma
30
0
0
19 Aug 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
30
29
0
10 Feb 2022
Proximal Policy Optimization via Enhanced Exploration Efficiency
Junwei Zhang
Zhenghao Zhang
Shuai Han
Shuai Lu
29
41
0
11 Nov 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
18
11
0
25 Oct 2020
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020
Lyceum: An efficient and scalable ecosystem for robot learning
Colin Summers
Kendall Lowrey
Aravind Rajeswaran
S. Srinivasa
E. Todorov
24
18
0
21 Jan 2020
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
25
145
0
30 Sep 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
24
108
0
25 Jun 2019
An Empirical Study on Hyperparameters and their Interdependence for RL Generalization
Xingyou Song
Yilun Du
Jacob Jackson
AI4CE
24
8
0
02 Jun 2019
Policy Search by Target Distribution Learning for Continuous Control
Chuheng Zhang
Yuanqi Li
Jian Li
19
6
0
27 May 2019
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr
Michael Volpp
Marc Toussaint
Sebastian Trimpe
Christian Daniel
OffRL
26
2
0
14 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
13
51
0
05 May 2019
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
32
550
0
12 Oct 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
308
2,890
0
15 Sep 2016