ResearchTrend.AI
A Closer Look at Deep Policy Gradients

6 November 2018
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
arXiv:1811.02553

Papers citing "A Closer Look at Deep Policy Gradients"

14 / 14 papers shown
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and Landing at Urban Air Mobility Vertiports
Prajit K. Kumar
Jhoel Witter
Steve Paul
Karthik Dantu
Souma Chowdhury
19
3
0
12 Feb 2023
Entropy Augmented Reinforcement Learning
Jianfei Ma
28
0
0
19 Aug 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
26
29
0
10 Feb 2022
Proximal Policy Optimization via Enhanced Exploration Efficiency
Junwei Zhang
Zhenghao Zhang
Shuai Han
Shuai Lu
29
41
0
11 Nov 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
8
11
0
25 Oct 2020
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020
Lyceum: An efficient and scalable ecosystem for robot learning
Colin Summers
Kendall Lowrey
Aravind Rajeswaran
S. Srinivasa
E. Todorov
24
18
0
21 Jan 2020
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
22
145
0
30 Sep 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
22
108
0
25 Jun 2019
An Empirical Study on Hyperparameters and their Interdependence for RL Generalization
Xingyou Song
Yilun Du
Jacob Jackson
AI4CE
21
8
0
02 Jun 2019
Policy Search by Target Distribution Learning for Continuous Control
Chuheng Zhang
Yuanqi Li
Jian Li
19
6
0
27 May 2019
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr
Michael Volpp
Marc Toussaint
Sebastian Trimpe
Christian Daniel
OffRL
24
2
0
14 May 2019
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
32
550
0
12 Oct 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
308
2,890
0
15 Sep 2016