Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.00430
Cited By
A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning
1 March 2020
Nhan H. Pham
Lam M. Nguyen
Dzung Phan
Phuong Ha Nguyen
Marten van Dijk
Quoc Tran-Dinh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning"
12 / 12 papers shown
Title
Hybrid Stochastic Gradient Descent Algorithms for Stochastic Nonconvex Optimization
Quoc Tran-Dinh
Nhan H. Pham
Dzung Phan
Lam M. Nguyen
56
56
0
15 May 2019
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path Integrated Differential Estimator
Cong Fang
C. J. Li
Zhouchen Lin
Tong Zhang
85
577
0
04 Jul 2018
A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization
Zhize Li
Jian Li
60
116
0
13 Feb 2018
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
47
626
0
17 Aug 2017
Understanding deep learning requires rethinking generalization
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
306
4,623
0
10 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
187
8,833
0
04 Feb 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
88
3,749
0
20 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
154
7,623
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
294
13,214
0
09 Sep 2015
SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives
Aaron Defazio
Francis R. Bach
Simon Lacoste-Julien
ODL
128
1,823
0
01 Jul 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
114
12,201
0
19 Dec 2013
Infinite-Horizon Policy-Gradient Estimation
Jonathan Baxter
Peter L. Bartlett
88
811
0
03 Jun 2011
1