Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.02907
Cited By
Population-Guided Parallel Policy Search for Reinforcement Learning
9 January 2020
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Population-Guided Parallel Policy Search for Reinforcement Learning"
31 / 31 papers shown
Title
Improving Human-AI Coordination through Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
117
1
0
21 Apr 2025
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
159
1,332
0
30 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
67
182
0
12 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
74
161
0
02 Oct 2018
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
59
251
0
14 Jun 2018
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
61
67
0
25 May 2018
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Maria Dimakopoulou
Ian Osband
Benjamin Van Roy
OffRL
49
25
0
23 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
112
228
0
21 May 2018
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
86
480
0
23 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng
Junhyuk Oh
Satinder Singh
61
205
0
17 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
74
149
0
06 Apr 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
147
741
0
02 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
175
5,187
0
26 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
220
1,600
0
05 Feb 2018
Coordinated Exploration in Concurrent Reinforcement Learning
Maria Dimakopoulou
Benjamin Van Roy
127
42
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
311
8,352
0
04 Jan 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
63
347
0
18 Dec 2017
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
76
127
0
27 Nov 2017
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
79
743
0
27 Nov 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
57
629
0
17 Aug 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
499
19,065
0
20 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
161
553
0
13 Jul 2017
Efficient Parallel Methods for Deep Reinforcement Learning
Alfredo V. Clemente
Humberto Nicolás Castejón Martínez
A. Chandra
46
115
0
13 May 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
92
1,541
0
10 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
108
1,340
0
27 Feb 2017
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
57
259
0
18 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
199
8,859
0
04 Feb 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
320
13,248
0
09 Sep 2015
Massively Parallel Methods for Deep Reinforcement Learning
Arun Nair
Praveen Srinivasan
Sam Blackwell
Cagdas Alcicek
Rory Fearon
...
Stig Petersen
Shane Legg
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
OffRL
AI4CE
GNN
96
503
0
15 Jul 2015
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
315
3,437
0
02 Apr 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,776
0
19 Feb 2015
1