Population-Guided Parallel Policy Search for Reinforcement Learning

9 January 2020

Papers citing "Population-Guided Parallel Policy Search for Reinforcement Learning"

31 / 31 papers shown

Title
Improving Human-AI Coordination through Adversarial Training and Generative Models Paresh Chaudhary Yancheng Liang Daphne Chen S. Du Natasha Jaques 117 1 0 21 Apr 2025
Exploration by Random Network Distillation Yuri Burda Harrison Edwards Amos Storkey Oleg Klimov 159 1,332 0 30 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning Jacky Liang Viktor Makoviychuk Ankur Handa N. Chentanez Miles Macklin Dieter Fox AI4CE 67 182 0 12 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search Aloïs Pourchot Olivier Sigaud 74 161 0 02 Oct 2018
Self-Imitation Learning Junhyuk Oh Yijie Guo Satinder Singh Honglak Lee SSL 59 251 0 14 Jun 2018
Learning Self-Imitating Diverse Policies Tanmay Gangwani Qiang Liu Jian Peng 61 67 0 25 May 2018
Scalable Coordinated Exploration in Concurrent Reinforcement Learning Maria Dimakopoulou Ian Osband Benjamin Van Roy OffRL 49 25 0 23 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning Shauharda Khadka Kagan Tumer 112 228 0 21 May 2018
Distributed Distributional Deterministic Policy Gradients Gabriel Barth-Maron Matthew W. Hoffman David Budden Will Dabney Dan Horgan TB Dhruva Alistair Muldal N. Heess Timothy Lillicrap OffRL 86 480 0 23 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods Zeyu Zheng Junhyuk Oh Satinder Singh 61 205 0 17 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization K. Choromanski Mark Rowland Vikas Sindhwani Richard Turner Adrian Weller 74 149 0 06 Apr 2018
Distributed Prioritized Experience Replay Dan Horgan John Quan David Budden Gabriel Barth-Maron Matteo Hessel H. V. Hasselt David Silver 147 741 0 02 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 175 5,187 0 26 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 220 1,600 0 05 Feb 2018
Coordinated Exploration in Concurrent Reinforcement Learning Maria Dimakopoulou Benjamin Van Roy 127 42 0 05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 311 8,352 0 04 Jan 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents Edoardo Conti Vashisht Madhavan F. Such Joel Lehman Kenneth O. Stanley Jeff Clune 63 347 0 18 Dec 2017
Divide-and-Conquer Reinforcement Learning Dibya Ghosh Avi Singh Aravind Rajeswaran Vikash Kumar Sergey Levine OffRL 76 127 0 27 Nov 2017
Population Based Training of Neural Networks Max Jaderberg Valentin Dalibard Simon Osindero Wojciech M. Czarnecki Jeff Donahue ... Tim Green Iain Dunning Karen Simonyan Chrisantha Fernando Koray Kavukcuoglu 79 743 0 27 Nov 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Yuhuai Wu Elman Mansimov Shun Liao Roger C. Grosse Jimmy Ba OffRL 57 629 0 17 Aug 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 499 19,065 0 20 Jul 2017
Distral: Robust Multitask Reinforcement Learning Yee Whye Teh V. Bapst Wojciech M. Czarnecki John Quan J. Kirkpatrick R. Hadsell N. Heess Razvan Pascanu 161 553 0 13 Jul 2017
Efficient Parallel Methods for Deep Reinforcement Learning Alfredo V. Clemente Humberto Nicolás Castejón Martínez A. Chandra 46 115 0 13 May 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Tim Salimans Jonathan Ho Xi Chen Szymon Sidor Ilya Sutskever 92 1,541 0 10 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies Tuomas Haarnoja Haoran Tang Pieter Abbeel Sergey Levine 108 1,340 0 27 Feb 2017
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU Mohammad Babaeizadeh I. Frosio Stephen Tyree Jason Clemons Jan Kautz OffRL 57 259 0 18 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 199 8,859 0 04 Feb 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 320 13,248 0 09 Sep 2015
Massively Parallel Methods for Deep Reinforcement Learning Arun Nair Praveen Srinivasan Sam Blackwell Cagdas Alcicek Rory Fearon ... Stig Petersen Shane Legg Volodymyr Mnih Koray Kavukcuoglu David Silver OffRL AI4CE GNN 96 503 0 15 Jul 2015
End-to-End Training of Deep Visuomotor Policies Sergey Levine Chelsea Finn Trevor Darrell Pieter Abbeel BDL 315 3,437 0 02 Apr 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 277 6,776 0 19 Feb 2015