Policy Search in Continuous Action Domains: an Overview

13 March 2018

Papers citing "Policy Search in Continuous Action Domains: an Overview"

14 / 64 papers shown

Title
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation Tejas D. Kulkarni Karthik Narasimhan A. Saeedi J. Tenenbaum 71 1,137 0 20 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration S. Gu Timothy Lillicrap Ilya Sutskever Sergey Levine 91 1,013 0 02 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 202 8,875 0 04 Feb 2016
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 223 3,797 0 18 Nov 2015
Deep Reinforcement Learning with Double Q-learning H. V. Hasselt A. Guez David Silver OffRL 170 7,658 0 22 Sep 2015
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 323 13,272 0 09 Sep 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 277 6,793 0 19 Feb 2015
A Unified Perspective on Multi-Domain and Multi-Task Learning Yongxin Yang Timothy M. Hospedales 94 163 0 23 Dec 2014
Robots that can adapt like animals Antoine Cully Jeff Clune Danesh Tarapore Jean-Baptiste Mouret 101 1,038 0 13 Jul 2014
Active Learning of Inverse Models with Intrinsically Motivated Goal Exploration in Robots Adrien Baranes Pierre-Yves Oudeyer 150 442 0 21 Jan 2013
Efficient Natural Evolution Strategies Yi Sun Daan Wierstra Tom Schaul Jürgen Schmidhuber 83 122 0 26 Sep 2012
Path Integral Policy Improvement with Covariance Matrix Adaptation F. Stulp Olivier Sigaud 84 209 0 18 Jun 2012
Infinite-Horizon Policy-Gradient Estimation Jonathan Baxter Peter L. Bartlett 100 812 0 03 Jun 2011
A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning E. Brochu Vlad M. Cora Nando de Freitas GP 138 2,449 0 12 Dec 2010