Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.04706
Cited By
v1
v2
v3
v4
v5 (latest)
Policy Search in Continuous Action Domains: an Overview
13 March 2018
Olivier Sigaud
F. Stulp
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Policy Search in Continuous Action Domains: an Overview"
14 / 64 papers shown
Title
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
71
1,137
0
20 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
91
1,013
0
02 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
202
8,875
0
04 Feb 2016
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
223
3,797
0
18 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,658
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
323
13,272
0
09 Sep 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,793
0
19 Feb 2015
A Unified Perspective on Multi-Domain and Multi-Task Learning
Yongxin Yang
Timothy M. Hospedales
94
163
0
23 Dec 2014
Robots that can adapt like animals
Antoine Cully
Jeff Clune
Danesh Tarapore
Jean-Baptiste Mouret
101
1,038
0
13 Jul 2014
Active Learning of Inverse Models with Intrinsically Motivated Goal Exploration in Robots
Adrien Baranes
Pierre-Yves Oudeyer
150
442
0
21 Jan 2013
Efficient Natural Evolution Strategies
Yi Sun
Daan Wierstra
Tom Schaul
Jürgen Schmidhuber
83
122
0
26 Sep 2012
Path Integral Policy Improvement with Covariance Matrix Adaptation
F. Stulp
Olivier Sigaud
84
209
0
18 Jun 2012
Infinite-Horizon Policy-Gradient Estimation
Jonathan Baxter
Peter L. Bartlett
100
812
0
03 Jun 2011
A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning
E. Brochu
Vlad M. Cora
Nando de Freitas
GP
138
2,449
0
12 Dec 2010
Previous
1
2