Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.04706
Cited By
v1
v2
v3
v4
v5 (latest)
Policy Search in Continuous Action Domains: an Overview
13 March 2018
Olivier Sigaud
F. Stulp
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Policy Search in Continuous Action Domains: an Overview"
50 / 64 papers shown
Title
On Policy Gradients
Mattis Manfred Kämmerer
OffRL
35
14
0
12 Nov 2019
First-order and second-order variants of the gradient descent in a unified framework
Thomas Pierrot
Nicolas Perrin
Olivier Sigaud
ODL
39
7
0
18 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
74
161
0
02 Oct 2018
Importance mixing: Improving sample reuse in evolutionary policy search methods
Aloïs Pourchot
Nicolas Perrin
Olivier Sigaud
44
14
0
17 Aug 2018
Curiosity Driven Exploration of Learned Disentangled Goal Spaces
A. Laversanne-Finot
Alexandre Péré
Pierre-Yves Oudeyer
DRL
68
88
0
04 Jul 2018
Many-Goals Reinforcement Learning
Vivek Veeriah
Junhyuk Oh
Satinder Singh
KELM
65
53
0
22 Jun 2018
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
99
811
0
21 May 2018
Hierarchical Reinforcement Learning with Hindsight
Andrew Levy
Robert Platt
Kate Saenko
72
84
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
114
229
0
21 May 2018
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
62
316
0
19 Mar 2018
Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration
Alexandre Péré
Sébastien Forestier
Olivier Sigaud
Pierre-Yves Oudeyer
SSL
DRL
51
95
0
02 Mar 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
87
448
0
28 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
180
5,204
0
26 Feb 2018
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
P. Chrabaszcz
I. Loshchilov
Frank Hutter
59
100
0
24 Feb 2018
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
61
159
0
14 Feb 2018
Efficient Exploration through Bayesian Deep Q-Networks
Kamyar Azizzadenesheli
Anima Anandkumar
OffRL
BDL
79
163
0
13 Feb 2018
State Representation Learning for Control: An Overview
Timothée Lesort
Natalia Díaz Rodríguez
Jean-François Goudou
David Filliat
OffRL
112
321
0
12 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
314
8,396
0
04 Jan 2018
ES Is More Than Just a Traditional Finite-Difference Approximator
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
72
89
0
18 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
104
692
0
18 Dec 2017
On the Relationship Between the OpenAI Evolution Strategy and Stochastic Gradient Descent
Xingwen Zhang
Jeff Clune
Kenneth O. Stanley
59
58
0
18 Dec 2017
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
70
348
0
18 Dec 2017
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
74
10
0
30 Nov 2017
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
88
743
0
27 Nov 2017
Policy Optimization by Genetic Distillation
Tanmay Gangwani
Jian-wei Peng
44
18
0
03 Nov 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
107
2,268
0
06 Oct 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
118
1,961
0
19 Sep 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
122
2,821
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
57
629
0
17 Aug 2017
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
BDL
OffRL
77
253
0
10 Aug 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
98
1,506
0
21 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
526
19,237
0
20 Jul 2017
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
79
897
0
30 Jun 2017
Parameter Space Noise for Exploration
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
59
597
0
06 Jun 2017
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Bernhard Schölkopf
Sergey Levine
OffRL
76
165
0
01 Jun 2017
Quality and Diversity Optimization: A Unifying Modular Framework
Antoine Cully
Y. Demiris
68
271
0
12 May 2017
Black-Box Data-efficient Policy Search for Robotics
Konstantinos Chatzilygeroudis
R. Rama
Rituraj Kaushik
Dorian Goepp
Vassilis Vassiliades
Jean-Baptiste Mouret
OffRL
65
115
0
21 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
94
1,541
0
10 Mar 2017
FeUdal Networks for Hierarchical Reinforcement Learning
A. Vezhnevets
Simon Osindero
Tom Schaul
N. Heess
Max Jaderberg
David Silver
Koray Kavukcuoglu
FedML
88
907
0
03 Mar 2017
Loss is its own Reward: Self-Supervision for Reinforcement Learning
Evan Shelhamer
Parsa Mahmoudieh
Max Argus
Trevor Darrell
SSL
83
186
0
21 Dec 2016
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
97
982
0
17 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
106
1,229
0
16 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
88
345
0
07 Nov 2016
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
84
140
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
102
762
0
03 Nov 2016
The Option-Critic Architecture
Pierre-Luc Bacon
J. Harb
Doina Precup
OffRL
64
1,088
0
16 Sep 2016
Guided Policy Search as Approximate Mirror Descent
William H. Montgomery
Sergey Levine
72
126
0
15 Jul 2016
Actor-critic versus direct policy search: a comparison based on sample complexity
Arnaud de Froissard de Broissia
Olivier Sigaud
41
12
0
29 Jun 2016
Deep Learning without Poor Local Minima
Kenji Kawaguchi
ODL
221
925
0
23 May 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
84
1,695
0
22 Apr 2016
1
2
Next