Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.02971
Cited By
Continuous control with deep reinforcement learning
9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Continuous control with deep reinforcement learning"
50 / 3,416 papers shown
Title
A Benchmark Environment Motivated by Industrial Control Problems
D. Hein
Stefan Depeweg
Michel Tokic
Steffen Udluft
A. Hentschel
Thomas Runkler
V. Sterzing
OffRL
58
59
0
27 Sep 2017
Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks
Aditya Gudimella
Ross Story
M. Shaker
Ruofan Kong
Matthew A. Brown
Victor Shnayder
Marcos Campos
37
24
0
20 Sep 2017
Transfer learning from synthetic to real images using variational autoencoders for robotic applications
Tadanobu Inoue
Subhajit Chaudhury
Giovanni De Magistris
Sakyasingha Dasgupta
26
19
0
20 Sep 2017
Automated Cloud Provisioning on AWS using Deep Reinforcement Learning
Zhiguang Wang
C. Gwon
Tim Oates
A. Iezzi
30
23
0
13 Sep 2017
Deep Reinforcement Learning with Surrogate Agent-Environment Interface
Songli Wang
Yutao Jing
25
1
0
12 Sep 2017
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
36
44
0
01 Sep 2017
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
29
207
0
25 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,787
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
24
624
0
17 Aug 2017
Deep Reinforcement Learning for High Precision Assembly Tasks
Tadanobu Inoue
Giovanni De Magistris
Asim Munawar
T. Yokoya
Ryuki Tachibana
24
267
0
14 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
46
966
0
08 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images
Avi Singh
Larry Yang
Sergey Levine
22
23
0
07 Aug 2017
Robust Physical-World Attacks on Deep Learning Models
Kevin Eykholt
Ivan Evtimov
Earlence Fernandes
Yue Liu
Amir Rahmati
Chaowei Xiao
Atul Prakash
Tadayoshi Kohno
D. Song
AAML
20
593
0
27 Jul 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Matej Vecerík
Todd Hester
Jonathan Scholz
Fumin Wang
Olivier Pietquin
Bilal Piot
N. Heess
Thomas Rothörl
Thomas Lampe
Martin Riedmiller
OffRL
38
659
0
27 Jul 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
74
412
0
26 Jul 2017
RAIL: Risk-Averse Imitation Learning
Anirban Santara
A. Naik
Balaraman Ravindran
Dipankar Das
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
30
18
0
20 Jul 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
54
551
0
19 Jul 2017
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
36
438
0
17 Jul 2017
Control of a Quadrotor with Reinforcement Learning
Jemin Hwangbo
Inkyu Sa
Roland Siegwart
Marco Hutter
32
477
0
17 Jul 2017
Robust Imitation of Diverse Behaviors
Ziyun Wang
J. Merel
Scott E. Reed
Greg Wayne
Nando de Freitas
N. Heess
34
195
0
10 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
160
928
0
07 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
178
2,303
0
05 Jul 2017
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
30
889
0
30 Jun 2017
A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem
Zhengyao Jiang
Dixing Xu
Jinjun Liang
OOD
26
344
0
30 Jun 2017
Path Integral Networks: End-to-End Differentiable Optimal Control
Masashi Okada
Luca Rigazio
T. Aoshima
PINN
37
56
0
29 Jun 2017
Learning to Learn: Meta-Critic Networks for Sample Efficient Learning
Flood Sung
Li Zhang
Tao Xiang
Timothy M. Hospedales
Yongxin Yang
OffRL
16
128
0
29 Jun 2017
Expected Policy Gradients
K. Ciosek
Shimon Whiteson
33
57
0
15 Jun 2017
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning
Hangyu Mao
Zhibo Gong
Yan Ni
Zhen Xiao
30
44
0
10 Jun 2017
Unlocking the Potential of Simulators: Design with RL in Mind
Rika Antonova
S. Cruciani
21
2
0
08 Jun 2017
Generalized Value Iteration Networks: Life Beyond Lattices
Sufeng Niu
Siheng Chen
Hanyu Guo
Colin Targonski
M. C. Smith
J. Kovacevic
GNN
27
53
0
08 Jun 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
87
4,425
0
07 Jun 2017
Parameter Space Noise for Exploration
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
31
593
0
06 Jun 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
61
1,309
0
30 May 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Guan-Horng Liu
Avinash Siravuru
Sai P. Selvaraj
Manuela Veloso
George Kantor
25
69
0
30 May 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning
Ashia Wilson
Rebecca Roelofs
Mitchell Stern
Nathan Srebro
Benjamin Recht
ODL
25
1,016
0
23 May 2017
Visual Semantic Planning using Deep Successor Representations
Yuke Zhu
Daniel Gordon
Eric Kolve
Dieter Fox
Li Fei-Fei
Abhinav Gupta
Roozbeh Mottaghi
Ali Farhadi
24
141
0
23 May 2017
Automatic Goal Generation for Reinforcement Learning Agents
Carlos Florensa
David Held
Xinyang Geng
Pieter Abbeel
78
502
0
17 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
78
2,414
0
15 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDL
OffRL
28
117
0
14 May 2017
Metacontrol for Adaptive Imagination-Based Optimization
Jessica B. Hamrick
A. J. Ballard
Razvan Pascanu
Oriol Vinyals
N. Heess
Peter W. Battaglia
27
69
0
07 May 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness
Nikolai Smolyanskiy
A. Kamenev
Jeffrey Smith
Stan Birchfield
44
222
0
07 May 2017
On Improving Deep Reinforcement Learning for POMDPs
Pengfei Zhu
Xin Li
Pascal Poupart
Guanghui Miao
29
123
0
26 Apr 2017
Inception Recurrent Convolutional Neural Network for Object Recognition
Md. Zahangir Alom
Mahmudul Hasan
C. Yakopcic
T. Taha
41
86
0
25 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
23
58
0
15 Apr 2017
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
I. Popov
N. Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej Vecerík
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
OffRL
31
263
0
10 Apr 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
47
360
0
10 Apr 2017
Learning Visual Servoing with Deep Features and Fitted Q-Iteration
Alex X. Lee
Sergey Levine
Pieter Abbeel
SSL
30
73
0
31 Mar 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
35
333
0
29 Mar 2017
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Noe Casas
32
165
0
27 Mar 2017
Combining Neural Networks and Tree Search for Task and Motion Planning in Challenging Environments
Chris Paxton
Vasumathi Raman
Gregory Hager
Marin Kobilarov
35
123
0
22 Mar 2017
Previous
1
2
3
...
66
67
68
69
Next