Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,009 papers shown
Title
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
417
8,489
0
04 Jan 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
194
1,144
0
02 Jan 2018
f-Divergence constrained policy improvement
Boris Belousov
Jan Peters
61
21
0
29 Dec 2017
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
105
25
0
29 Dec 2017
Boosting the Actor with Dual Critic
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
78
46
0
29 Dec 2017
RLlib: Abstractions for Distributed Reinforcement Learning
Eric Liang
Richard Liaw
Philipp Moritz
Robert Nishihara
Roy Fox
Ken Goldberg
Joseph E. Gonzalez
Michael I. Jordan
Ion Stoica
OffRL
AI4CE
109
175
0
26 Dec 2017
A short variational proof of equivalence between policy gradients and soft Q learning
Pierre Harvey Richemond
B. Maginnis
54
5
0
22 Dec 2017
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
83
131
0
22 Dec 2017
On Wasserstein Reinforcement Learning and the Fokker-Planck equation
Pierre Harvey Richemond
B. Maginnis
78
24
0
19 Dec 2017
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche
P. Trichelair
Rémi Tachet des Combes
OffRL
98
201
0
19 Dec 2017
ES Is More Than Just a Traditional Finite-Difference Approximator
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
110
89
0
18 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
166
697
0
18 Dec 2017
Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
68
93
0
18 Dec 2017
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
120
349
0
18 Dec 2017
A Berkeley View of Systems Challenges for AI
Ion Stoica
Basel Alomair
Raluca A. Popa
D. Patterson
Michael W. Mahoney
...
Joseph E. Gonzalez
Ken Goldberg
A. Ghodsi
David Culler
Pieter Abbeel
87
201
0
15 Dec 2017
Safe Policy Search with Gaussian Process Models
Kyriakos Polymenakos
Alessandro Abate
Stephen J. Roberts
65
4
0
15 Dec 2017
Robust Deep Reinforcement Learning with Adversarial Attacks
Anay Pattanaik
Zhenyi Tang
Shuijing Liu
Gautham Bommannan
Girish Chowdhary
OOD
87
309
0
11 Dec 2017
Noisy Natural Gradient as Variational Inference
Guodong Zhang
Shengyang Sun
David Duvenaud
Roger C. Grosse
ODL
113
212
0
06 Dec 2017
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Peter Henderson
T. Doan
Riashat Islam
David Meger
BDL
85
13
0
06 Dec 2017
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRL
VLM
124
276
0
04 Dec 2017
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
95
10
0
30 Nov 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
66
11
0
30 Nov 2017
Learnings Options End-to-End for Continuous Action Tasks
Martin Klissarov
Pierre-Luc Bacon
J. Harb
Doina Precup
58
55
0
30 Nov 2017
Variational Inference for Gaussian Process Models with Linear Complexity
Ching-An Cheng
Byron Boots
BDL
83
76
0
28 Nov 2017
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
102
127
0
27 Nov 2017
Cascade Attribute Learning Network
Zhuo Xu
Haonan Chang
Masayoshi Tomizuka
50
4
0
24 Nov 2017
How Generative Adversarial Networks and Their Variants Work: An Overview
Yongjun Hong
Uiwon Hwang
Jaeyoon Yoo
Sungroh Yoon
GAN
139
159
0
16 Nov 2017
Advances in Variational Inference
Cheng Zhang
Judith Butepage
Hedvig Kjellström
Stephan Mandt
BDL
241
698
0
15 Nov 2017
Learning Image-Conditioned Dynamics Models for Control of Under-actuated Legged Millirobots
Anusha Nagabandi
Guangzhao Yang
T. Asmar
Ravi Pandya
G. Kahn
Sergey Levine
R. Fearing
AI4CE
63
22
0
14 Nov 2017
Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning
Richard Liaw
S. Krishnan
Animesh Garg
D. Crankshaw
Joseph E. Gonzalez
Ken Goldberg
BDL
83
23
0
04 Nov 2017
Policy Optimization by Genetic Distillation
Tanmay Gangwani
Jian-wei Peng
72
18
0
03 Nov 2017
Regret Minimization for Partially Observable Deep Reinforcement Learning
Peter H. Jin
Kurt Keutzer
Sergey Levine
96
51
0
31 Oct 2017
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
142
757
0
30 Oct 2017
Meta Learning Shared Hierarchies
Kevin Frans
Jonathan Ho
Xi Chen
Pieter Abbeel
John Schulman
86
355
0
26 Oct 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
146
14
0
24 Oct 2017
Asymmetric Actor Critic for Image-Based Robot Learning
Lerrel Pinto
Marcin Andrychowicz
Peter Welinder
Wojciech Zaremba
Pieter Abbeel
OffRL
94
375
0
18 Oct 2017
Stochastic Variance Reduction for Policy Gradient Estimation
Tianbing Xu
Qiang Liu
Jian-wei Peng
67
19
0
17 Oct 2017
Flow: A Modular Learning Framework for Mixed Autonomy Traffic
Cathy Wu
Abdul Rahman Kreidieh
Kanaad Parvate
Eugene Vinitsky
Alexandre M. Bayen
111
162
0
16 Oct 2017
Burn-In Demonstrations for Multi-Modal Imitation Learning
Alex Kuefler
Mykel J. Kochenderfer
62
25
0
13 Oct 2017
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation
Tianhao Zhang
Zoe McCarthy
Owen Jow
Dennis Lee
Xi Chen
Ken Goldberg
Pieter Abbeel
SSL
190
663
0
12 Oct 2017
Emergent Complexity via Multi-Agent Competition
Trapit Bansal
J. Pachocki
Szymon Sidor
Ilya Sutskever
Igor Mordatch
95
392
0
10 Oct 2017
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
Maruan Al-Shedivat
Trapit Bansal
Yuri Burda
Ilya Sutskever
Igor Mordatch
Pieter Abbeel
CLL
90
354
0
10 Oct 2017
Socially Compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning
L. Tai
Jingwei Zhang
Ming-Yuan Liu
Wolfram Burgard
GAN
69
180
0
06 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
114
2,283
0
06 Oct 2017
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight
Yen-Chen Lin
Ming-Yuan Liu
Min Sun
Jia-Bin Huang
AAML
107
49
0
02 Oct 2017
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning
Xiangxiang Chu
Hangjun Ye
74
56
0
01 Oct 2017
Learning a Structured Neural Network Policy for a Hopping Task
Julian Viereck
Jules Kozolinsky
Alexander Herzog
Ludovic Righetti
89
12
0
29 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
203
792
0
28 Sep 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
265
1,107
0
28 Sep 2017
Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
Pinxin Long
Tingxiang Fan
X. Liao
Wenxi Liu
Huatian Zhang
Jia Pan
OOD
121
460
0
28 Sep 2017
Previous
1
2
3
...
36
37
38
39
40
41
Next