Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,020 papers shown
Title
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation
Fan Wang
Bo Zhou
Ke Chen
Tingxiang Fan
Xi Zhang
Jiangyong Li
Hao Tian
Jia Pan
68
26
0
15 Nov 2018
Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang
Yuxin Wu
Joelle Pineau
OffRL
OOD
82
69
0
14 Nov 2018
Importance Weighted Evolution Strategies
Victor Campos
Xavier Giró-i-Nieto
Jordi Torres
46
1
0
12 Nov 2018
Learning from Demonstration in the Wild
Bertrand Higy
K. Shiarlis
Xi Chen
Vitaly Kurin
Sudhanshu Kasewa
...
João Gomes
Supratik Paul
F. Oliehoek
João Messias
Shimon Whiteson
93
76
0
08 Nov 2018
Meta-Learning for Multi-objective Reinforcement Learning
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
Pablo G. Cámara
OffRL
78
56
0
08 Nov 2018
Correlation Filter Selection for Visual Tracking Using Reinforcement Learning
Yanchun Xie
Jimin Xiao
Hassan Jameel Asghar
Jeyarajan Thiyagalingam
Dali Kaafar
45
21
0
08 Nov 2018
Deep Reinforcement Learning via L-BFGS Optimization
Chris Paxton
Roummel F. Marcia
OffRL
58
0
0
06 Nov 2018
A Closer Look at Deep Policy Gradients
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
95
51
0
06 Nov 2018
Managing engineering systems with large state and action spaces through deep reinforcement learning
Varun Chandrasekaran
K. Papakonstantinou
AI4CE
80
164
0
05 Nov 2018
Learning to Defend by Learning to Attack
Haoming Jiang
Zhehui Chen
Yuyang Shi
Bo Dai
T. Zhao
108
22
0
03 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
98
139
0
01 Nov 2018
Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning
Mahammad Humayoo
Xueqi Cheng
BDL
OffRL
65
5
0
30 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
Basel Alomair
OffRL
139
239
0
29 Oct 2018
Learning and Management for Internet-of-Things: Accounting for Adaptivity and Scalability
Tianyi Chen
Sergio Barbarossa
Xin Wang
G. Giannakis
Zhi-Li Zhang
78
81
0
27 Oct 2018
Stability-certified reinforcement learning: A control-theoretic perspective
Ming Jin
Javad Lavaei
57
87
0
26 Oct 2018
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions
Lerrel Pinto
Aditya Mandalika
Brian Hou
S. Srinivasa
60
13
0
24 Oct 2018
Inverse reinforcement learning for video games
Aaron David Tucker
Adam Gleave
Stuart J. Russell
72
48
0
24 Oct 2018
Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
K. Srinivasan
Parth Shah
Silvio Savarese
Li Fei-Fei
Animesh Garg
Jeannette Bohg
SSL
112
370
0
24 Oct 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning
Vahid Behzadan
Arslan Munir
93
27
0
23 Oct 2018
Hierarchical Approaches for Reinforcement Learning in Parameterized Action Space
E. Wei
Drew Wicke
S. Luke
BDL
87
35
0
23 Oct 2018
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement
Samuel Neumann
Sungsu Lim
A. Joseph
Yangchen Pan
Adam White
Martha White
128
7
0
22 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
104
149
0
21 Oct 2018
First-order and second-order variants of the gradient descent in a unified framework
Thomas Pierrot
Nicolas Perrin
Olivier Sigaud
ODL
69
7
0
18 Oct 2018
ProMP: Proximal Meta-Policy Search
Jonas Rothfuss
Dennis Lee
I. Clavera
Tamim Asfour
Pieter Abbeel
107
211
0
16 Oct 2018
Predictor-Corrector Policy Optimization
Ching-An Cheng
Xinyan Yan
Nathan D. Ratliff
Byron Boots
OnRL
66
23
0
15 Oct 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
194
144
0
15 Oct 2018
Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost
Henry Zhu
Abhishek Gupta
Aravind Rajeswaran
Sergey Levine
Vikash Kumar
OffRL
118
200
0
14 Oct 2018
Policy Transfer with Strategy Optimization
Wenhao Yu
Chenxi Liu
Greg Turk
101
81
0
12 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
121
570
0
12 Oct 2018
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Jiechao Xiong
Qing Wang
Zhuoran Yang
Peng Sun
Lei Han
Yang Zheng
Haobo Fu
Tong Zhang
Ji Liu
Han Liu
70
173
0
10 Oct 2018
Reinforcement Learning for Improving Agent Design
David R Ha
123
127
0
09 Oct 2018
Fast Context Adaptation via Meta-Learning
L. Zintgraf
K. Shiarlis
Vitaly Kurin
Katja Hofmann
Shimon Whiteson
99
37
0
08 Oct 2018
SFV: Reinforcement Learning of Physical Skills from Videos
Xue Bin Peng
Angjoo Kanazawa
Jitendra Malik
Pieter Abbeel
Sergey Levine
101
65
0
08 Oct 2018
Safe-To-Explore State Spaces: Ensuring Safe Exploration in Policy Search with Hierarchical Task Optimization
Jens Lundell
R. Krug
Erik Schaffernicht
Todor Stoyanov
Ville Kyrki
34
3
0
08 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
76
761
0
05 Oct 2018
Learning Scheduling Algorithms for Data Processing Clusters
Hongzi Mao
Malte Schwarzkopf
S. Venkatakrishnan
Zili Meng
Mohammad Alizadeh
OffRL
111
654
0
03 Oct 2018
Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
102
211
0
02 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
119
161
0
02 Oct 2018
The Dreaming Variational Autoencoder for Reinforcement Learning Environments
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
DRL
41
17
0
02 Oct 2018
Bayesian Policy Optimization for Model Uncertainty
Gilwoo Lee
Brian Hou
Aditya Mandalika
Jeongseok Lee
Sanjiban Choudhury
S. Srinivasa
136
41
0
01 Oct 2018
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Xue Bin Peng
Angjoo Kanazawa
Sam Toyer
Pieter Abbeel
Sergey Levine
129
217
0
01 Oct 2018
Bayesian Transfer Reinforcement Learning with Prior Knowledge Rules
Michalis K. Titsias
Sotirios Nikoloutsopoulos
BDL
OffRL
24
3
0
30 Sep 2018
Using Deep Reinforcement Learning to Learn High-Level Policies on the ATRIAS Biped
Tianyu Li
Akshara Rai
H. Geyer
C. Atkeson
80
51
0
28 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
110
31
0
27 Sep 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
175
679
0
21 Sep 2018
Constrained Exploration and Recovery from Experience Shaping
Tu-Hoa Pham
Giovanni De Magistris
Don Joven Agravante
Subhajit Chaudhury
Asim Munawar
Ryuki Tachibana
50
3
0
21 Sep 2018
IntelligentCrowd: Mobile Crowdsensing via Multi-Agent Reinforcement Learning
Yize Chen
Hao Wang
HAI
26
27
0
20 Sep 2018
Benchmarking Reinforcement Learning Algorithms on Real-World Robots
A. R. Mahmood
D. Korenkevych
Gautham Vasan
W. Ma
James Bergstra
OffRL
96
159
0
20 Sep 2018
TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Peng Sun
Xinghai Sun
Lei Han
Jiechao Xiong
Qing Wang
...
Yang Zheng
Ji Liu
Yongsheng Liu
Han Liu
Tong Zhang
104
75
0
19 Sep 2018
Leveraging Contact Forces for Learning to Grasp
Hamza Merzic
Miroslav Bogdanovic
Daniel Kappler
Ludovic Righetti
Jeannette Bohg
58
44
0
19 Sep 2018
Previous
1
2
3
...
32
33
34
...
39
40
41
Next