Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,012 papers shown
Title
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah P. Hanna
S. Niekum
Peter Stone
OffRL
70
68
0
04 Jun 2018
TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
OffRL
71
19
0
04 Jun 2018
Neural Control Variates for Variance Reduction
Ruosi Wan
Mingjun Zhong
Haoyi Xiong
Zhanxing Zhu
BDL
DRL
83
18
0
01 Jun 2018
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
George Andriopoulos
79
20
0
29 May 2018
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
96
125
0
29 May 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
131
94
0
29 May 2018
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
99
546
0
28 May 2018
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
112
57
0
28 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
102
18
0
27 May 2018
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
91
88
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
77
84
0
26 May 2018
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
99
68
0
25 May 2018
Object-Oriented Dynamics Predictor
Guangxiang Zhu
Zhiao Huang
Chongjie Zhang
AI4CE
98
35
0
25 May 2018
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
OffRL
105
187
0
25 May 2018
Maximum Causal Tsallis Entropy Imitation Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
OOD
127
20
0
22 May 2018
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients
Arbaaz Khan
Clark Zhang
Daniel D. Lee
Vijay Kumar
Alejandro Ribeiro
69
30
0
22 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
102
815
0
21 May 2018
Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
119
14
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
139
232
0
21 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
60
11
0
20 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
75
66
0
20 May 2018
Task-Agnostic Meta-Learning for Few-shot Learning
Muhammad Abdullah Jamal
Guo-Jun Qi
M. Shah
99
465
0
20 May 2018
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations
Mark Pfeiffer
Samarth Shukla
M. Turchetta
Cesar Cadena
Andreas Krause
Roland Siegwart
Juan I. Nieto
78
159
0
18 May 2018
GAN Q-learning
T. Doan
Bogdan Mazoure
Clare Lyle
OOD
OffRL
64
19
0
13 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
53
52
0
11 May 2018
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
147
732
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
86
31
0
04 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
122
677
0
02 May 2018
Decoupling Dynamics and Reward for Transfer Learning
Amy Zhang
Harsh Satija
Joelle Pineau
OOD
90
72
0
27 Apr 2018
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
102
44
0
27 Apr 2018
Scalable Bilinear
π
π
π
Learning Using State and Action Features
Yichen Chen
Lihong Li
Mengdi Wang
88
46
0
27 Apr 2018
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
127
480
0
23 Apr 2018
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
160
1,227
0
23 Apr 2018
A Reinforcement Learning Based Approach for Automated Lane Change Maneuvers
Pin Wang
Ching-yao Chan
A. de La Fortelle
96
253
0
21 Apr 2018
The Limits and Potentials of Deep Learning for Robotics
Niko Sünderhauf
Oliver Brock
Walter J. Scheirer
R. Hadsell
Dieter Fox
...
B. Upcroft
Pieter Abbeel
Wolfram Burgard
Michael Milford
Peter Corke
97
530
0
18 Apr 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
59
22
0
17 Apr 2018
An information-theoretic on-line update principle for perception-action coupling
Zhen Peng
Tim Genewein
Felix Leibfried
Daniel A. Braun
94
13
0
16 Apr 2018
Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
A. H. Qureshi
Yutaka Nakamura
Yuichiro Yoshikawa
H. Ishiguro
55
59
0
14 Apr 2018
Reinforcement Learning for UAV Attitude Control
W. Koch
R. Mancuso
R. West
Azer Bestavros
79
386
0
11 Apr 2018
Derivative free optimization via repeated classification
Tatsunori B. Hashimoto
Steve Yadlowsky
John C. Duchi
57
18
0
11 Apr 2018
Policy Gradient With Value Function Approximation For Collective Multiagent Planning
D. Nguyen
Akshat Kumar
H. Lau
96
43
0
09 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
89
193
0
09 Apr 2018
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Xue Bin Peng
Pieter Abbeel
Sergey Levine
M. van de Panne
AI4CE
282
498
0
08 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
118
149
0
06 Apr 2018
Renewal Monte Carlo: Renewal theory based reinforcement learning
Jayakumar Subramanian
Aditya Mahajan
41
11
0
03 Apr 2018
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning
Kun Shao
Yuanheng Zhu
Dongbin Zhao
145
171
0
03 Apr 2018
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
107
68
0
02 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
91
63
0
31 Mar 2018
Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning
Anusha Nagabandi
I. Clavera
Simin Liu
R. Fearing
Pieter Abbeel
Sergey Levine
Chelsea Finn
211
556
0
30 Mar 2018
Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system
Kendall Lowrey
S. Kolev
Jeremy Dao
Aravind Rajeswaran
E. Todorov
87
58
0
28 Mar 2018
Previous
1
2
3
...
34
35
36
...
39
40
41
Next