1502.05477
Cited By
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing
"Trust Region Policy Optimization"
50 / 2,034 papers shown
Title
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
53
12
0
17 Apr 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
120
267
0
16 Apr 2019
Reinforcement Learning for Nested Polar Code Construction
Lingchen Huang
Huazi Zhang
Rong Li
Yiqun Ge
Jun Wang
34
14
0
16 Apr 2019
Learning to Navigate in Indoor Environments: from Memorizing to Reasoning
Liulong Ma
Yanjie Liu
Jiao Chen
Dong Jin
58
10
0
15 Apr 2019
A Short Survey On Memory Based Reinforcement Learning
Dhruv Ramani
OffRL
71
18
0
14 Apr 2019
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey
Yoshiharu Sato
OffRL
45
32
0
10 Apr 2019
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
39
1
0
05 Apr 2019
A Validated Physical Model For Real-Time Simulation of Soft Robotic Snakes
Renato Gasoto
Miles Macklin
Xuan Liu
Yinan Sun
Kenny Erleben
C. Onal
Jie Fu
AI4CE
30
16
0
05 Apr 2019
Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization
Michael Volpp
Lukas P. Frohlich
Kirsten Fischer
Andreas Doerr
Stefan Falkner
Frank Hutter
Christian Daniel
118
85
0
04 Apr 2019
Meta-Learning surrogate models for sequential decision making
Alexandre Galashov
Jonathan Richard Schwarz
Hyunjik Kim
M. Garnelo
D. Saxton
Pushmeet Kohli
S. M. Ali Eslami
Yee Whye Teh
BDL
OffRL
95
25
0
28 Mar 2019
How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?
Q. Vuong
Sharad Vikram
H. Su
Sicun Gao
Henrik I. Christensen
OOD
84
49
0
28 Mar 2019
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
79
28
0
27 Mar 2019
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
151
43
0
27 Mar 2019
Using RGB Image as Visual Input for Mapless Robot Navigation
Liulong Ma
Yanjie Liu
Jiao Chen
SSL
104
17
0
24 Mar 2019
TTR-Based Reward for Reinforcement Learning with Implicit Model Priors
Xubo Lyu
Mo Chen
OffRL
57
3
0
23 Mar 2019
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention
Bharat Prakash
Mohit Khatwani
Nicholas R. Waytowich
T. Mohsenin
OffRL
58
19
0
22 Mar 2019
Flying through a narrow gap using neural network: an end-to-end planning and control approach
Jiarong Lin
Luqi Wang
Fei Gao
Shaojie Shen
Fu Zhang
58
33
0
21 Mar 2019
End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks
Richard Cheng
G. Orosz
R. Murray
J. W. Burdick
102
626
0
21 Mar 2019
Batch Policy Learning under Constraints
Hoang Minh Le
Cameron Voloshin
Yisong Yue
OffRL
99
336
0
20 Mar 2019
Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Sandy H. Huang
Martina Zambelli
Jackie Kay
M. Martins
Yuval Tassa
P. Pilarski
R. Hadsell
75
51
0
20 Mar 2019
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
98
126
0
19 Mar 2019
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
83
6
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
74
44
0
18 Mar 2019
Adaptive Variance for Changing Sparse-Reward Environments
Xingyu Lin
Pengsheng Guo
Carlos Florensa
David Held
63
6
0
15 Mar 2019
Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning
Raunak P. Bhattacharyya
Derek J. Phillips
Changliu Liu
Jayesh K. Gupta
Katherine Driggs-Campbell
Mykel J. Kochenderfer
AI4CE
62
55
0
14 Mar 2019
Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning
Keita Ota
Devesh K. Jha
Tomoaki Oiki
Mamoru Miura
Takashi Nammoto
D. Nikovski
T. Mariyama
62
27
0
13 Mar 2019
Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
21
0
0
13 Mar 2019
On the Pitfalls of Measuring Emergent Communication
Ryan J. Lowe
Jakob N. Foerster
Y-Lan Boureau
Joelle Pineau
Yann N. Dauphin
146
135
0
12 Mar 2019
Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft
Clément Romac
Vincent Béraud
49
5
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
73
17
0
11 Mar 2019
Orthogonal Estimation of Wasserstein Distances
Mark Rowland
Jiri Hron
Yunhao Tang
K. Choromanski
Tamás Sarlós
Adrian Weller
102
43
0
09 Mar 2019
Improved Robustness and Safety for Autonomous Vehicle Control with Adversarial Reinforcement Learning
Xiaobai Ma
Katherine Driggs-Campbell
Mykel J. Kochenderfer
AAML
55
48
0
08 Mar 2019
Dyna-AIL : Adversarial Imitation Learning by Planning
Vaibhav Saxena
Srinivasan Sivanandan
Pulkit Mathur
51
1
0
08 Mar 2019
Training in Task Space to Speed Up and Guide Reinforcement Learning
Guillaume Bellegarda
Katie Byl
54
19
0
06 Mar 2019
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Florian Richter
Ryan K. Orosco
Michael C. Yip
OffRL
68
82
0
05 Mar 2019
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future
Nan Rosemary Ke
Amanpreet Singh
Ahmed Touati
Anirudh Goyal
Yoshua Bengio
Devi Parikh
Dhruv Batra
78
48
0
05 Mar 2019
Sim-to-Real Transfer for Biped Locomotion
Wenhao Yu
Visak C. V. Kumar
Greg Turk
Chenxi Liu
62
115
0
04 Mar 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Zhou Fan
Ruilong Su
Weinan Zhang
Yong Yu
127
134
0
04 Mar 2019
Reinforcement Learning on Variable Impedance Controller for High-Precision Robotic Assembly
Jianlan Luo
Eugen Solowjow
Chengtao Wen
J. A. Ojea
A. Agogino
Aviv Tamar
Pieter Abbeel
89
177
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
93
61
0
03 Mar 2019
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Xiang Li
Wenhao Yang
Zhihua Zhang
31
2
0
02 Mar 2019
Distributionally Robust Reinforcement Learning
E. Smirnova
Elvis Dohmatob
Jérémie Mary
OffRL
73
60
0
23 Feb 2019
World Discovery Models
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Jean-Bastien Grill
Florent Altché
Rémi Munos
126
26
0
20 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
96
48
0
19 Feb 2019
Sufficiently Accurate Model Learning
Clark Zhang
Arbaaz Khan
Santiago Paternain
Alejandro Ribeiro
40
3
0
19 Feb 2019
DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching
Fereshteh Sadeghi
91
28
0
18 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
72
32
0
18 Feb 2019
Verifiably Safe Off-Model Reinforcement Learning
Nathan Fulton
André Platzer
OffRL
79
67
0
14 Feb 2019
Learn a Prior for RHEA for Better Online Planning
Xinyao Tong
W. Liu
Bin Li
OffRL
109
0
0
14 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
48
9
0
14 Feb 2019