1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
50 / 2,023 papers shown
Title
Learning a Decision Module by Imitating Driver's Control Behaviors
Junning Huang
Sirui Xie
Jiankai Sun
Gary Qiurui Ma
Chunxiao Liu
Jianping Shi
Dahua Lin
Bolei Zhou
86
31
0
30 Nov 2019
IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks
Michael Luo
Jiahao Yao
Richard Liaw
Eric Liang
Ion Stoica
82
15
0
30 Nov 2019
Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization
Qi Zhou
Houqiang Li
Jie Wang
75
17
0
28 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
77
65
0
26 Nov 2019
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
187
692
0
26 Nov 2019
Theory-based Causal Transfer: Integrating Instance-level Induction and Abstract-level Structure Learning
Mark Edmonds
Xiaojian Ma
Siyuan Qi
Yixin Zhu
Hongjing Lu
Song-Chun Zhu
91
27
0
25 Nov 2019
ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems
Bharathan Balaji
Jordan Bell-Masterson
Enes Bilgin
Andreas C. Damianou
Pablo Moreno Garcia
Arpit Jain
Runfei Luo
Alvaro Maggiar
Balakrishnan Narayanaswamy
Chun Jimmie Ye
OffRL
63
32
0
24 Nov 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kaiqing Zhang
Zhuoran Yang
Tamer Basar
256
1,233
0
24 Nov 2019
State Alignment-based Imitation Learning
Fangchen Liu
Z. Ling
Tongzhou Mu
Hao Su
79
93
0
21 Nov 2019
Safe Policies for Reinforcement Learning via Primal-Dual Methods
Santiago Paternain
Miguel Calvo-Fullana
Luiz F. O. Chamon
Alejandro Ribeiro
95
105
0
20 Nov 2019
Bayesian Curiosity for Efficient Exploration in Reinforcement Learning
Tom Blau
Lionel Ott
Fabio Ramos
40
8
0
20 Nov 2019
Decision Making for Autonomous Driving via Augmented Adversarial Inverse Reinforcement Learning
Pin Wang
Dapeng Liu
Jiayu Chen
Hanhan Li
Ching-yao Chan
37
0
0
19 Nov 2019
Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance
Mingxuan Jing
Xiaojian Ma
Wenbing Huang
F. Sun
Chao Yang
Bin Fang
Huaping Liu
88
60
0
16 Nov 2019
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Eivind Bøhn
E. M. Coates
Signe Moe
T. Johansen
70
130
0
13 Nov 2019
Adversarial Examples in Modern Machine Learning: A Review
R. Wiyatno
Anqi Xu
Ousmane Amadou Dia
A. D. Berker
AAML
127
105
0
13 Nov 2019
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
96
63
0
11 Nov 2019
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
91
2
0
11 Nov 2019
Context-aware Active Multi-Step Reinforcement Learning
Gang Chen
Dingcheng Li
Ran Xu
31
0
0
11 Nov 2019
Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy
Xinghua Qu
Zhu Sun
Yew-Soon Ong
Abhishek Gupta
Pengfei Wei
AAML
OffRL
114
35
0
10 Nov 2019
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
104
76
0
10 Nov 2019
Learning to reinforcement learn for Neural Architecture Search
J. Gomez
Joaquin Vanschoren
87
8
0
09 Nov 2019
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
80
57
0
05 Nov 2019
Gradient-based Adaptive Markov Chain Monte Carlo
Michalis K. Titsias
P. Dellaportas
BDL
109
22
0
04 Nov 2019
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
60
40
0
31 Oct 2019
NAT: Neural Architecture Transformer for Accurate and Compact Architectures
Yong Guo
Yin Zheng
Mingkui Tan
Qi Chen
Jian Chen
P. Zhao
Junzhou Huang
173
85
0
31 Oct 2019
Hierarchical Expert Networks for Meta-Learning
Heinke Hihn
Daniel A. Braun
119
4
0
31 Oct 2019
Learning to Manipulate Deformable Objects without Demonstrations
Yilin Wu
Wilson Yan
Thanard Kurutach
Lerrel Pinto
Pieter Abbeel
OffRL
80
202
0
29 Oct 2019
Constrained Reinforcement Learning Has Zero Duality Gap
Santiago Paternain
Luiz F. O. Chamon
Miguel Calvo-Fullana
Alejandro Ribeiro
61
193
0
29 Oct 2019
Feedback Linearization for Unknown Systems via Reinforcement Learning
T. Westenbroek
David Fridovich-Keil
Eric Mazumdar
Shreyas Arora
Valmik Prabhu
S. Shankar Sastry
Claire Tomlin
51
28
0
29 Oct 2019
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
C. Freeman
Luke Metz
David R Ha
93
36
0
29 Oct 2019
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
Maximilian Igl
K. Ciosek
Yingzhen Li
Sebastian Tschiatschek
Cheng Zhang
Sam Devlin
Katja Hofmann
OffRL
95
174
0
28 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
60
27
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
George Andriopoulos
OffRL
125
125
0
27 Oct 2019
Convergent Policy Optimization for Safe Reinforcement Learning
Ming Yu
Zhuoran Yang
Mladen Kolar
Zhaoran Wang
105
96
0
26 Oct 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
132
435
0
25 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
340
1,182
0
24 Oct 2019
IPO: Interior-point Policy Optimization under Constraints
Yongshuai Liu
J. Ding
Xin Liu
95
184
0
21 Oct 2019
Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence
Kaiqing Zhang
Bin Hu
Tamer Basar
97
121
0
21 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
71
80
0
21 Oct 2019
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
83
33
0
21 Oct 2019
A New Framework for Multi-Agent Reinforcement Learning -- Centralized Training and Exploration with Decentralized Execution via Policy Distillation
Gang Chen
64
41
0
21 Oct 2019
RLScheduler: An Automated HPC Batch Job Scheduler Using Reinforcement Learning
Di Zhang
Dong Dai
Youbiao He
F. S. Bao
Bing Xie
OffRL
48
8
0
20 Oct 2019
OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research
Ashish Kumar
Toby Buckley
John B. Lanier
Qiaozhi Wang
A. Kavelaars
Ilya Kuzovkin
OffRL
101
14
0
18 Oct 2019
On Connections between Constrained Optimization and Reinforcement Learning
Nino Vieillard
Olivier Pietquin
Matthieu Geist
43
13
0
18 Oct 2019
Adaptive Trade-Offs in Off-Policy Learning
Mark Rowland
Will Dabney
Rémi Munos
OffRL
142
22
0
16 Oct 2019
Reinforcement Learning for Robotic Manipulation using Simulated Locomotion Demonstrations
Ozsel Kilinc
Giovanni Montana
54
39
0
16 Oct 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
177
108
0
15 Oct 2019
Neural Program Synthesis By Self-Learning
Yifan Xu
Luke Dai
Udaikaran Singh
Kening Zhang
Zhuowen Tu
58
6
0
13 Oct 2019
Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey
Bin Qian
Jie Su
Z. Wen
D. N. Jha
Yinhao Li
...
Albert Y. Zomaya
Omer F. Rana
Lizhe Wang
Maciej Koutny
R. Ranjan
92
4
0
11 Oct 2019
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Siyuan Li
Rui Wang
Minxue Tang
Chongjie Zhang
84
83
0
10 Oct 2019