Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,009 papers shown
Title
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
127
166
0
01 Mar 2020
Policy-Aware Model Learning for Policy Gradient Methods
Romina Abachi
Mohammad Ghavamzadeh
Amir-massoud Farahmand
77
36
0
28 Feb 2020
Hallucinative Topological Memory for Zero-Shot Visual Planning
Kara Liu
Thanard Kurutach
Christine Tung
Pieter Abbeel
Aviv Tamar
89
48
0
27 Feb 2020
Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control
Ke Lin
Liang Gong
Xudong Li
Te Sun
Binhao Chen
Chengliang Liu
Zhengfeng Zhang
Jian Pu
Junping Zhang
45
8
0
27 Feb 2020
Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning
Tom Jurgenson
Or Avner
E. Groshev
Aviv Tamar
81
41
0
27 Feb 2020
Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework
Ngoc Duy Nguyen
Thanh Thi Nguyen
Hai V. Nguyen
Doug Creighton
S. Nahavandi
169
4
0
27 Feb 2020
A Visual Communication Map for Multi-Agent Deep Reinforcement Learning
Ngoc Duy Nguyen
Thanh Thi Nguyen
Doug Creighton
S. Nahavandi
48
5
0
27 Feb 2020
Mid-flight Propeller Failure Detection and Control of Propeller-deficient Quadcopter using Reinforcement Learning
Rohitkumar Arasanipalai
Aakriti Agrawal
D. Ghose
18
7
0
26 Feb 2020
Whole-Body Control of a Mobile Manipulator using End-to-End Reinforcement Learning
Julien Kindle
Fadri Furrer
Tonci Novkovic
Jen Jen Chung
Roland Siegwart
Juan I. Nieto
OffRL
96
30
0
25 Feb 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
48
11
0
25 Feb 2020
Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval
A. Bhunia
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
Yi-Zhe Song
144
104
0
24 Feb 2020
Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion
Siddhant Gangapurwala
Alexander L. Mitchell
Ioannis Havoutis
77
55
0
22 Feb 2020
Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems
Kaixuan Wei
Angelica Aviles-Rivero
Jingwei Liang
Ying Fu
Carola-Bibiane Schönlieb
Hua Huang
87
105
0
22 Feb 2020
Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
Yuanyi Zhong
Alex Schwing
Jian Peng
DRL
123
5
0
21 Feb 2020
Support-weighted Adversarial Imitation Learning
Ruohan Wang
C. Ciliberto
P. Amadori
Y. Demiris
49
4
0
20 Feb 2020
Automatic Gesture Recognition in Robot-assisted Surgery with Reinforcement Learning and Tree Search
Xiaojie Gao
Yueming Jin
Qi Dou
Pheng-Ann Heng
62
50
0
20 Feb 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
85
283
0
19 Feb 2020
Optimistic Policy Optimization with Bandit Feedback
Yonathan Efroni
Lior Shani
Aviv A. Rosenberg
Shie Mannor
80
90
0
19 Feb 2020
Curriculum in Gradient-Based Meta-Reinforcement Learning
Bhairav Mehta
T. Deleu
Sharath Chandra Raparthy
C. Pal
Liam Paull
108
20
0
19 Feb 2020
Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning
Kaiyi Ji
Junjie Yang
Yingbin Liang
103
50
0
18 Feb 2020
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics
G. Simm
Robert Pinsler
José Miguel Hernández-Lobato
AI4CE
100
85
0
18 Feb 2020
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
Peng Zhang
Jianye Hao
Weixun Wang
Hongyao Tang
Yi Ma
Yihai Duan
Yan Zheng
OffRL
OnRL
60
34
0
18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Shirli Di-Castro Shashua
Shie Mannor
OffRL
76
12
0
17 Feb 2020
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning
Alberto Maria Metelli
Flavio Mazzolini
L. Bisi
Luca Sabbioni
Marcello Restelli
45
41
0
17 Feb 2020
First Order Constrained Optimization in Policy Space
Yiming Zhang
Q. Vuong
George Andriopoulos
46
4
0
16 Feb 2020
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling
Huaqing Xiong
Tengyu Xu
Yingbin Liang
Wei Zhang
79
33
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
Volkan Cevher
103
61
0
14 Feb 2020
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
Yangang Ren
Jingliang Duan
Shengbo Eben Li
Yang Guan
Qi Sun
OffRL
60
30
0
13 Feb 2020
Intrinsic Motivation for Encouraging Synergistic Behavior
Rohan Chitnis
Shubham Tulsiani
Saurabh Gupta
Abhinav Gupta
50
28
0
12 Feb 2020
A Tensor Network Approach to Finite Markov Decision Processes
E. Gillman
Dominic C. Rose
J. P. Garrahan
64
4
0
12 Feb 2020
Regret Bounds for Discounted MDPs
Shuang Liu
H. Su
OffRL
80
19
0
12 Feb 2020
Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis
Haotian Gu
Xin Guo
Xiaoli Wei
Renyuan Xu
121
66
0
10 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
75
35
0
10 Feb 2020
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Victor Campos
Alexander R. Trott
Caiming Xiong
R. Socher
Xavier Giró-i-Nieto
Jordi Torres
OffRL
112
156
0
10 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic
Yuguang Yue
Yunhao Tang
Mingzhang Yin
Mingyuan Yin
OffRL
78
5
0
10 Feb 2020
Adaptive Approximate Policy Iteration
Botao Hao
N. Lazić
Yasin Abbasi-Yadkori
Pooria Joulani
Csaba Szepesvári
94
14
0
08 Feb 2020
BRPO: Batch Residual Policy Optimization
Kentaro Kanamori
Yinlam Chow
Takuya Takagi
Hiroki Arimura
Honglak Lee
Ken Kobayashi
Craig Boutilier
OffRL
236
45
0
08 Feb 2020
Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts
Gilwoo Lee
Brian Hou
Sanjiban Choudhury
S. Srinivasa
BDL
OffRL
69
7
0
07 Feb 2020
DenseCAvoid: Real-time Navigation in Dense Crowds using Anticipatory Behaviors
A. Sathyamoorthy
Jing Liang
Utsav Patel
Tianrui Guan
Rohan Chandra
Tianyi Zhou
67
86
0
07 Feb 2020
Learning Whole-body Motor Skills for Humanoids
Chuanyu Yang
Kai Yuan
W. Merkt
Taku Komura
S. Vijayakumar
Zhibin Li
105
38
0
07 Feb 2020
Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning
Sha Luo
Hamidreza Kasaei
Lambert Schomaker
CLL
92
46
0
07 Feb 2020
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
99
49
0
07 Feb 2020
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning
Fei Ye
Xuxin Cheng
Pin Wang
Ching-yao Chan
Jiucai Zhang
42
100
0
07 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids
Lei Lei
Yue Tan
Glenn Dahlenburg
W. Xiang
K. Zheng
76
71
0
07 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
146
165
0
03 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
367
1,710
0
02 Feb 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
53
4
0
01 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA
Sami Khairy
Prasanna Balaprakash
L. Cai
Y. Cheng
31
73
0
31 Jan 2020
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
Georgios Papoudakis
Stefano V. Albrecht
BDL
DRL
64
29
0
29 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
147
137
0
27 Jan 2020
Previous
1
2
3
...
23
24
25
...
39
40
41
Next