1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing
"Trust Region Policy Optimization"
50 / 3,103 papers shown
Title
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Eivind Bøhn
E. M. Coates
Signe Moe
T. Johansen
28
129
0
13 Nov 2019
Adversarial Examples in Modern Machine Learning: A Review
R. Wiyatno
Anqi Xu
Ousmane Amadou Dia
A. D. Berker
AAML
28
104
0
13 Nov 2019
Accelerating Training in Pommerman with Imitation and Reinforcement Learning
Hardik Meisheri
Omkar Shelke
Richa Verma
H. Khadilkar
11
6
0
12 Nov 2019
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
19
62
0
11 Nov 2019
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
23
2
0
11 Nov 2019
Transfer Value Iteration Networks
Junyi Shen
H. Zhuo
Jin Xu
Bin Zhong
Sinno Jialin Pan
14
7
0
11 Nov 2019
Context-aware Active Multi-Step Reinforcement Learning
Gang Chen
Dingcheng Li
Ran Xu
22
0
0
11 Nov 2019
Value-Added Chemical Discovery Using Reinforcement Learning
Peihong Jiang
Hieu A. Doan
Sandeep Madireddy
R. Assary
Prasanna Balaprakash
14
0
0
10 Nov 2019
Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy
Xinghua Qu
Zhu Sun
Yew-Soon Ong
Abhishek Gupta
Pengfei Wei
AAML
OffRL
17
33
0
10 Nov 2019
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
14
75
0
10 Nov 2019
Learning to reinforcement learn for Neural Architecture Search
J. Gomez
Joaquin Vanschoren
13
8
0
09 Nov 2019
Quinoa: a Q-function You Infer Normalized Over Actions
Jonas Degrave
A. Abdolmaleki
Jost Tobias Springenberg
N. Heess
Martin Riedmiller
21
5
0
05 Nov 2019
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
26
57
0
05 Nov 2019
Gradient-based Adaptive Markov Chain Monte Carlo
Michalis K. Titsias
P. Dellaportas
BDL
46
22
0
04 Nov 2019
Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning
K. Kobayashi
Takato Horii
R. Iwaki
Y. Nagai
Minoru Asada
11
6
0
01 Nov 2019
A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive Disturbances
Wenjie Lu
Dikai Liu
19
0
0
01 Nov 2019
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
30
39
0
31 Oct 2019
NAT: Neural Architecture Transformer for Accurate and Compact Architectures
Yong Guo
Yin Zheng
Mingkui Tan
Qi Chen
Jian Chen
P. Zhao
Junzhou Huang
40
83
0
31 Oct 2019
VASE: Variational Assorted Surprise Exploration for Reinforcement Learning
Haitao Xu
B. McCane
Lech Szymanski
VLM
6
1
0
31 Oct 2019
Hierarchical Expert Networks for Meta-Learning
Heinke Hihn
Daniel A. Braun
38
4
0
31 Oct 2019
Learning to Manipulate Deformable Objects without Demonstrations
Yilin Wu
Wilson Yan
Thanard Kurutach
Lerrel Pinto
Pieter Abbeel
OffRL
31
199
0
29 Oct 2019
Constrained Reinforcement Learning Has Zero Duality Gap
Santiago Paternain
Luiz F. O. Chamon
Miguel Calvo-Fullana
Alejandro Ribeiro
22
190
0
29 Oct 2019
Feedback Linearization for Unknown Systems via Reinforcement Learning
T. Westenbroek
David Fridovich-Keil
Eric Mazumdar
Shreyas Arora
Valmik Prabhu
S. Shankar Sastry
Claire Tomlin
27
28
0
29 Oct 2019
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
C. Freeman
Luke Metz
David R Ha
38
35
0
29 Oct 2019
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
Maximilian Igl
K. Ciosek
Yingzhen Li
Sebastian Tschiatschek
Cheng Zhang
Sam Devlin
Katja Hofmann
OffRL
25
172
0
28 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
22
27
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
Keith Ross
OffRL
35
121
0
27 Oct 2019
Convergent Policy Optimization for Safe Reinforcement Learning
Ming Yu
Zhuoran Yang
Mladen Kolar
Zhaoran Wang
21
94
0
26 Oct 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
24
427
0
25 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
110
1,140
0
24 Oct 2019
Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics
Shuo Li
Osbert Bastani
25
82
0
24 Oct 2019
Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation
Rusheng Zhang
Romain Leteurtre
Benjamin Striner
Ammar S. Alanazi
Abdullah A. Alghafis
Ozan K. Tonguz
14
13
0
23 Oct 2019
IPO: Interior-point Policy Optimization under Constraints
Yongshuai Liu
J. Ding
Xin Liu
29
176
0
21 Oct 2019
Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence
Kai Zhang
Bin Hu
Tamer Basar
29
119
0
21 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
29
79
0
21 Oct 2019
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
37
33
0
21 Oct 2019
A New Framework for Multi-Agent Reinforcement Learning -- Centralized Training and Exploration with Decentralized Execution via Policy Distillation
Gang Chen
22
41
0
21 Oct 2019
RLScheduler: An Automated HPC Batch Job Scheduler Using Reinforcement Learning
Di Zhang
Dong Dai
Youbiao He
F. S. Bao
Bing Xie
OffRL
22
8
0
20 Oct 2019
OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research
Ashish Kumar
Toby Buckley
John B. Lanier
Qiaozhi Wang
A. Kavelaars
Ilya Kuzovkin
OffRL
19
14
0
18 Oct 2019
On Connections between Constrained Optimization and Reinforcement Learning
Nino Vieillard
Olivier Pietquin
Matthieu Geist
14
13
0
18 Oct 2019
Adaptive Trade-Offs in Off-Policy Learning
Mark Rowland
Will Dabney
Rémi Munos
OffRL
30
22
0
16 Oct 2019
Model-Agnostic Meta-Learning using Runge-Kutta Methods
Daniel Jiwoong Im
Yibo Jiang
Nakul Verma
27
4
0
16 Oct 2019
Reinforcement Learning for Robotic Manipulation using Simulated Locomotion Demonstrations
Ozsel Kilinc
Giovanni Montana
25
37
0
16 Oct 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
109
104
0
15 Oct 2019
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Paavo Parmas
Masashi Sugiyama
24
3
0
14 Oct 2019
Neural Program Synthesis By Self-Learning
Yifan Xu
Luke Dai
Udaikaran Singh
Kening Zhang
Zhuowen Tu
29
6
0
13 Oct 2019
Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey
Bin Qian
Jie Su
Z. Wen
D. N. Jha
Yinhao Li
...
Albert Y. Zomaya
Omer F. Rana
Lizhe Wang
Maciej Koutny
R. Ranjan
33
4
0
11 Oct 2019
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Siyuan Li
Rui Wang
Minxue Tang
Chongjie Zhang
23
82
0
10 Oct 2019
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning
Marvin Chancán
Michael Milford
SSL
36
5
0
10 Oct 2019
Prescribed Generative Adversarial Networks
Adji Bousso Dieng
Francisco J. R. Ruiz
David M. Blei
Michalis K. Titsias
GAN
DRL
32
61
0
09 Oct 2019