1502.05477
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
50 / 3,098 papers shown
Title
Control Regularization for Reduced Variance Reinforcement Learning
Richard Cheng
Abhinav Verma
G. Orosz
Swarat Chaudhuri
Yisong Yue
J. W. Burdick
OffRL
28
77
0
14 May 2019
Learning Novel Policies For Tasks
Yunbo Zhang
Wenhao Yu
Greg Turk
14
33
0
13 May 2019
Randomized Adversarial Imitation Learning for Autonomous Driving
Myungjae Shin
Joongheon Kim
34
25
0
13 May 2019
Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised Action Spaces
Craig J. Bester
Steven D. James
George Konidaris
11
57
0
10 May 2019
Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning
Xinyu You
Xuanjie Li
Yuedong Xu
Hui Feng
Jin Zhao
Huaicheng Yan
14
22
0
09 May 2019
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
32
30
0
08 May 2019
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning
Seungyul Han
Y. Sung
OffRL
16
20
0
07 May 2019
Lessons from Contextual Bandit Learning in a Customer Support Bot
Nikos Karampatziakis
Sebastian Kochman
Jade Huang
Paul Mineiro
Kathy Osborne
Weizhu Chen
21
6
0
06 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
29
51
0
05 May 2019
Hierarchical Policy Learning is Sensitive to Goal Space Design
Zach Dwiel
Madhavun Candadai
Mariano Phielipp
Arjun K. Bansal
27
15
0
04 May 2019
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Mingzhang Yin
Yuguang Yue
Mingyuan Zhou
22
23
0
04 May 2019
A Survey on Neural Architecture Search
Martin Wistuba
Ambrish Rawat
Tejaswini Pedapati
AI4CE
28
258
0
04 May 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
25
102
0
03 May 2019
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
29
99
0
02 May 2019
Efficient Model-free Reinforcement Learning in Metric Spaces
Zhao Song
Wen Sun
OffRL
17
39
0
01 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
30
72
0
29 Apr 2019
Deep Neuroevolution of Recurrent and Discrete World Models
S. Risi
Kenneth O. Stanley
OCL
22
53
0
28 Apr 2019
Machine Learning Tips and Tricks for Power Line Communications
Andrea M. Tonello
N. A. Letizia
Davide Righini
Francesco Marcuzzi
16
30
0
24 Apr 2019
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
27
71
0
24 Apr 2019
Stochastic Lipschitz Q-Learning
Xu Zhu
22
4
0
24 Apr 2019
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
11
2
0
24 Apr 2019
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
25
6
0
21 Apr 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
30
262
0
20 Apr 2019
Off-Policy Policy Gradient with State Distribution Correction
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
OffRL
21
67
0
17 Apr 2019
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
31
11
0
17 Apr 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
46
266
0
16 Apr 2019
Reinforcement Learning for Nested Polar Code Construction
Lingchen Huang
Huazi Zhang
Rong Li
Yiqun Ge
Jun Wang
17
14
0
16 Apr 2019
Learning to Navigate in Indoor Environments: from Memorizing to Reasoning
Liulong Ma
Yanjie Liu
Jiao Chen
Dong Jin
24
10
0
15 Apr 2019
A Short Survey On Memory Based Reinforcement Learning
Dhruv Ramani
OffRL
33
17
0
14 Apr 2019
Effective Scheduling Function Design in SDN through Deep Reinforcement Learning
Victoria Huang
Gang Chen
Q. Fu
11
6
0
12 Apr 2019
Energy-Based Continuous Inverse Optimal Control
Yifei Xu
Jianwen Xie
Tianyang Zhao
Chris L. Baker
Yibiao Zhao
Ying Nian Wu
33
19
0
10 Apr 2019
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey
Yoshiharu Sato
OffRL
24
32
0
10 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
22
2
0
08 Apr 2019
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
19
1
0
05 Apr 2019
A Validated Physical Model For Real-Time Simulation of Soft Robotic Snakes
Renato Gasoto
Miles Macklin
Xuan Liu
Yinan Sun
Kenny Erleben
C. Onal
Jie Fu
AI4CE
8
16
0
05 Apr 2019
Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization
Michael Volpp
Lukas P. Frohlich
Kirsten Fischer
Andreas Doerr
Stefan Falkner
Frank Hutter
Christian Daniel
26
84
0
04 Apr 2019
Risk Averse Robust Adversarial Reinforcement Learning
Xinlei Pan
Daniel Seita
Yang Gao
John F. Canny
AAML
16
96
0
31 Mar 2019
Regularizing Trajectory Optimization with Denoising Autoencoders
Rinu Boney
Norman Di Palo
Mathias Berglund
Alexander Ilin
Arno Solin
Antti Rasmus
Harri Valpola
10
10
0
28 Mar 2019
Meta-Learning surrogate models for sequential decision making
Alexandre Galashov
Jonathan Richard Schwarz
Hyunjik Kim
M. Garnelo
D. Saxton
Pushmeet Kohli
S. M. Ali Eslami
Yee Whye Teh
BDL
OffRL
33
26
0
28 Mar 2019
How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?
Q. Vuong
Sharad Vikram
H. Su
Sicun Gao
Henrik I. Christensen
OOD
16
47
0
28 Mar 2019
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
32
28
0
27 Mar 2019
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
30
43
0
27 Mar 2019
Using RGB Image as Visual Input for Mapless Robot Navigation
Liulong Ma
Yanjie Liu
Jiao Chen
SSL
39
17
0
24 Mar 2019
TTR-Based Reward for Reinforcement Learning with Implicit Model Priors
Xubo Lyu
Mo Chen
OffRL
6
3
0
23 Mar 2019
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention
Bharat Prakash
Mohit Khatwani
Nicholas R. Waytowich
T. Mohsenin
OffRL
15
19
0
22 Mar 2019
Flying through a narrow gap using neural network: an end-to-end planning and control approach
Jiarong Lin
Luqi Wang
Fei Gao
Shaojie Shen
Fu Zhang
9
31
0
21 Mar 2019
End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks
Richard Cheng
G. Orosz
R. Murray
J. W. Burdick
31
609
0
21 Mar 2019
Batch Policy Learning under Constraints
Hoang Minh Le
Cameron Voloshin
Yisong Yue
OffRL
14
320
0
20 Mar 2019
Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Sandy H. Huang
Martina Zambelli
Jackie Kay
M. Martins
Yuval Tassa
P. Pilarski
R. Hadsell
31
50
0
20 Mar 2019
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
21
123
0
19 Mar 2019