1502.05477
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
50 / 2,028 papers shown
Title
Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites
J. Luis
Markus Guerster
Iñigo Del Portillo
E. Crawley
B. Cameron
19
18
0
03 Jun 2019
Harnessing Reinforcement Learning for Neural Motion Planning
Tom Jurgenson
Aviv Tamar
OOD
109
65
0
01 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kaiqing Zhang
Zhuoran Yang
Tamer Basar
119
128
0
31 May 2019
Reinforcement Learning Experience Reuse with Policy Residual Representation
Wen-Ji Zhou
Yang Yu
Yingfeng Chen
Kai Guan
Tangjie Lv
Changjie Fan
Zhi Zhou
OffRL
19
2
0
31 May 2019
Advantage Amplification in Slowly Evolving Latent-State Environments
Martin Mladenov
Ofer Meshi
Jayden Ooi
Dale Schuurmans
Craig Boutilier
OffRL
89
9
0
29 May 2019
An Improved Convergence Analysis of Stochastic Variance-Reduced Policy Gradient
Pan Xu
F. Gao
Quanquan Gu
97
97
0
29 May 2019
Adversarial Imitation Learning from Incomplete Demonstrations
Mingfei Sun
Xiaojuan Ma
78
29
0
29 May 2019
Snooping Attacks on Deep Reinforcement Learning
Matthew J. Inkawhich
Yiran Chen
Hai Helen Li
AAML
68
25
0
28 May 2019
Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning
Yufei Wang
Ziju Shen
Zichao Long
Bin Dong
AI4CE
PINN
79
40
0
27 May 2019
Learning latent state representation for speeding up exploration
Giulia Vezzani
Abhishek Gupta
Lorenzo Natale
Pieter Abbeel
68
28
0
27 May 2019
Policy Search by Target Distribution Learning for Continuous Control
Wei Shen
Yuanqi Li
Jian Li
72
6
0
27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
148
122
0
27 May 2019
Provably Efficient Imitation Learning from Observation Alone
Wen Sun
Anirudh Vemula
Byron Boots
J. Andrew Bagnell
176
107
0
27 May 2019
Composing Task-Agnostic Policies with Deep Reinforcement Learning
A. H. Qureshi
Jacob J. Johnson
Yuzhe Qin
Taylor Henderson
Byron Boots
Michael C. Yip
OffRL
76
30
0
25 May 2019
Transferable Cost-Aware Security Policy Implementation for Malware Detection Using Deep Reinforcement Learning
Yoni Birman
Shaked Hindi
Gilad Katz
A. Shabtai
AAML
OffRL
21
2
0
25 May 2019
Adaptive Symmetric Reward Noising for Reinforcement Learning
R. Vivanti
Talya D. Sohlberg-Baris
Shlomo Cohen
Orna Cohen
AAML
23
1
0
24 May 2019
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
Qi Cai
Zhuoran Yang
Jason D. Lee
Zhaoran Wang
68
32
0
24 May 2019
Distributional Policy Optimization: An Alternative Approach for Continuous Control
Chen Tessler
Guy Tennenholtz
Shie Mannor
OffRL
51
44
0
23 May 2019
From semantics to execution: Integrating action planning with reinforcement learning for robotic causal problem-solving
Manfred Eppe
Phuong D. H. Nguyen
S. Wermter
70
42
0
23 May 2019
Imitation Learning from Video by Leveraging Proprioception
F. Torabi
Garrett A. Warnell
Peter Stone
70
35
0
22 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
69
83
0
21 May 2019
MaMiC: Macro and Micro Curriculum for Robotic Reinforcement Learning
Manan Tomar
Akhil Sathuluri
Balaraman Ravindran
38
4
0
17 May 2019
Leveraging exploration in off-policy algorithms via normalizing flows
Bogdan Mazoure
T. Doan
A. Durand
R. Devon Hjelm
Joelle Pineau
OnRL
72
62
0
16 May 2019
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
Ruohan Wang
C. Ciliberto
P. Amadori
Y. Demiris
79
62
0
16 May 2019
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr
Michael Volpp
Marc Toussaint
Sebastian Trimpe
Christian Daniel
OffRL
68
2
0
14 May 2019
Control Regularization for Reduced Variance Reinforcement Learning
Richard Cheng
Abhinav Verma
G. Orosz
Swarat Chaudhuri
Yisong Yue
J. W. Burdick
OffRL
88
80
0
14 May 2019
Learning Novel Policies For Tasks
Yunbo Zhang
Wenhao Yu
Greg Turk
58
34
0
13 May 2019
Randomized Adversarial Imitation Learning for Autonomous Driving
Myungjae Shin
Joongheon Kim
61
25
0
13 May 2019
Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning
Xinyu You
Xuanjie Li
Yuedong Xu
Hui Feng
Jin Zhao
Huaicheng Yan
33
22
0
09 May 2019
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
80
31
0
08 May 2019
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning
Seungyul Han
Y. Sung
OffRL
68
20
0
07 May 2019
Lessons from Contextual Bandit Learning in a Customer Support Bot
Nikos Karampatziakis
Sebastian Kochman
Jade Huang
Paul Mineiro
Kathy Osborne
Weizhu Chen
63
6
0
06 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
93
56
0
05 May 2019
Hierarchical Policy Learning is Sensitive to Goal Space Design
Zach Dwiel
Madhavun Candadai
Mariano Phielipp
Arjun K. Bansal
86
15
0
04 May 2019
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Mingzhang Yin
Yuguang Yue
Mingyuan Zhou
66
23
0
04 May 2019
A Survey on Neural Architecture Search
Martin Wistuba
Ambrish Rawat
Tejaswini Pedapati
AI4CE
105
259
0
04 May 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
80
104
0
03 May 2019
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
71
100
0
02 May 2019
Efficient Model-free Reinforcement Learning in Metric Spaces
Zhao Song
Wen Sun
OffRL
79
39
0
01 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
151
73
0
29 Apr 2019
Deep Neuroevolution of Recurrent and Discrete World Models
S. Risi
Kenneth O. Stanley
OCL
133
53
0
28 Apr 2019
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
106
75
0
24 Apr 2019
Stochastic Lipschitz Q-Learning
Xu Zhu
71
4
0
24 Apr 2019
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
39
6
0
21 Apr 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
80
268
0
20 Apr 2019
Off-Policy Policy Gradient with State Distribution Correction
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
OffRL
161
67
0
17 Apr 2019
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
53
12
0
17 Apr 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
120
267
0
16 Apr 2019
Reinforcement Learning for Nested Polar Code Construction
Lingchen Huang
Huazi Zhang
Rong Li
Yiqun Ge
Jun Wang
34
14
0
16 Apr 2019
Learning to Navigate in Indoor Environments: from Memorizing to Reasoning
Liulong Ma
Yanjie Liu
Jiao Chen
Dong Jin
58
10
0
15 Apr 2019