Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,023 papers shown
Title
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning
Marvin Chancán
Michael Milford
SSL
70
5
0
10 Oct 2019
Prescribed Generative Adversarial Networks
Adji Bousso Dieng
Francisco J. R. Ruiz
David M. Blei
Michalis K. Titsias
GAN
DRL
83
62
0
09 Oct 2019
Improving Generalization in Meta Reinforcement Learning using Learned Objectives
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
OffRL
95
119
0
09 Oct 2019
Compatible features for Monotonic Policy Improvement
Marcin Tomczak
Sergio Valcarcel Macua
Enrique Munoz de Cote
Peter Vrancx
OffRL
26
2
0
09 Oct 2019
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
S. Du
Sham Kakade
Ruosong Wang
Lin F. Yang
235
193
0
07 Oct 2019
Self-Paced Contextual Reinforcement Learning
Pascal Klink
Hany Abdulsamad
Boris Belousov
Jan Peters
75
49
0
07 Oct 2019
Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator
James A. Preiss
Sébastien M. R. Arnold
Chengdong Wei
Marius Kloft
OffRL
44
5
0
02 Oct 2019
Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning
Huixin Zhan
Yongcan Cao
66
7
0
02 Oct 2019
Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots
Shresth Verma
A. Mustafa
Gaurav Agarwal
E. Imre
A. Hilton
18
7
0
02 Oct 2019
Scenario Generalization of Data-driven Imitation Models in Crowd Simulation
Gang Qiao
H. Zhou
Mubbasir Kapadia
Sejong Yoon
Vladimir Pavlovic
AI4CE
54
8
0
02 Oct 2019
Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems
Hardik Meisheri
Vinita Baniwal
Nazneen N. Sultana
Balaraman Ravindran
H. Khadilkar
OffRL
33
2
0
01 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
185
571
0
01 Oct 2019
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
108
149
0
30 Sep 2019
Efficient Bimanual Manipulation Using Learned Task Schemas
Rohan Chitnis
Shubham Tulsiani
Saurabh Gupta
Abhinav Gupta
99
75
0
30 Sep 2019
Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning
Yusen Huo
Qinghua Tao
Jianming Hu
49
1
0
30 Sep 2019
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
Linxi Fan
Yuke Zhu
Jiren Zhu
Zihua Liu
Orien Zeng
Anchit Gupta
Joan Creus-Costa
Silvio Savarese
Li Fei-Fei
OffRL
GNN
99
3
0
27 Sep 2019
Interaction-Aware Multi-Agent Reinforcement Learning for Mobile Agents with Individual Goals
Anahita Mohseni-Kabir
David Isele
K. Fujimura
52
12
0
27 Sep 2019
Deep Reinforcement Learning Based Power control for Wireless Multicast Systems
R. Raghu
Pratheek S. Upadhyaya
M. Panju
Vaneet Aggarwal
V. Sharma
32
11
0
27 Sep 2019
Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning
Tianyu Li
Nathan Lambert
Roberto Calandra
Franziska Meier
Akshara Rai
82
40
0
26 Sep 2019
RLBench: The Robot Learning Benchmark & Learning Environment
Stephen James
Z. Ma
David Rovick Arrojo
Andrew J. Davison
SSL
VLM
OffRL
147
563
0
26 Sep 2019
Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation
Huixin Zhan
Yongcan Cao
66
2
0
26 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
121
126
0
26 Sep 2019
MERL: Multi-Head Reinforcement Learning
Yannis Flet-Berliac
Philippe Preux
OffRL
131
13
0
26 Sep 2019
Model Imitation for Model-Based Reinforcement Learning
Yueh-hua Wu
Ting-Han Fan
Peter J. Ramadge
H. Su
OffRL
52
16
0
25 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
80
68
0
25 Sep 2019
Controlling an Autonomous Vehicle with Deep Reinforcement Learning
A. Folkers
Matthias Rick
C. Büskens
73
67
0
24 Sep 2019
Learning an Adaptive Learning Rate Schedule
Zhen Xu
Andrew M. Dai
Jonas Kemp
Luke Metz
75
62
0
20 Sep 2019
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Lantao Yu
Tianhe Yu
Chelsea Finn
Stefano Ermon
OffRL
BDL
66
72
0
20 Sep 2019
How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning?
W. Lewis
Mark Moll
Lydia E. Kavraki
OffRL
50
5
0
20 Sep 2019
Revisit Policy Optimization in Matrix Form
Sitao Luan
Xiao-Wen Chang
Doina Precup
18
1
0
19 Sep 2019
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction
Pan Xu
F. Gao
Quanquan Gu
131
89
0
18 Sep 2019
DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning
Vassilios Tsounis
Mitja Alge
Joonho Lee
Farbod Farshidian
Marco Hutter
82
189
0
18 Sep 2019
Adversarial Attacks and Defenses in Images, Graphs and Text: A Review
Han Xu
Yao Ma
Haochen Liu
Debayan Deb
Hui Liu
Jiliang Tang
Anil K. Jain
AAML
90
680
0
17 Sep 2019
A Review of Tracking, Prediction and Decision Making Methods for Autonomous Driving
Florin Leon
M. Gavrilescu
82
101
0
17 Sep 2019
Biased Estimates of Advantages over Path Ensembles
Lanxin Lei
Zhizhong Li
Dahua Lin
OffRL
31
0
0
15 Sep 2019
Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space
Zac Wellmer
James T. Kwok
31
0
0
15 Sep 2019
VILD: Variational Imitation Learning with Diverse-quality Demonstrations
Voot Tangkaratt
Bo Han
Mohammad Emtiyaz Khan
Masashi Sugiyama
78
20
0
15 Sep 2019
Learning to Collaborate from Simulation for Robot-Assisted Dressing
Alexander Clegg
Zackory M. Erickson
Patrick Grady
Greg Turk
Charles C. Kemp
Chenxi Liu
69
46
0
14 Sep 2019
Node Injection Attacks on Graphs via Reinforcement Learning
Yiwei Sun
Suhang Wang
Xianfeng Tang
Tsung-Yu Hsieh
Vasant Honavar
GNN
AAML
72
45
0
14 Sep 2019
ISL: A novel approach for deep exploration
Lucas Cassano
Ali H. Sayed
62
1
0
13 Sep 2019
Safe Policy Improvement with an Estimated Baseline Policy
T. D. Simão
Romain Laroche
Rémi Tachet des Combes
OffRL
52
22
0
11 Sep 2019
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
Felix Leibfried
Jordi Grau-Moya
79
22
0
11 Sep 2019
Meta-Learning with Implicit Gradients
Aravind Rajeswaran
Chelsea Finn
Sham Kakade
Sergey Levine
268
859
0
10 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
95
85
0
10 Sep 2019
Deterministic Value-Policy Gradients
Qingpeng Cai
L. Pan
Pingzhong Tang
54
1
0
09 Sep 2019
Imitation Learning from Pixel-Level Demonstrations by HashReward
Xin-Qiang Cai
Yao-Xiang Ding
Yuan Jiang
Zhi Zhou
50
10
0
09 Sep 2019
A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots
Nicolai A. Lynnerup
Laura Nolling
Rasmus Hasle
J. Hallam
46
18
0
09 Sep 2019
Learning How to Dynamically Route Autonomous Vehicles on Shared Roads
Daniel A. Lazar
Erdem Biyik
Dorsa Sadigh
Ramtin Pedarsani
99
47
0
09 Sep 2019
Imitation Learning for Human Pose Prediction
Borui Wang
Ehsan Adeli
Hsu-kuang Chiu
De-An Huang
Juan Carlos Niebles
78
99
0
08 Sep 2019
Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning
Wenjie Shi
Shiji Song
Cheng Wu
71
38
0
07 Sep 2019
Previous
1
2
3
...
26
27
28
...
39
40
41
Next