Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 3,103 papers shown
Title
Improving Generalization in Meta Reinforcement Learning using Learned Objectives
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
OffRL
23
119
0
09 Oct 2019
Compatible features for Monotonic Policy Improvement
Marcin Tomczak
Sergio Valcarcel Macua
Enrique Munoz de Cote
Peter Vrancx
OffRL
11
2
0
09 Oct 2019
Policy Optimization Through Approximate Importance Sampling
Marcin Tomczak
Dongho Kim
Peter Vrancx
Kyungmin Kim
19
4
0
09 Oct 2019
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Vibhavari Dasagi
Jake Bruce
T. Peynot
Jurgen Leitner
25
10
0
09 Oct 2019
Investigation on the generalization of the Sampled Policy Gradient algorithm
Nil Stolt Ansó
OffRL
23
0
0
09 Oct 2019
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
S. Du
Sham Kakade
Ruosong Wang
Lin F. Yang
47
192
0
07 Oct 2019
Multi-step Greedy Reinforcement Learning Algorithms
Manan Tomar
Yonathan Efroni
Mohammad Ghavamzadeh
25
1
0
07 Oct 2019
Self-Paced Contextual Reinforcement Learning
Pascal Klink
Hany Abdulsamad
Boris Belousov
Jan Peters
26
49
0
07 Oct 2019
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang
Yanqiu Wu
Q. Vuong
Keith Ross
24
6
0
05 Oct 2019
Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator
James A. Preiss
Sébastien M. R. Arnold
Chengdong Wei
Marius Kloft
OffRL
19
5
0
02 Oct 2019
Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning
Huixin Zhan
Yongcan Cao
32
7
0
02 Oct 2019
Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots
Shresth Verma
A. Mustafa
Gaurav Agarwal
E. Imre
A. Hilton
4
7
0
02 Oct 2019
Scenario Generalization of Data-driven Imitation Models in Crowd Simulation
Gang Qiao
H. Zhou
Mubbasir Kapadia
Sejong Yoon
Vladimir Pavlovic
AI4CE
20
8
0
02 Oct 2019
Boosting Image Recognition with Non-differentiable Constraints
Xuan Li
Yuchen Lu
Peng Xu
Jizong Peng
Christian Desrosiers
Xue Liu
12
0
0
02 Oct 2019
Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems
Hardik Meisheri
Vinita Baniwal
Nazneen N. Sultana
Balaraman Ravindran
H. Khadilkar
OffRL
26
1
0
01 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
91
546
0
01 Oct 2019
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
38
145
0
30 Sep 2019
Efficient Bimanual Manipulation Using Learned Task Schemas
Rohan Chitnis
Shubham Tulsiani
Saurabh Gupta
Abhinav Gupta
33
73
0
30 Sep 2019
Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning
Yusen Huo
Qinghua Tao
Jianming Hu
29
1
0
30 Sep 2019
Learning from Observations Using a Single Video Demonstration and Human Feedback
S. Gandhi
Tim Oates
T. Mohsenin
Nicholas R. Waytowich
24
1
0
29 Sep 2019
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
Linxi Fan
Yuke Zhu
Jiren Zhu
Zihua Liu
Orien Zeng
Anchit Gupta
Joan Creus-Costa
Silvio Savarese
Li Fei-Fei
OffRL
GNN
46
3
0
27 Sep 2019
Interaction-Aware Multi-Agent Reinforcement Learning for Mobile Agents with Individual Goals
Anahita Mohseni-Kabir
David Isele
K. Fujimura
30
11
0
27 Sep 2019
Deep Reinforcement Learning Based Power control for Wireless Multicast Systems
R. Raghu
Pratheek S. Upadhyaya
M. Panju
Vaneet Aggarwal
V. Sharma
6
11
0
27 Sep 2019
Autonomous Control of a Tendon-driven Robotic Limb with Elastic Elements Reveals that Added Elasticity can Enhance Learning
Ali Marjaninejad
Jie Tan
Francisco J. Valero Cuevas
19
5
0
26 Sep 2019
A Re-classification of Information Seeking Tasks and Their Computational Solutions
Zhiwen Tang
Grace Hui Yang
22
6
0
26 Sep 2019
Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning
Tianyu Li
Nathan Lambert
Roberto Calandra
Franziska Meier
Akshara Rai
36
40
0
26 Sep 2019
RLBench: The Robot Learning Benchmark & Learning Environment
Stephen James
Z. Ma
David Rovick Arrojo
Andrew J. Davison
SSL
VLM
OffRL
62
534
0
26 Sep 2019
Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation
Huixin Zhan
Yongcan Cao
26
2
0
26 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
42
121
0
26 Sep 2019
MERL: Multi-Head Reinforcement Learning
Yannis Flet-Berliac
Philippe Preux
OffRL
27
13
0
26 Sep 2019
Model Imitation for Model-Based Reinforcement Learning
Yueh-hua Wu
Ting-Han Fan
Peter J. Ramadge
H. Su
OffRL
21
16
0
25 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
32
68
0
25 Sep 2019
Controlling an Autonomous Vehicle with Deep Reinforcement Learning
A. Folkers
Matthias Rick
C. Büskens
22
67
0
24 Sep 2019
Constrained Attractor Selection Using Deep Reinforcement Learning
Xue-She Wang
J. Turner
B. Mann
21
35
0
23 Sep 2019
Learning a Control Policy for Fall Prevention on an Assistive Walking Device
Visak C. V. Kumar
Sehoon Ha
Gregory Sawicki
Chenxi Liu
13
15
0
23 Sep 2019
Faster saddle-point optimization for solving large-scale Markov decision processes
Joan Bas-Serrano
Gergely Neu
8
13
0
22 Sep 2019
Learning an Adaptive Learning Rate Schedule
Zhen Xu
Andrew M. Dai
Jonas Kemp
Luke Metz
22
62
0
20 Sep 2019
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Lantao Yu
Tianhe Yu
Chelsea Finn
Stefano Ermon
OffRL
BDL
19
71
0
20 Sep 2019
How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning?
W. Lewis
Mark Moll
Lydia E. Kavraki
OffRL
16
5
0
20 Sep 2019
Revisit Policy Optimization in Matrix Form
Sitao Luan
Xiao-Wen Chang
Doina Precup
11
1
0
19 Sep 2019
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction
Pan Xu
F. Gao
Quanquan Gu
44
85
0
18 Sep 2019
DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning
Vassilios Tsounis
Mitja Alge
Joonho Lee
Farbod Farshidian
Marco Hutter
33
184
0
18 Sep 2019
A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming
Daoming Lyu
Fangkai Yang
Bo Liu
Steven M. Gustafson
14
1
0
18 Sep 2019
Adversarial Attacks and Defenses in Images, Graphs and Text: A Review
Han Xu
Yao Ma
Haochen Liu
Debayan Deb
Hui Liu
Jiliang Tang
Anil K. Jain
AAML
39
671
0
17 Sep 2019
A Review of Tracking, Prediction and Decision Making Methods for Autonomous Driving
Florin Leon
M. Gavrilescu
30
97
0
17 Sep 2019
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
T. Doan
Bogdan Mazoure
Moloud Abdar
A. Durand
Joelle Pineau
R. Devon Hjelm
26
15
0
17 Sep 2019
Biased Estimates of Advantages over Path Ensembles
Lanxin Lei
Zhizhong Li
Dahua Lin
OffRL
11
0
0
15 Sep 2019
Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space
Zac Wellmer
James T. Kwok
14
0
0
15 Sep 2019
VILD: Variational Imitation Learning with Diverse-quality Demonstrations
Voot Tangkaratt
Bo Han
Mohammad Emtiyaz Khan
Masashi Sugiyama
30
20
0
15 Sep 2019
Learning to Collaborate from Simulation for Robot-Assisted Dressing
Alexander Clegg
Zackory M. Erickson
Patrick Grady
Greg Turk
Charles C. Kemp
Chenxi Liu
32
46
0
14 Sep 2019
Previous
1
2
3
...
45
46
47
...
61
62
63
Next