Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 3,098 papers shown
Title
Intelligent Middle-Level Game Control
Amin Babadi
Kourosh Naderi
Perttu Hämäläinen
6
5
0
19 Aug 2018
Importance mixing: Improving sample reuse in evolutionary policy search methods
Aloïs Pourchot
Nicolas Perrin
Olivier Sigaud
23
14
0
17 Aug 2018
Risk-Sensitive Generative Adversarial Imitation Learning
Jonathan Lacotte
Mohammad Ghavamzadeh
Yinlam Chow
Marco Pavone
GAN
27
24
0
13 Aug 2018
Visual Sensor Network Reconfiguration with Deep Reinforcement Learning
Paul Jasek
Bernard Abayowa
12
2
0
13 Aug 2018
Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios
Tingxiang Fan
Pinxin Long
Wenxi Liu
Jia Pan
25
69
0
11 Aug 2018
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
21
66
0
09 Aug 2018
Robust Implicit Backpropagation
Francois Fagan
G. Iyengar
14
1
0
07 Aug 2018
Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN
Dror Freirich
Ron Meir
Aviv Tamar
OffRL
27
13
0
06 Aug 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
26
2
0
26 Jul 2018
Variational Bayesian Reinforcement Learning with Regret Bounds
Brendan O'Donoghue
17
40
0
25 Jul 2018
Backprop-Q: Generalized Backpropagation for Stochastic Computation Graphs
Xiaoran Xu
Songpeng Zu
Yuan Zhang
Hanning Zhou
Wei Feng
BDL
16
4
0
25 Jul 2018
CrowdMove: Autonomous Mapless Navigation in Crowded Scenarios
Tingxiang Fan
Xinjing Cheng
Jia Pan
Tianyi Zhou
Ruigang Yang
13
52
0
19 Jul 2018
Deep Reinforcement Learning for Swarm Systems
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
16
196
0
17 Jul 2018
Generative Adversarial Imitation from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
GAN
35
241
0
17 Jul 2018
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms
P. Tavallali
G. Doran
L. Mandrake
18
0
0
16 Jul 2018
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron J. Havens
Zhanhong Jiang
Soumik Sarkar
AAML
24
43
0
16 Jul 2018
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
40
90
0
16 Jul 2018
Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
Wenhao Ding
Shuaijun Li
Huihuan Qian
26
32
0
14 Jul 2018
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Yuping Luo
Huazhe Xu
Yuanzhi Li
Yuandong Tian
Trevor Darrell
Tengyu Ma
OffRL
55
223
0
10 Jul 2018
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
13
799
0
10 Jul 2018
Deterministic Policy Gradients With General State Transitions
Qingpeng Cai
Ling Pan
Pingzhong Tang
OffRL
29
2
0
10 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Chen Liang
Mohammad Norouzi
Jonathan Berant
Quoc V. Le
Ni Lao
19
134
0
06 Jul 2018
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
17
155
0
06 Jul 2018
Variance Reduction for Reinforcement Learning in Input-Driven Environments
Hongzi Mao
S. Venkatakrishnan
Malte Schwarzkopf
Mohammad Alizadeh
OffRL
41
95
0
06 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
32
101
0
04 Jul 2018
Region Growing Curriculum Generation for Reinforcement Learning
Artem Molchanov
Karol Hausman
Stan Birchfield
Gaurav Sukhatme
40
2
0
04 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Xiangxiang Chu
20
9
0
02 Jul 2018
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
42
643
0
01 Jul 2018
Towards Mixed Optimization for Reinforcement Learning with Program Synthesis
Surya Bhupatiraju
Kumar Krishna Agrawal
Rishabh Singh
14
6
0
01 Jul 2018
Bayesian Counterfactual Risk Minimization
Ben London
Ted Sandler
OffRL
11
30
0
29 Jun 2018
Deep Generative Models with Learnable Knowledge Constraints
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Xiaodan Liang
Lianhui Qin
Haoye Dong
Eric Xing
BDL
AI4CE
17
77
0
26 Jun 2018
Learning what you can do before doing anything
Oleh Rybkin
Karl Pertsch
Konstantinos G. Derpanis
Kostas Daniilidis
Andrew Jaegle
SSL
25
21
0
25 Jun 2018
A Tour of Reinforcement Learning: The View from Continuous Control
Benjamin Recht
24
620
0
25 Jun 2018
Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards
Rituraj Kaushik
Konstantinos Chatzilygeroudis
Jean-Baptiste Mouret
31
19
0
25 Jun 2018
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
8
90
0
21 Jun 2018
Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification
Daochang Liu
Tingting Jiang
30
63
0
21 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
35
213
0
20 Jun 2018
Surprising Negative Results for Generative Adversarial Tree Search
Kamyar Azizzadenesheli
Brandon Yang
Weitang Liu
Zachary Chase Lipton
Anima Anandkumar
6
13
0
15 Jun 2018
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
471
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
20
21
0
14 Jun 2018
Configurable Markov Decision Processes
Alberto Maria Metelli
Mirco Mutti
Marcello Restelli
8
36
0
14 Jun 2018
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
Yangchen Pan
Amir-massoud Farahmand
Martha White
S. Nabi
P. Grover
D. Nikovski
51
18
0
13 Jun 2018
Unsupervised Meta-Learning for Reinforcement Learning
Abhishek Gupta
Benjamin Eysenbach
Chelsea Finn
Sergey Levine
SSL
OffRL
54
106
0
12 Jun 2018
PAC-Bayes Control: Learning Policies that Provably Generalize to Novel Environments
Anirudha Majumdar
M. Goldstein
Anoopkumar Sonar
25
18
0
11 Jun 2018
Bayesian Model-Agnostic Meta-Learning
Taesup Kim
Jaesik Yoon
Ousmane Amadou Dia
Sungwoong Kim
Yoshua Bengio
Sungjin Ahn
UQCV
BDL
231
500
0
11 Jun 2018
Implicit Policy for Reinforcement Learning
Yunhao Tang
Shipra Agrawal
19
14
0
10 Jun 2018
A Deep Neural Network Surrogate for High-Dimensional Random Partial Differential Equations
M. A. Nabian
Hadi Meidani
AI4CE
17
100
0
08 Jun 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
SSL
BDL
AIFin
37
142
0
07 Jun 2018
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah P. Hanna
S. Niekum
Peter Stone
OffRL
8
66
0
04 Jun 2018
TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
OffRL
8
19
0
04 Jun 2018
Previous
1
2
3
...
54
55
56
...
60
61
62
Next