Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06347
Cited By
v1
v2 (latest)
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 8,517 papers shown
Title
Neural Control Variates for Variance Reduction
Ruosi Wan
Mingjun Zhong
Haoyi Xiong
Zhanxing Zhu
BDL
DRL
81
18
0
01 Jun 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
232
1,286
0
30 May 2018
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
George Andriopoulos
74
20
0
29 May 2018
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
97
546
0
28 May 2018
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
92
68
0
25 May 2018
Object-Oriented Dynamics Predictor
Guangxiang Zhu
Zhiao Huang
Chongjie Zhang
AI4CE
89
35
0
25 May 2018
AutoAugment: Learning Augmentation Policies from Data
E. D. Cubuk
Barret Zoph
Dandelion Mané
Vijay Vasudevan
Quoc V. Le
137
1,775
0
24 May 2018
Verifiable Reinforcement Learning via Policy Extraction
Osbert Bastani
Yewen Pu
Armando Solar-Lezama
OffRL
151
339
0
22 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
132
232
0
21 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
60
11
0
20 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
73
66
0
20 May 2018
Deep Dynamical Modeling and Control of Unsteady Fluid Flows
Jeremy Morton
F. Witherden
A. Jameson
Mykel J. Kochenderfer
AI4CE
74
165
0
18 May 2018
Policy Optimization with Second-Order Advantage Information
Jiajin Li
Baoxiang Wang
39
6
0
09 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
94
42
0
09 May 2018
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Yu-Jhe Li
Hsin-Yu Chang
Yu-Jing Lin
Po-Wei Wu
Y. Wang
GAN
28
5
0
05 May 2018
Decoupling Dynamics and Reward for Transfer Learning
Amy Zhang
Harsh Satija
Joelle Pineau
OOD
78
72
0
27 Apr 2018
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
102
44
0
27 Apr 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Jie Tan
Tingnan Zhang
Erwin Coumans
Atil Iscen
Yunfei Bai
Danijar Hafner
Steven Bohez
Vincent Vanhoucke
102
809
0
27 Apr 2018
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
102
480
0
23 Apr 2018
Vehicle Communication Strategies for Simulated Highway Driving
Cinjon Resnick
I. Kulikov
Kyunghyun Cho
Jason Weston
45
7
0
19 Apr 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
57
22
0
17 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng
Junhyuk Oh
Satinder Singh
71
209
0
17 Apr 2018
Rafiki: Machine Learning as an Analytics Service System
Wei Wang
Sheng Wang
Jinyang Gao
Meihui Zhang
Gang Chen
Teck Khim Ng
Beng Chin Ooi
105
113
0
17 Apr 2018
Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
A. H. Qureshi
Yutaka Nakamura
Yuichiro Yoshikawa
H. Ishiguro
39
59
0
14 Apr 2018
Reinforcement Learning for UAV Attitude Control
W. Koch
R. Mancuso
R. West
Azer Bestavros
79
386
0
11 Apr 2018
Gotta Learn Fast: A New Benchmark for Generalization in RL
Alex Nichol
Vicki Pfau
Christopher Hesse
Oleg Klimov
John Schulman
VLM
OffRL
74
177
0
10 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
83
193
0
09 Apr 2018
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Xue Bin Peng
Pieter Abbeel
Sergey Levine
M. van de Panne
AI4CE
264
499
0
08 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
103
149
0
06 Apr 2018
Information Maximizing Exploration with a Latent Dynamics Model
Trevor Barron
Oliver Obst
H. B. Amor
57
3
0
04 Apr 2018
Renewal Monte Carlo: Renewal theory based reinforcement learning
Jayakumar Subramanian
Aditya Mahajan
41
11
0
03 Apr 2018
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning
Kun Shao
Yuanheng Zhu
Dongbin Zhao
143
172
0
03 Apr 2018
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
79
145
0
02 Apr 2018
Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Zhewei Huang
Shuchang Zhou
...
Sean F. Carroll
Jennifer Hicks
Sergey Levine
M. Salathé
Scott L. Delp
103
88
0
02 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
91
62
0
31 Mar 2018
Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system
Kendall Lowrey
S. Kolev
Jeremy Dao
Aravind Rajeswaran
E. Todorov
80
58
0
28 Mar 2018
Long short-term memory and learning-to-learn in networks of spiking neurons
G. Bellec
Darjan Salaj
Anand Subramoney
Robert Legenstein
Wolfgang Maass
167
490
0
26 Mar 2018
Neuronal Circuit Policies
Mathias Lechner
Ramin M. Hasani
Radu Grosu
29
7
0
22 Mar 2018
Automated Curriculum Learning by Rewarding Temporally Rare Events
Niels Justesen
S. Risi
OffRL
69
20
0
19 Mar 2018
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
72
317
0
19 Mar 2018
Feedback Control For Cassie With Deep Reinforcement Learning
Zhaoming Xie
Glen Berseth
Patrick Clary
J. Hurst
M. van de Panne
82
175
0
15 Mar 2018
Learning to Explore with Meta-Policy Gradient
Tianbing Xu
Qiang Liu
Liang Zhao
Jian Peng
74
54
0
13 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
122
72
0
13 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
131
1,320
0
12 Mar 2018
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL
OnRL
73
136
0
07 Mar 2018
Transfer Learning with Neural AutoML
Catherine Wong
N. Houlsby
Yifeng Lu
Andrea Gesmundo
60
114
0
07 Mar 2018
Discontinuity-Sensitive Optimal Control Learning by Mixture of Experts
Gao Tang
Kris K. Hauser
41
15
0
07 Mar 2018
Smoothed Action Value Functions for Learning Gaussian Policies
Ofir Nachum
Mohammad Norouzi
George Tucker
Dale Schuurmans
88
28
0
06 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
93
116
0
03 Mar 2018
Deep Reinforcement Learning for Join Order Enumeration
Ryan Marcus
Olga Papaemmanouil
97
235
0
28 Feb 2018
Previous
1
2
3
...
168
169
170
171
Next