Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06347
Cited By
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 6,962 papers shown
Title
RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation
Mehdi Letafati
Yuke Zhu
Animesh Garg
Jonathan Booher
Max Spero
...
John Emmons
Anchit Gupta
Emre Orbay
Silvio Savarese
Li Fei-Fei
OffRL
48
284
0
07 Nov 2018
A Closer Look at Deep Policy Gradients
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
30
50
0
06 Nov 2018
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi
Yijie Guo
Marcin Moczulski
Junhyuk Oh
Neal Wu
Mohammad Norouzi
Honglak Lee
13
73
0
05 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
D. Song
OffRL
18
233
0
29 Oct 2018
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Tianhe Yu
Pieter Abbeel
Sergey Levine
Chelsea Finn
13
68
0
25 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
23
5
0
21 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
8
148
0
21 Oct 2018
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
ELM
33
232
0
18 Oct 2018
Policy Gradient in Partially Observable Environments: Approximation and Convergence
Kamyar Azizzadenesheli
Manish Kumar Bera
Anima Anandkumar
OffRL
30
8
0
18 Oct 2018
Learning Socially Appropriate Robot Approaching Behavior Toward Groups using Deep Reinforcement Learning
Yuan Gao
Fangkai Yang
Martin Frisk
Daniel Hernández
Christopher E. Peters
Ginevra Castellano
27
5
0
16 Oct 2018
ProMP: Proximal Meta-Policy Search
Jonas Rothfuss
Dennis Lee
I. Clavera
Tamim Asfour
Pieter Abbeel
35
209
0
16 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
27
182
0
12 Oct 2018
Policy Transfer with Strategy Optimization
Wenhao Yu
Chenxi Liu
Greg Turk
38
80
0
12 Oct 2018
Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience
Yevgen Chebotar
Ankur Handa
Viktor Makoviychuk
Miles Macklin
J. Issac
Nathan D. Ratliff
Dieter Fox
10
498
0
12 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
45
551
0
12 Oct 2018
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Jiechao Xiong
Qing Wang
Zhuoran Yang
Peng Sun
Lei Han
Yang Zheng
Haobo Fu
Tong Zhang
Ji Liu
Han Liu
37
169
0
10 Oct 2018
Reinforcement Learning for Improving Agent Design
David R Ha
32
124
0
09 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
6
738
0
05 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
AutoLoss: Learning Discrete Schedules for Alternate Optimization
Haowen Xu
Huatian Zhang
Zhiting Hu
Xiaodan Liang
Ruslan Salakhutdinov
Eric Xing
32
30
0
04 Oct 2018
Episodic Curiosity through Reachability
Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
17
267
0
04 Oct 2018
Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids
Yunzhu Li
Jiajun Wu
Russ Tedrake
J. Tenenbaum
Antonio Torralba
PINN
AI4CE
32
389
0
03 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
32
160
0
02 Oct 2018
The Dreaming Variational Autoencoder for Reinforcement Learning Environments
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
DRL
22
17
0
02 Oct 2018
ChainQueen: A Real-Time Differentiable Physical Simulator for Soft Robotics
Yuanming Hu
Jiancheng Liu
Andrew Spielberg
J. Tenenbaum
William T. Freeman
Jiajun Wu
Daniela Rus
Wojciech Matusik
AI4CE
19
261
0
02 Oct 2018
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
Arjun Sharma
Mohit Sharma
Nicholas Rhinehart
Kris Kitani
27
68
0
29 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
33
29
0
27 Sep 2018
Fast Motion Planning for High-DOF Robot Systems Using Hierarchical System Identification
Biao Jia
Zherong Pan
Tianyi Zhou
16
5
0
21 Sep 2018
Adversarial Imitation via Variational Inverse Reinforcement Learning
A. H. Qureshi
Byron Boots
Michael C. Yip
22
61
0
17 Sep 2018
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
30
224
0
14 Sep 2018
Reinforcement Learning in Topology-based Representation for Human Body Movement with Whole Arm Manipulation
Weihao Yuan
Kaiyu Hang
Haoran Song
Danica Kragic
M. Y. Wang
J. A. Stork
14
26
0
12 Sep 2018
Safe Navigation with Human Instructions in Complex Scenes
Zhe Hu
Jia Pan
Tingxiang Fan
Ruigang Yang
Tianyi Zhou
32
28
0
12 Sep 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravcík
Rudolf Kadlec
Michael Bowling
29
64
0
09 Sep 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
35
256
0
09 Sep 2018
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
33
808
0
07 Sep 2018
Importance mixing: Improving sample reuse in evolutionary policy search methods
Aloïs Pourchot
Nicolas Perrin
Olivier Sigaud
15
14
0
17 Aug 2018
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
14
66
0
09 Aug 2018
Learning Actionable Representations from Visual Observations
Debidatta Dwibedi
Jonathan Tompson
Corey Lynch
P. Sermanet
SSL
22
80
0
02 Aug 2018
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
38
1,855
0
01 Aug 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
26
2
0
26 Jul 2018
Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches
Sanyam Kapoor
27
31
0
25 Jul 2018
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron J. Havens
Zhanhong Jiang
S. Sarkar
AAML
16
43
0
16 Jul 2018
Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
Wenhao Ding
Shuaijun Li
Huihuan Qian
24
32
0
14 Jul 2018
Deep Learning in the Wild
Thilo Stadelmann
Mohammadreza Amirian
Ismail Arabaci
M. Arnold
G. Duivesteijn
...
Melanie Geiger
Stefan Lörwald
B. Meier
Katharina Rombach
Lukas Tuggener
24
42
0
13 Jul 2018
Automatically Composing Representation Transformations as a Means for Generalization
Michael Chang
Abhishek Gupta
Sergey Levine
Thomas Griffiths
26
68
0
12 Jul 2018
Variance Reduction for Reinforcement Learning in Input-Driven Environments
Hongzi Mao
S. Venkatakrishnan
Malte Schwarzkopf
Mohammad Alizadeh
OffRL
41
95
0
06 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
32
101
0
04 Jul 2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
43
714
0
03 Jul 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
33
177
0
20 Jun 2018
Previous
1
2
3
...
137
138
139
140
Next