ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms
v1v2 (latest)

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 8,588 papers shown
Title
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Qiming Zou
Ling Wang
K. Lu
Yu Li
OffRL
52
0
0
09 Nov 2018
Learning from Demonstration in the Wild
Learning from Demonstration in the Wild
Bertrand Higy
K. Shiarlis
Xi Chen
Vitaly Kurin
Sudhanshu Kasewa
...
João Gomes
Supratik Paul
F. Oliehoek
João Messias
Shimon Whiteson
89
76
0
08 Nov 2018
Meta-Learning for Multi-objective Reinforcement Learning
Meta-Learning for Multi-objective Reinforcement Learning
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
Pablo G. Cámara
OffRL
71
55
0
08 Nov 2018
Correlation Filter Selection for Visual Tracking Using Reinforcement
  Learning
Correlation Filter Selection for Visual Tracking Using Reinforcement Learning
Yanchun Xie
Jimin Xiao
Hassan Jameel Asghar
Jeyarajan Thiyagalingam
Dali Kaafar
45
21
0
08 Nov 2018
RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through
  Imitation
RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation
Mehdi Letafati
Yuke Zhu
Animesh Garg
Jonathan Booher
Max Spero
...
John Emmons
Anchit Gupta
Emre Orbay
Silvio Savarese
Li Fei-Fei
OffRL
100
293
0
07 Nov 2018
A Closer Look at Deep Policy Gradients
A Closer Look at Deep Policy Gradients
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
87
51
0
06 Nov 2018
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural
  Networks
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
134
68
0
05 Nov 2018
Contingency-Aware Exploration in Reinforcement Learning
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi
Yijie Guo
Marcin Moczulski
Junhyuk Oh
Neal Wu
Mohammad Norouzi
Honglak Lee
80
73
0
05 Nov 2018
Temporal Regularization in Markov Decision Process
Temporal Regularization in Markov Decision Process
Pierre Thodoroff
A. Durand
Joelle Pineau
Doina Precup
84
15
0
01 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
85
139
0
01 Nov 2018
Exploration by Random Network Distillation
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
163
1,345
0
30 Oct 2018
Relative Importance Sampling For Off-Policy Actor-Critic in Deep
  Reinforcement Learning
Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning
Mahammad Humayoo
Xueqi Cheng
BDLOffRL
26
5
0
30 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
Basel Alomair
OffRL
124
238
0
29 Oct 2018
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Tianhe Yu
Pieter Abbeel
Sergey Levine
Chelsea Finn
67
68
0
25 Oct 2018
Fast Neural Architecture Search of Compact Semantic Segmentation Models
  via Auxiliary Cells
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells
Vladimir Nekrasov
Hao Chen
Chunhua Shen
Ian Reid
SSeg
88
150
0
25 Oct 2018
Inverse reinforcement learning for video games
Inverse reinforcement learning for video games
Aaron David Tucker
Adam Gleave
Stuart J. Russell
64
48
0
24 Oct 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep
  Reinforcement Learning
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning
Vahid Behzadan
Arslan Munir
80
27
0
23 Oct 2018
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy
  Improvement
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement
Samuel Neumann
Sungsu Lim
A. Joseph
Yangchen Pan
Adam White
Martha White
117
7
0
22 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
51
5
0
21 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent
  Environments
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
79
149
0
21 Oct 2018
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language
  Learning
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
ELM
126
241
0
18 Oct 2018
Learning Socially Appropriate Robot Approaching Behavior Toward Groups
  using Deep Reinforcement Learning
Learning Socially Appropriate Robot Approaching Behavior Toward Groups using Deep Reinforcement Learning
Yuan Gao
Fangkai Yang
Martin Frisk
Daniel Hernández
Christopher E. Peters
Ginevra Castellano
29
5
0
16 Oct 2018
ProMP: Proximal Meta-Policy Search
ProMP: Proximal Meta-Policy Search
Jonas Rothfuss
Dennis Lee
I. Clavera
Tamim Asfour
Pieter Abbeel
82
211
0
16 Oct 2018
Deep Reinforcement Learning
Deep Reinforcement Learning
Yuxi Li
VLMOffRL
191
144
0
15 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement
  Learning
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
88
183
0
12 Oct 2018
Policy Transfer with Strategy Optimization
Policy Transfer with Strategy Optimization
Wenhao Yu
Chenxi Liu
Greg Turk
90
81
0
12 Oct 2018
Closing the Sim-to-Real Loop: Adapting Simulation Randomization with
  Real World Experience
Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience
Yevgen Chebotar
Ankur Handa
Viktor Makoviychuk
Miles Macklin
J. Issac
Nathan D. Ratliff
Dieter Fox
129
508
0
12 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
119
568
0
12 Oct 2018
Parametrized Deep Q-Networks Learning: Reinforcement Learning with
  Discrete-Continuous Hybrid Action Space
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Jiechao Xiong
Qing Wang
Zhuoran Yang
Peng Sun
Lei Han
Yang Zheng
Haobo Fu
Tong Zhang
Ji Liu
Han Liu
60
172
0
10 Oct 2018
Continual State Representation Learning for Reinforcement Learning using
  Generative Replay
Continual State Representation Learning for Reinforcement Learning using Generative Replay
Hugo Caselles-Dupré
Michael Garcia Ortiz
David Filliat
BDLCLL
73
19
0
09 Oct 2018
Realizing Learned Quadruped Locomotion Behaviors through Kinematic
  Motion Primitives
Realizing Learned Quadruped Locomotion Behaviors through Kinematic Motion Primitives
Abhik Singla
Shounak Bhattacharya
Dhaivat Dholakiya
S. Bhatnagar
A. Ghosal
B. Amrutur
Shishir Kolathaya
44
21
0
09 Oct 2018
Reinforcement Learning for Improving Agent Design
Reinforcement Learning for Improving Agent Design
David R Ha
106
127
0
09 Oct 2018
SFV: Reinforcement Learning of Physical Skills from Videos
SFV: Reinforcement Learning of Physical Skills from Videos
Xue Bin Peng
Angjoo Kanazawa
Jitendra Malik
Pieter Abbeel
Sergey Levine
94
65
0
08 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
74
757
0
05 Oct 2018
Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent
  Optimization in Policy Gradient Methods
Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods
Peter Henderson
Joshua Romoff
Joelle Pineau
86
34
0
05 Oct 2018
AutoLoss: Learning Discrete Schedules for Alternate Optimization
AutoLoss: Learning Discrete Schedules for Alternate Optimization
Haowen Xu
Huatian Zhang
Zhiting Hu
Xiaodan Liang
Ruslan Salakhutdinov
Eric Xing
78
30
0
04 Oct 2018
Episodic Curiosity through Reachability
Episodic Curiosity through Reachability
Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
75
271
0
04 Oct 2018
Reinforcement Learning Meets Hybrid Zero Dynamics: A Case Study for
  RABBIT
Reinforcement Learning Meets Hybrid Zero Dynamics: A Case Study for RABBIT
Guillermo A. Castillo
Bowen Weng
Ayonga Hereid
Wei Zhang
48
23
0
03 Oct 2018
Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable
  Objects, and Fluids
Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids
Yunzhu Li
Jiajun Wu
Russ Tedrake
J. Tenenbaum
Antonio Torralba
PINNAI4CE
104
399
0
03 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy
  search
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
97
161
0
02 Oct 2018
The Dreaming Variational Autoencoder for Reinforcement Learning
  Environments
The Dreaming Variational Autoencoder for Reinforcement Learning Environments
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
DRL
41
17
0
02 Oct 2018
ChainQueen: A Real-Time Differentiable Physical Simulator for Soft
  Robotics
ChainQueen: A Real-Time Differentiable Physical Simulator for Soft Robotics
Yuanming Hu
Jiancheng Liu
Andrew Spielberg
J. Tenenbaum
William T. Freeman
Jiajun Wu
Daniela Rus
Wojciech Matusik
AI4CE
95
267
0
02 Oct 2018
Reinforcement Learning with Perturbed Rewards
Reinforcement Learning with Perturbed Rewards
Jingkang Wang
Yang Liu
Yue Liu
NoLa
93
130
0
02 Oct 2018
Bayesian Policy Optimization for Model Uncertainty
Bayesian Policy Optimization for Model Uncertainty
Gilwoo Lee
Brian Hou
Aditya Mandalika
Jeongseok Lee
Sanjiban Choudhury
S. Srinivasa
134
41
0
01 Oct 2018
Variational Discriminator Bottleneck: Improving Imitation Learning,
  Inverse RL, and GANs by Constraining Information Flow
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Xue Bin Peng
Angjoo Kanazawa
Sam Toyer
Pieter Abbeel
Sergey Levine
105
216
0
01 Oct 2018
Getting Robots Unfrozen and Unlost in Dense Pedestrian Crowds
Getting Robots Unfrozen and Unlost in Dense Pedestrian Crowds
Tingxiang Fan
Xinjing Cheng
Jia Pan
Pinxin Long
Wenxi Liu
Ruigang Yang
Tianyi Zhou
71
61
0
30 Sep 2018
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented
  Demonstrations using Directed Information
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
Arjun Sharma
Mohit Sharma
Nicholas Rhinehart
Kris Kitani
84
68
0
29 Sep 2018
Propagation Networks for Model-Based Control Under Partial Observation
Propagation Networks for Model-Based Control Under Partial Observation
Yunzhu Li
Jiajun Wu
Jun-Yan Zhu
J. Tenenbaum
Antonio Torralba
Russ Tedrake
AI4CE
67
137
0
28 Sep 2018
Using Deep Reinforcement Learning to Learn High-Level Policies on the
  ATRIAS Biped
Using Deep Reinforcement Learning to Learn High-Level Policies on the ATRIAS Biped
Tianyu Li
Akshara Rai
H. Geyer
C. Atkeson
80
51
0
28 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
108
30
0
27 Sep 2018
Previous
123...167168169170171172
Next